Search | arXiv e-print repository

Understanding and Mitigating Harmful Design in User-Generated Virtual Worlds

Authors: Zinan Zhang, Xinning Gui, Yubo Kou

Abstract: Virtual space offers innovative ways for individuals to engage with one another in a digital setting. Prominent virtual social platforms, such as Facebook Spaces, VR Chat, and AltspaceVR, facilitate social connections, allowing users to interact seamlessly. Additionally, certain video games, like Second Life and World of Warcraft, are set within these virtual spaces as well, providing immersive player experiences. As the popularity of virtual space grows, various companies have begun to democratize the process of creating these spaces, shifting the development from skilled professionals to hobbyist creators. Platforms like Minecraft, Roblox, and RecRoom enable users to create and publish their own virtual environments, hosting a wide range of interactions and narratives. This shift echoes the rise of user-generated content, where content creators create and publish content on platforms, such as social media platforms [6]. For example, YouTubers upload videos on YouTube and Reddit users post text-based content on Reddit. For a long time, user-generated content has predominantly contained text, videos, and images. However, with the emergence of virtual spaces, some platforms now allow creators to create and publish their own virtual spaces, leading to the emergence of user-generated virtual worlds.

Submitted 23 April, 2024; originally announced May 2024.

Comments: This is an accepted position statement of CHI 2024 Workshop (Novel Approaches for Understanding and Mitigating Emerging New Harms in Immersive and Embodied Virtual Spaces: A Workshop at CHI 2024)

arXiv:2404.19134 [pdf, other]

Evaluating Deep Clustering Algorithms on Non-Categorical 3D CAD Models

Authors: Siyuan Xiang, Chin Tseng, Congcong Wen, Deshana Desai, Yifeng Kou, Binil Starly, Daniele Panozzo, Chen Feng

Abstract: We introduce the first work on benchmarking and evaluating deep clustering algorithms on large-scale non-categorical 3D CAD models. We first propose a workflow to allow expert mechanical engineers to efficiently annotate 252,648 carefully sampled pairwise CAD model similarities, from a subset of the ABC dataset with 22,968 shapes. Using seven baseline deep clustering methods, we then investigate the fundamental challenges of evaluating clustering methods for non-categorical data. Based on these challenges, we propose a novel and viable ensemble-based clustering comparison approach. This work is the first to directly target the underexplored area of deep clustering algorithms for 3D shapes, and we believe it will be an important building block to analyze and utilize the massive 3D shape collections that are starting to appear in deep geometric computing.

Submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.12376 [pdf, other]

Matching the Statistical Query Lower Bound for k-sparse Parity Problems with Stochastic Gradient Descent

Authors: Yiwen Kou, Zixiang Chen, Quanquan Gu, Sham M. Kakade

Abstract: The $k$-parity problem is a classical problem in computational complexity and algorithmic theory, serving as a key benchmark for understanding computational classes. In this paper, we solve the $k$-parity problem with stochastic gradient descent (SGD) on two-layer fully-connected neural networks. We demonstrate that SGD can efficiently solve the $k$-sparse parity problem on a $d$-dimensional hypercube ($k\le O(\sqrt{d})$) with a sample complexity of $\tilde{O}(d^{k-1})$ using $2^{\Theta(k)}$ neurons, thus matching the established $\Omega(d^{k})$ lower bounds of Statistical Query (SQ) models. Our theoretical analysis begins by constructing a good neural network capable of correctly solving the $k$-parity problem. We then demonstrate how a trained neural network with SGD can effectively approximate this good network, solving the $k$-parity problem with small statistical errors. Our theoretical results and findings are supported by empirical evidence, showcasing the efficiency and efficacy of our approach.

Submitted 18 April, 2024; originally announced April 2024.

Comments: 36 pages, 7 figures, 3 tables

arXiv:2404.12314 [pdf, other]

Guided Discrete Diffusion for Electronic Health Record Generation

Authors: Jun Han, Zixiang Chen, Yongqian Li, Yiwen Kou, Eran Halperin, Robert E. Tillman, Quanquan Gu

Abstract: Electronic health records (EHRs) are a pivotal data source that enables numerous applications in computational medicine, e.g., disease progression prediction, clinical trial design, and health economics and outcomes research. Despite wide usability, their sensitive nature raises privacy and confidentiality concerns, which limit potential use cases. To tackle these challenges, we explore the use of generative models to synthesize artificial, yet realistic EHRs. While diffusion-based methods have recently demonstrated state-of-the-art performance in generating other data modalities and overcome the training instability and mode collapse issues that plague previous GAN-based approaches, their applications in EHR generation remain underexplored. The discrete nature of tabular medical code data in EHRs poses challenges for high-quality data generation, especially for continuous diffusion models. To this end, we introduce a novel tabular EHR generation method, EHR-D3PM, which enables both unconditional and conditional generation using the discrete diffusion model. Our experiments demonstrate that EHR-D3PM significantly outperforms existing generative baselines on comprehensive fidelity and utility metrics while maintaining lower attribute and membership vulnerability risks. Furthermore, we show EHR-D3PM is effective as a data augmentation method and enhances performance on downstream tasks when combined with real data.

Submitted 14 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Comments: 26 pages, 9 figures, 9 tables

arXiv:2404.12210 [pdf, other]

An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

Authors: Jin Gao, Shubo Lin, Shaoru Wang, Yutong Kou, Zeming Li, Liang Li, Congxuan Zhang, Xiaoqin Zhang, Yizheng Wang, Weiming Hu

Abstract: Masked image modeling (MIM) pre-training for large-scale vision transformers (ViTs) has enabled promising downstream performance on top of the learned self-supervised ViT features. In this paper, we question whether the fine-tuning performance of \textit{extremely simple} lightweight ViTs can also benefit from this pre-training paradigm, which has been studied considerably less than the well-established lightweight architecture design methodology. We use an observation-analysis-solution flow for our study. We first systematically observe different behaviors among the evaluated pre-training methods with respect to the downstream fine-tuning data scales. Furthermore, we analyze the layer representation similarities and attention maps across the obtained models, which clearly show the inferior learning of MIM pre-training on higher layers, leading to unsatisfactory transfer performance on data-insufficient downstream tasks. This finding naturally guides the design of our distillation strategies during pre-training to solve the above deterioration problem. Extensive experiments have demonstrated the effectiveness of our approach. Our pre-training with distillation on pure lightweight ViTs with vanilla/hierarchical design ($5.7M$/$6.5M$) can achieve $79.4\%$/$78.9\%$ top-1 accuracy on ImageNet-1K. It also enables SOTA performance on the ADE20K segmentation task ($42.8\%$ mIoU) and LaSOT tracking task ($66.1\%$ AUC) in the lightweight regime. The latter even surpasses all the current SOTA lightweight CPU-realtime trackers.

Submitted 25 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Comments: A submission to IJCV

arXiv:2404.11952 [pdf, other]

Generation of Ultrarelativistic Vortex Leptons with Large Orbital Angular Momenta

Authors: Mamutjan Ababekri, Jun-Lin Zhou, Ren-Tong Guo, Yong-Zheng Ren, Yu-Han Kou, Qian Zhao, Zhong-Peng Li, Jian-Xing Li

Abstract: Ultrarelativistic vortex leptons with intrinsic orbital angular momenta (OAM) have important applications in high energy particle physics, nuclear physics, astrophysics, etc. However, their generation still poses a great challenge. Here, we put forward a novel method for generating ultrarelativistic vortex positrons and electrons through nonlinear Breit-Wheeler (NBW) scattering of vortex $\gamma$ photons. For the first time, a complete angular momentum-resolved scattering theory has been formulated, introducing the angular momentum of laser photons and vortex particles into the conventional NBW scattering framework. We find that a vortex positron (electron) can be produced when the outgoing electron (positron) is generated along the collision axis. By unveiling the angular momentum transfer mechanism, we clarify that the OAM of the $\gamma$ photon and the angular momenta of multiple laser photons are entirely transferred to the generated pairs, leading to the production of ultrarelativistic vortex positrons or electrons with large OAM. Furthermore, we find that the cone opening angle and superposition state of the vortex $\gamma$ photon, distinct characteristics aside from its intrinsic OAM, can be determined via the angular distribution of created pairs in NBW processes. Our method paves the way for investigating strong-field quantum electrodynamics processes concerning the generation and detection of vortex particle beams in intense lasers.

Submitted 24 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Comments: 5

arXiv:2403.19436 [pdf, other]

"At the end of the day, I am accountable": Gig Workers' Self-Tracking for Multi-Dimensional Accountability Management

Authors: Rie Helene Hernandez, Qiurong Song, Yubo Kou, Xinning Gui

Abstract: Tracking is inherent in and central to the gig economy. Platforms track gig workers' performance through metrics such as acceptance rate and punctuality, while gig workers themselves engage in self-tracking. Although prior research has extensively examined how gig platforms track workers through metrics -- with some studies briefly acknowledging the phenomenon of self-tracking among workers -- there is a dearth of studies that explore how and why gig workers track themselves. To address this, we conducted 25 semi-structured interviews, revealing how gig workers self-track to manage accountabilities to themselves and external entities across three identities: the holistic self, the entrepreneurial self, and the platformized self. We connect our findings to neoliberalism, through which we contextualize gig workers' self-accountability and the invisible labor of self-tracking. We further discuss how self-tracking mitigates information and power asymmetries in gig work and offer design implications to support gig workers' multi-dimensional self-tracking.

Submitted 28 March, 2024; originally announced March 2024.

Comments: Accepted to CHI 2024

arXiv:2312.09193 [pdf, other]

Fast Sampling via Discrete Non-Markov Diffusion Models

Authors: Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu

Abstract: Discrete diffusion models have emerged as powerful tools for high-quality data generation. Despite their success in discrete spaces, such as text generation tasks, the acceleration of discrete diffusion models remains underexplored. In this paper, we propose a discrete non-Markov diffusion model, which admits an accelerated reverse sampling for discrete data generation. Our method significantly reduces the number of function evaluations (i.e., calls to the neural network), making the sampling process much faster. Furthermore, we study the transition from finite to infinite step sampling, offering new insights into bridging the gap between discrete and continuous-time processes for discrete diffusion models. Extensive experiments on natural language generation and machine translation tasks demonstrate the superior performance of our method in terms of both generation speed and sample quality compared to existing methods for discrete diffusion models.

Submitted 27 June, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

Comments: 33 pages, 5 figures, 12 tables

arXiv:2310.18935 [pdf, other]

Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data

Authors: Yiwen Kou, Zixiang Chen, Quanquan Gu

Abstract: The implicit bias towards solutions with favorable properties is believed to be a key reason why neural networks trained by gradient-based optimization can generalize well. While the implicit bias of gradient flow has been widely studied for homogeneous neural networks (including ReLU and leaky ReLU networks), the implicit bias of gradient descent is currently only understood for smooth neural networks. Therefore, implicit bias in non-smooth neural networks trained by gradient descent remains an open question. In this paper, we aim to answer this question by studying the implicit bias of gradient descent for training two-layer fully connected (leaky) ReLU neural networks. We show that when the training data are nearly orthogonal, for the leaky ReLU activation function, gradient descent will find a network with a stable rank that converges to $1$, whereas for the ReLU activation function, gradient descent will find a neural network with a stable rank that is upper bounded by a constant. Additionally, we show that gradient descent will find a neural network such that all the training data points have the same normalized margin asymptotically. Experiments on both synthetic and real data back up our theoretical findings.

Submitted 29 October, 2023; originally announced October 2023.

Comments: 55 pages, 7 figures. In NeurIPS 2023

arXiv:2310.10071 [pdf, other]

ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking

Authors: Yutong Kou, Jin Gao, Bing Li, Gang Wang, Weiming Hu, Yizheng Wang, Liang Li

Abstract: Recently, the transformer has enabled speed-oriented trackers to approach state-of-the-art (SOTA) performance at high speed thanks to a smaller input size or a lighter feature extraction backbone, though they still substantially lag behind their corresponding performance-oriented versions. In this paper, we demonstrate that it is possible to narrow or even close this gap while achieving high tracking speed based on the smaller input size. To this end, we non-uniformly resize the cropped image to have a smaller input size while the resolution of the area where the target is more likely to appear is higher and vice versa. This enables us to solve the dilemma of attending to a larger visual field while retaining more raw information for the target despite a smaller input size. Our formulation for the non-uniform resizing can be efficiently solved through quadratic programming (QP) and naturally integrated into most of the crop-based local trackers. Comprehensive experiments on five challenging datasets based on two kinds of transformer trackers, i.e., OSTrack and TransT, demonstrate consistent improvements over them. In particular, applying our method to the speed-oriented version of OSTrack even outperforms its performance-oriented counterpart by 0.6% AUC on TNL2K, while running 50% faster and saving over 55% MACs. Codes and models are available at https://github.com/Kou-99/ZoomTrack.

Submitted 16 October, 2023; originally announced October 2023.

Comments: 19 pages, 7 figures, Accepted by NeurIPS 2023 as a Spotlight

arXiv:2310.07269 [pdf, other]

Why Does Sharpness-Aware Minimization Generalize Better Than SGD?

Authors: Zixiang Chen, Junkai Zhang, Yiwen Kou, Xiangning Chen, Cho-Jui Hsieh, Quanquan Gu

Abstract: The challenge of overfitting, in which the model memorizes the training data and fails to generalize to test data, has become increasingly significant in the training of large neural networks. To tackle this challenge, Sharpness-Aware Minimization (SAM) has emerged as a promising training method, which can improve the generalization of neural networks even in the presence of label noise. However, a deep understanding of how SAM works, especially in the setting of nonlinear neural networks and classification tasks, remains largely missing. This paper fills this gap by demonstrating why SAM generalizes better than Stochastic Gradient Descent (SGD) for a certain data model and two-layer convolutional ReLU networks. The loss landscape of our studied problem is nonsmooth, thus current explanations for the success of SAM based on the Hessian information are insufficient. Our result explains the benefits of SAM, particularly its ability to prevent noise learning in the early stages, thereby facilitating more effective learning of features. Experiments on both synthetic and real data corroborate our theory.

Submitted 11 October, 2023; originally announced October 2023.

Comments: 52 pages, 4 figures, 2 tables. In NeurIPS 2023

arXiv:2304.09010 [pdf, other]

Causal Flow-based Variational Auto-Encoder for Disentangled Causal Representation Learning

Authors: Di Fan, Yannian Kou, Chuanhou Gao

Abstract: Disentangled representation learning aims to learn low-dimensional representations of data, where each dimension corresponds to an underlying generative factor. Currently, Variational Auto-Encoders (VAEs) are widely used for disentangled representation learning, with the majority of methods assuming independence among generative factors. However, in real-world scenarios, generative factors typically exhibit complex causal relationships. We thus design a new VAE-based framework named Disentangled Causal Variational Auto-Encoder (DCVAE), which includes a variant of autoregressive flows known as causal flows, capable of learning effective causal disentangled representations. We provide a theoretical analysis of the disentanglement identifiability of DCVAE, ensuring that our model can effectively learn causal disentangled representations. The performance of DCVAE is evaluated on both synthetic and real-world datasets, demonstrating its outstanding capability in achieving causal disentanglement and performing intervention experiments. Moreover, DCVAE exhibits remarkable performance on downstream tasks and has the potential to learn the true causal structure among factors.

Submitted 8 May, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

Comments: 20 pages, 14 figures

arXiv:2303.04145 [pdf, other]

Benign Overfitting for Two-layer ReLU Convolutional Neural Networks

Authors: Yiwen Kou, Zixiang Chen, Yuanzhou Chen, Quanquan Gu

Abstract: Modern deep learning models with great expressive power can be trained to overfit the training data but still generalize well. This phenomenon is referred to as \textit{benign overfitting}. Recently, a few studies have attempted to theoretically understand benign overfitting in neural networks. However, these works are either limited to neural networks with smooth activation functions or to the neural tangent kernel regime. How and when benign overfitting can occur in ReLU neural networks remains an open problem. In this work, we seek to answer this question by establishing algorithm-dependent risk bounds for learning two-layer ReLU convolutional neural networks with label-flipping noise. We show that, under mild conditions, the neural network trained by gradient descent can achieve near-zero training loss and Bayes optimal test risk. Our result also reveals a sharp transition between benign and harmful overfitting under different conditions on data distribution in terms of test risk. Experiments on synthetic data back up our theory.

Submitted 3 November, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: 45 pages, 3 figures, 2 tables. In ICML 2023

arXiv:2212.08318 [pdf, other]

doi 10.1038/s41467-022-35377-0

Microwave Imaging of Quasi-periodic Pulsations at Flare Current Sheet

Authors: Yuankun Kou, Xin Cheng, Yulei Wang, Sijie Yu, Bin Chen, Eduard P. Kontar, Mingde Ding

Abstract: Quasi-periodic pulsations (QPPs) are frequently detected in solar and stellar flares, but the underlying physical mechanisms are still to be ascertained. Here, we show microwave QPPs during a solar flare originating from quasi-periodic magnetic reconnection at the flare current sheet. They appear as two vertically detached but closely related sources with the brighter ones located at flare loops and the weaker ones along the stretched current sheet. Although the brightness temperatures of the two microwave sources differ greatly, they vary in phase with periods of about 10--20 s and 30--60 s. The gyrosynchrotron-dominated microwave spectra also present a quasi-periodic soft-hard-soft evolution. These results suggest that relevant high-energy electrons are accelerated by quasi-periodic reconnection, likely arising from the modulation of magnetic islands within the current sheet as validated by a 2.5-dimensional magnetohydrodynamic simulation.

Submitted 16 December, 2022; originally announced December 2022.

Journal ref: Nature Communications (2022) 13:7680

arXiv:2112.14016 [pdf, other]

doi 10.1109/TPAMI.2022.3156977

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking

Authors: Jin Gao, Yan Lu, Xiaojuan Qi, Yutong Kou, Bing Li, Liang Li, Shan Yu, Weiming Hu

Abstract: Tracking visual objects from a single initial exemplar in the testing phase has been broadly cast as a one-/few-shot problem, i.e., one-shot learning for initial adaptation and few-shot learning for online adaptation. The recent few-shot online adaptation methods incorporate the prior knowledge from large amounts of annotated training data via complex meta-learning optimization in the offline phase. This helps the online deep trackers to achieve fast adaptation and reduce overfitting risk in tracking. In this paper, we propose a simple yet effective recursive least-squares estimator-aided online learning approach for few-shot online adaptation without requiring offline training. It allows an in-built memory retention mechanism for the model to remember the knowledge about the object seen before, and thus the seen data can be safely removed from training. This also bears certain similarities to the emerging continual learning field in preventing catastrophic forgetting. This mechanism enables us to unveil the power of modern online deep trackers without incurring too much extra computational cost. We evaluate our approach based on two networks in the online learning families for tracking, i.e., multi-layer perceptrons in RT-MDNet and convolutional neural networks in DiMP. The consistent improvements on several challenging tracking benchmarks demonstrate its effectiveness and efficiency.

Submitted 10 March, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

Comments: Accepted by TPAMI. Extended version of the RLS-RTMDNet tracker (CVPR2020)

arXiv:2101.04794 [pdf]

doi 10.13140/RG.2.2.33097.98403

The Medical Authority of AI: A Study of AI-enabled Consumer-facing Health Technology

Authors: Yue You, Yubo Kou, Xianghua Ding, Xinning Gui

Abstract: Recently, consumer-facing health technologies such as Artificial Intelligence (AI)-based symptom checkers (AISCs) have sprung up in everyday healthcare practice. AISCs solicit symptom information from users and provide medical suggestions and possible diagnoses, a responsibility that people usually entrust to real-person authorities such as physicians and expert patients. Thus, the advent of AISCs raises the question of whether and how they transform the notion of medical authority in everyday healthcare practice. To answer this question, we conducted an interview study with thirty AISC users. We found that users assess the medical authority of AISCs using various factors including automated decisions and interaction design patterns of AISC apps, associations with established medical authorities like hospitals, and comparisons with other health technologies. We reveal how AISCs are used in healthcare delivery, discuss how AI transforms conventional understandings of medical authority, and derive implications for designing AI-enabled health technology.

Submitted 12 January, 2021; originally announced January 2021.

arXiv:2101.02338 [pdf, other]

Max-Affine Spline Insights Into Deep Network Pruning

Authors: Haoran You, Randall Balestriero, Zhihan Lu, Yutong Kou, Huihong Shi, Shunyao Zhang, Shang Wu, Yingyan Lin, Richard Baraniuk

Abstract: In this paper, we study the importance of pruning in Deep Networks (DNs) and the yin & yang relationship between (1) pruning highly overparametrized DNs that have been trained from random initialization and (2) training small DNs that have been "cleverly" initialized. As in most cases practitioners can only resort to random initialization, there is a strong need to develop a grounded understanding of DN pruning. Current literature remains largely empirical, lacking a theoretical understanding of how pruning affects DNs' decision boundary, how to interpret pruning, and how to design corresponding principled pruning techniques. To tackle those questions, we propose to employ recent advances in the theoretical analysis of Continuous Piecewise Affine (CPA) DNs. From this perspective, we will be able to detect the early-bird (EB) ticket phenomenon, provide interpretability into current pruning techniques, and develop a principled pruning strategy. In each step of our study, we conduct extensive experiments supporting our claims and results; while our main goal is to enhance the current understanding of DN pruning instead of developing a new pruning method, our spline pruning criteria for layerwise and global pruning are on par with or even outperform state-of-the-art pruning methods.

Submitted 18 August, 2022; v1 submitted 6 January, 2021; originally announced January 2021.

Comments: Accepted by TMLR

arXiv:2008.08202 [pdf]

doi 10.1145/3415173

Mediating Community-AI Interaction through Situated Explanation: The Case of AI-Led Moderation

Authors: Yubo Kou, Xinning Gui

Abstract: Artificial intelligence (AI) has become prevalent in our everyday technologies and impacts both individuals and communities. The explainable AI (XAI) scholarship has explored the philosophical nature of explanation and technical explanations, which are usually driven by experts in lab settings and can be challenging for laypersons to understand. In addition, existing XAI research tends to focus on the individual level. Little is known about how people understand and explain AI-led decisions in the community context. Drawing from XAI and activity theory, a foundational HCI theory, we theorize how explanation is situated in a community's shared values, norms, knowledge, and practices, and how situated explanation mediates community-AI interaction. We then present a case study of AI-led moderation, where community members collectively develop explanations of AI-led decisions, most of which are automated punishments. Lastly, we discuss the implications of this framework at the intersection of CSCW, HCI, and XAI.

Submitted 18 August, 2020; originally announced August 2020.

Journal ref: PACMHCI, Vol 4, No. CSCW2, Article 102 (October 2020). 27 pages

arXiv:2007.03852 [pdf, ps, other]

doi 10.3847/2041-8213/aba362

What determine Solar Flares Producing Interplanetary Type III Radio Bursts?

Authors: Y. K. Kou, Z. C. Jing, X. Cheng, W. Q. Pan, Y. Liu, C. Li, M. D. Ding

Abstract: Energetic electrons accelerated by solar flares often give rise to type III radio bursts at a broad waveband and even interplanetary type III bursts (IT3) if the wavelength extends to decameter-kilometer. In this Letter, we investigate the probability of the flares that produce IT3, based on the sample of 2272 flares above M-class observed from 1996 to 2016. It is found that only 49.6% of the flares are detected to be accompanied with IT3. The duration, peak flux, and fluence of the flares with and without IT3 both present power-law distributions in the frequency domain, but the corresponding spectral indices for the former (2.06$\pm$0.17, 2.04$\pm$0.18, and 1.55$\pm$0.09) are obviously smaller than that for the latter (2.82$\pm$0.22, 2.51$\pm$0.19, and 2.40$\pm$0.09), showing that the flares with IT3 have longer durations and higher peak fluxes. We further examine the relevance of coronal mass ejections (CMEs) to the two groups of flares. It is found that 58% (655 of 1127) of the flares with IT3 but only 19% (200 of 1078) of the flares without IT3 are associated with CMEs, and that the associated CMEs for the flares with IT3 are inclined to be wider and faster. This indicates that CMEs may also play a role in producing IT3, speculatively facilitating the escape of accelerated electrons from the low corona to the interplanetary space.

Submitted 7 July, 2020; originally announced July 2020.

Comments: accepted for publication in ApJ Letters, 5 figures and one table

arXiv:1607.04827 [pdf]

Forced acoustical response of an open cavity coupled with a semi-infinite space

Authors: Yuhui Tong, Yiwei Kou, Jie Pan

Abstract: This paper presents a study of the forced acoustical response of an open cavity from the perspective of modal expansion. Based on the coupled mode theory, it is shown that the sound pressure distribution of an open cavity excited by a point source placed within the cavity can be expanded by a set of frequency-dependent eigenmodes, which are derived from the coupling between the cavity and a semi-infinite space. The calculation of the acoustical responses for baffled and unbaffled open cavities indicates that the proposed modal expansion converges with only a few frequency-dependent eigenmodes in the frequency range of interest. The results of this study eliminate the ambiguity involving the selection of appropriate basis functions, in modal expansion for the forced response problem in open cavities.

Submitted 17 July, 2016; originally announced July 2016.

Comments: 18 pages, 12 figures, 2 tables

arXiv:1302.5764 [pdf]

doi 10.1364/OL.38.001271

Multiband Vector Plasmonic Lattice Solitons

Authors: Yao Kou, Fangwei Ye, Xianfeng Chen

Abstract: We predict multiband vector Plasmonic Lattice Solitons (PLSs) in metal-dielectric waveguide arrays, in both focusing and defocusing nonlinearities. Such vector solitons consist of two components originating from different transmission bands. By simulating the full nonlinear Maxwell equations (MEs), we demonstrate the diffractionless propagation of vector PLSs and their discrete diffraction when only one component is present. Their subwavelength size characteristics and the influences of metallic losses are also studied.

Submitted 6 March, 2013; v1 submitted 23 February, 2013; originally announced February 2013.

Comments: 4 pages, 4 figures, Version2.0

Journal ref: Opt. Lett. 38, 1271(2013)

arXiv:1208.2310 [pdf]

doi 10.1364/OL.38.000253

Surface Plasmonic Lattice Solitons

Authors: Yao Kou, Fangwei Ye, Xianfeng Chen

Abstract: We reveal the existence of the surface plasmonic lattice solitons (surface PLSs) at the boundary of a semi-infinite metallic-dielectric periodic nano-structure. We find that the truncation of the periodic structure imposes a threshold power for the existence of surface PLSs, and enhances significantly the modal localization. The propagation and excitation of surface PLSs as well as their potential application in the all-optical subwavelength switching are also demonstrated.

Submitted 11 August, 2012; originally announced August 2012.

Comments: 4 pages, 4 figures, to appear in Optics Letters

Journal ref: Optics Letters 37, 3822(2012)
