Showing 1–50 of 136 results for author: Wang, Z J

  1. arXiv:2407.01972  [pdf, other]

    cs.IR cs.AI cs.HC cs.LG

    MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation

    Authors: Zijie J. Wang, Duen Horng Chau

    Abstract: Retrieval-augmented text generation (RAG) addresses the common limitations of large language models (LLMs), such as hallucination, by retrieving information from an updatable external knowledge base. However, existing approaches often require dedicated backend servers for data storage and retrieval, thereby limiting their applicability in use cases that require strict data privacy, such as persona…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to SIGIR 2024. 6 pages, 2 figures. For a live demo, visit https://poloclub.github.io/mememo/. Code is open-source at https://github.com/poloclub/mememo

  2. arXiv:2406.18517  [pdf, other]

    quant-ph

    Generalized Concentratable Entanglement via Parallelized Permutation Tests

    Authors: Xiaoyu Liu, Johannes Knörzer, Zherui Jerry Wang, Jordi Tura

    Abstract: Multipartite entanglement is an essential resource for quantum information theory and technologies, but its quantification has been a persistent challenge. Recently, Concentratable Entanglement (CE) has been introduced as a promising candidate for a multipartite entanglement measure, which can be efficiently estimated across two state copies. In this work, we introduce Generalized Concentratable E…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 5 pages + 17 pages appendix; 4 + 2 figures; 1 table

  3. arXiv:2405.03546  [pdf, other]

    cs.CV cs.LG

    CCDM: Continuous Conditional Diffusion Models for Image Generation

    Authors: Xin Ding, Yongwei Wang, Kao Zhang, Z. Jane Wang

    Abstract: Continuous Conditional Generative Modeling (CCGM) aims to estimate the distribution of high-dimensional data, typically images, conditioned on scalar continuous variables known as regression labels. While Continuous conditional Generative Adversarial Networks (CcGANs) were initially designed for this task, their adversarial training mechanism remains vulnerable to extremely sparse or imbalanced da…

    Submitted 6 May, 2024; originally announced May 2024.

  4. Data Format Standardization and DICOM Integration for Hyperpolarized 13C MRI

    Authors: Ernesto Diaz, Renuka Sriram, Jeremy W. Gordon, Avantika Sinha, Xiaoxi Liu, Sule Sahin, Jason Crane, Marram P Olson, Hsin-Yu Chen, Jenna Bernard, Daniel B. Vigneron, Zhen Jane Wang, Duan Xu, Peder E. Z. Larson

    Abstract: Hyperpolarized (HP) 13C MRI has shown promise as a valuable modality for in vivo measurements of metabolism and is currently in human trials at 15 research sites worldwide. With this growth, it is important to adopt standardized data storage practices, as doing so will allow sites to meaningfully compare data. In this paper we (1) describe data that we believe should be stored and (2) demonstrate pipeli…

    Submitted 5 May, 2024; originally announced May 2024.

  5. arXiv:2404.16069  [pdf, other]

    cs.HC cs.AI

    Interactive Visual Learning for Stable Diffusion

    Authors: Seongmin Lee, Benjamin Hoover, Hendrik Strobelt, Zijie J. Wang, ShengYun Peng, Austin Wright, Kevin Li, Haekyu Park, Haoyang Yang, Polo Chau

    Abstract: Diffusion-based generative models' impressive ability to create convincing images has garnered global attention. However, their complex internal structures and operations often pose challenges for non-experts to grasp. We introduce Diffusion Explainer, the first interactive visualization tool designed to elucidate how Stable Diffusion transforms text prompts into images. It tightly integrates a vi…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 4 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2305.03509

  6. arXiv:2404.01361  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG

    LLM Attributor: Interactive Visual Attribution for LLM Generation

    Authors: Seongmin Lee, Zijie J. Wang, Aishwarya Chakravarthy, Alec Helbling, ShengYun Peng, Mansi Phute, Duen Horng Chau, Minsuk Kahng

    Abstract: While large language models (LLMs) have shown remarkable capability to generate convincing text across diverse domains, concerns around their potential risks have highlighted the importance of understanding the rationale behind text generation. We present LLM Attributor, a Python library that provides interactive visualizations for training data attribution of an LLM's text generation. Our library o…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, For a video demo, see https://youtu.be/mIG2MDQKQxM

  7. arXiv:2403.19754  [pdf, other]

    cs.CL

    GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation

    Authors: Mohsen Gholami, Mohammad Akbari, Cindy Hu, Vaden Masrani, Z. Jane Wang, Yong Zhang

    Abstract: Knowledge distillation from LLMs is essential for the efficient deployment of language models. Prior works have proposed data generation using LLMs for preparing distilled models. We argue that generating data with LLMs is prone to sampling mainly from the center of original content distribution. This limitation hinders the distilled model from learning the true underlying data distribution and to…

    Submitted 28 March, 2024; originally announced March 2024.

  8. arXiv:2402.15350  [pdf, other]

    cs.HC cs.AI cs.CY cs.LG

    Farsight: Fostering Responsible AI Awareness During AI Application Prototyping

    Authors: Zijie J. Wang, Chinmay Kulkarni, Lauren Wilcox, Michael Terry, Michael Madaio

    Abstract: Prompt-based interfaces for Large Language Models (LLMs) have made prototyping and building AI-powered applications easier than ever before. However, identifying potential harms that may arise from AI applications remains a challenge, particularly during prompt-based prototyping. To address this, we present Farsight, a novel in situ interactive tool that helps people identify potential harms from…

    Submitted 2 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted to CHI 2024 (Best Paper, Honorable Mention). 40 pages, 19 figures, 5 tables. For a demo video, see https://youtu.be/BlSFbGkOlHk. For a live demo, visit https://PAIR-code.github.io/farsight. The source code is available at https://github.com/PAIR-code/farsight

  9. arXiv:2402.15051  [pdf]

    physics.plasm-ph

    Prediction of Fishbone Linear Instability in Tokamaks with Machine Learning Methods

    Authors: Z. Y. Liu, H. R. Qiu, G. Y. Fu, Y. Xiao, Y. C. Chen, Z. J. Wang, Y. X. Wei

    Abstract: A machine learning based surrogate model for fishbone linear instability in tokamaks is constructed. Hybrid simulations with the kinetic-magnetohydrodynamic (MHD) code M3D-K are used to generate the database of fishbone linear instability, through scanning the four key parameters which are thought to determine the fishbone physics. The four key parameters include (1) central total beta of both ther…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 28 pages, 19 figures

  10. arXiv:2401.14447  [pdf, other]

    cs.HC cs.AI cs.CL cs.LG

    Wordflow: Social Prompt Engineering for Large Language Models

    Authors: Zijie J. Wang, Aishwarya Chakravarthy, David Munechika, Duen Horng Chau

    Abstract: Large language models (LLMs) require well-crafted prompts for effective use. Prompt engineering, the process of designing prompts, is challenging, particularly for non-experts who are less familiar with AI technologies. While researchers have proposed techniques and tools to assist LLM users in prompt design, these works primarily target AI application developers rather than non-experts. To addres…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 8 pages, 7 figures. Wordflow is available at: https://poloclub.github.io/wordflow. The code is available at: https://github.com/poloclub/wordflow/. For a demo video, see: https://youtu.be/3dOcVuofGVo

  11. arXiv:2401.10029  [pdf]

    cs.CE q-bio.TO

    Cardiac Digital Twin Pipeline for Virtual Therapy Evaluation

    Authors: Julia Camps, Zhinuo Jenny Wang, Ruben Doste, Maxx Holmes, Brodie Lawson, Jakub Tomek, Kevin Burrage, Alfonso Bueno-Orovio, Blanca Rodriguez

    Abstract: Cardiac digital twins are computational tools capturing key functional and anatomical characteristics of patient hearts for investigating disease phenotypes and predicting responses to therapy. When paired with large-scale computational resources and large clinical datasets, digital twin technology can enable virtual clinical trials on virtual cohorts to fast-track therapy development. Here, we pr…

    Submitted 18 January, 2024; originally announced January 2024.

  12. arXiv:2312.14915  [pdf, other]

    cs.CV

    PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF

    Authors: Mohsen Gholami, Rabab Ward, Z. Jane Wang

    Abstract: This paper proposes an end-to-end framework for generating 3D human pose datasets using Neural Radiance Fields (NeRF). Public datasets generally have limited diversity in terms of human poses and camera viewpoints, largely due to the resource-intensive nature of collecting 3D human pose data. As a result, pose estimators trained on public datasets significantly underperform when applied to unseen…

    Submitted 22 December, 2023; originally announced December 2023.

  13. arXiv:2311.13196  [pdf, other]

    cs.IT eess.SP stat.ME

    Optimal Time of Arrival Estimation for MIMO Backscatter Channels

    Authors: Chen He, Luyang Han, Z. Jane Wang

    Abstract: In this paper, we propose a novel time of arrival (TOA) estimator for multiple-input-multiple-output (MIMO) backscatter channels in closed form. The proposed estimator refines the estimation precision from the topological structure of the MIMO backscatter channels, and can considerably enhance the estimation accuracy. Particularly, we show that for the general $M \times N$ bistatic topology, the m…

    Submitted 22 November, 2023; originally announced November 2023.

  14. arXiv:2310.12347  [pdf, other]

    cs.HC

    VisGrader: Automatic Grading of D3 Visualizations

    Authors: Matthew Hull, Vivian Pednekar, Hannah Murray, Nimisha Roy, Emmanuel Tung, Susanta Routray, Connor Guerin, Justin Chen, Zijie J. Wang, Seongmin Lee, Mahdi Roozbahani, Duen Horng Chau

    Abstract: Manually grading D3 data visualizations is a challenging endeavor, and is especially difficult for large classes with hundreds of students. Grading an interactive visualization requires a combination of interactive, quantitative, and qualitative evaluations that are conventionally done manually and are difficult to scale up as the visualization complexity, data size, and number of students increase…

    Submitted 19 October, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  15. arXiv:2310.12243  [pdf, other]

    cs.LG cs.CV

    REVAMP: Automated Simulations of Adversarial Attacks on Arbitrary Objects in Realistic Scenes

    Authors: Matthew Hull, Zijie J. Wang, Duen Horng Chau

    Abstract: Deep Learning models, such as those used in an autonomous vehicle, are vulnerable to adversarial attacks where an attacker could place an adversarial object in the environment, leading to mis-classification. Generating these adversarial objects in the digital space has been extensively studied; however, successfully transferring these attacks from the digital realm to the physical realm has proven c…

    Submitted 18 October, 2023; originally announced October 2023.

  16. arXiv:2310.05123  [pdf, other]

    cs.AI

    Distribution-Based Trajectory Clustering

    Authors: Zi Jing Wang, Ye Zhu, Kai Ming Ting

    Abstract: Trajectory clustering enables the discovery of common patterns in trajectory data. Current methods of trajectory clustering rely on a distance measure between two points in order to measure the dissimilarity between two trajectories. The distance measures employed have two challenges: high computational cost and low fidelity. Independent of the distance measure employed, existing clustering algori…

    Submitted 30 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  17. arXiv:2306.13740  [pdf]

    physics.med-ph q-bio.TO

    Digital Twinning of the Human Ventricular Activation Sequence to Clinical 12-lead ECGs and Magnetic Resonance Imaging Using Realistic Purkinje Networks for in Silico Clinical Trials

    Authors: Julia Camps, Lucas Arantes Berg, Zhinuo Jenny Wang, Rafael Sebastian, Leto Luana Riebel, Ruben Doste, Xin Zhou, Rafael Sachetto, James Coleman, Brodie Lawson, Vicente Grau, Kevin Burrage, Alfonso Bueno-Orovio, Rodrigo Weber, Blanca Rodriguez

    Abstract: Cardiac in silico clinical trials can virtually assess the safety and efficacy of therapies using human-based modelling and simulation. These technologies can provide mechanistic explanations for clinically observed pathological behaviour. Designing virtual cohorts for in silico trials requires exploiting clinical data to capture the physiological variability in the human population. The clinical…

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: Paper under revision

  18. arXiv:2306.09328  [pdf, other]

    cs.LG cs.CL cs.CV cs.HC

    WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings

    Authors: Zijie J. Wang, Fred Hohman, Duen Horng Chau

    Abstract: Machine learning models often learn latent embedding representations that capture the domain semantics of their training data. These embedding representations are valuable for interpreting trained models, building new models, and analyzing new datasets. However, interpreting and using embeddings can be challenging due to their opaqueness, high dimensionality, and the large size of modern datasets.…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 8 pages, 8 figures, Accepted to ACL 2023. For a demo video, see https://youtu.be/8fJG87QVceQ. For a live demo, see https://poloclub.github.io/wizmap. Code is available at https://github.com/poloclub/wizmap

  19. arXiv:2306.06405  [pdf, other]

    stat.CO

    Effects of 3D Position Fluctuations on Air-to-Ground mmWave UAV Communications

    Authors: Cunyan Ma, Xiaoya Li, Chen He, Jinye Peng, Z. Jane Wang

    Abstract: Millimeter wave (mmWave)-based unmanned aerial vehicle (UAV) communication is a promising candidate for future communications due to its flexibility and sufficient bandwidth. However, random fluctuations in the position of hovering UAVs will lead to random variations in the blockage and signal-to-noise ratio (SNR) of the UAV-user link, thus affecting the quality of service (QoS) of the system. To…

    Submitted 10 June, 2023; originally announced June 2023.

  20. arXiv:2305.03509  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG

    Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion

    Authors: Seongmin Lee, Benjamin Hoover, Hendrik Strobelt, Zijie J. Wang, ShengYun Peng, Austin Wright, Kevin Li, Haekyu Park, Haoyang Yang, Duen Horng Chau

    Abstract: Diffusion-based generative models' impressive ability to create convincing images has captured global attention. However, their complex internal structures and operations often make them difficult for non-experts to understand. We present Diffusion Explainer, the first interactive visualization tool that explains how Stable Diffusion transforms text prompts into images. Diffusion Explainer tightly…

    Submitted 8 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: 5 pages, 5 figures

  21. SuperNOVA: Design Strategies and Opportunities for Interactive Visualization in Computational Notebooks

    Authors: Zijie J. Wang, David Munechika, Seongmin Lee, Duen Horng Chau

    Abstract: Computational notebooks, such as Jupyter Notebook, have become data scientists' de facto programming environments. Many visualization researchers and practitioners have developed interactive visualization tools that support notebooks, yet little is known about the appropriate design of these tools. To address this critical research gap, we investigate the design strategies in this space by analyzi…

    Submitted 28 March, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted at CHI 2024 (Late-Breaking Work). 17 pages, 11 figures, 1 table. SuperNOVA is available at: http://poloclub.github.io/supernova/. The code is available at: https://github.com/poloclub/supernova

  22. arXiv:2304.05967  [pdf, other]

    cs.HC cs.AI cs.CL cs.LG

    Angler: Helping Machine Translation Practitioners Prioritize Model Improvements

    Authors: Samantha Robertson, Zijie J. Wang, Dominik Moritz, Mary Beth Kery, Fred Hohman

    Abstract: Machine learning (ML) models can fail in unexpected ways in the real world, but not all model failures are equal. With finite time and resources, ML practitioners are forced to prioritize their model debugging and improvement efforts. Through interviews with 13 ML practitioners at Apple, we found that practitioners construct small targeted test sets to estimate an error's nature, scope, and impact…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to CHI 2023. 20 pages, 6 figures

  23. Queer In AI: A Case Study in Community-Led Participatory AI

    Authors: Organizers Of QueerInAI: Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubička, Hang Yuan, Hetvi J, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, A Pranav, et al. (26 additional authors not shown)

    Abstract: We present Queer in AI as a case study for community-led participatory design in AI. We examine how participatory design and intersectional tenets started and shaped this community's programs over the years. We discuss different challenges that emerged in the process, look at ways this organization has fallen short of operationalizing participatory and intersectional principles, and then assess th…

    Submitted 8 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: To appear at FAccT 2023

    Journal ref: 2023 ACM Conference on Fairness, Accountability, and Transparency

  24. arXiv:2303.09545  [pdf, other]

    cs.LG cs.AI cs.HC

    WebSHAP: Towards Explaining Any Machine Learning Models Anywhere

    Authors: Zijie J. Wang, Duen Horng Chau

    Abstract: As machine learning (ML) is increasingly integrated into our everyday Web experience, there is a call for transparent and explainable web-based ML. However, existing explainability techniques often require dedicated backend servers, which limit their usefulness as the Web community moves toward in-browser ML for lower latency and greater privacy. To address the pressing need for a client-side expl…

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 5 pages, 4 figures. Accepted at the ACM Web Conference 2023 (WWW 2023). For a live demo, visit https://poloclub.github.io/webshap/. Code is open-source at https://github.com/poloclub/webshap

  25. arXiv:2302.14165  [pdf, other]

    cs.LG cs.AI cs.HC

    GAM Coach: Towards Interactive and User-centered Algorithmic Recourse

    Authors: Zijie J. Wang, Jennifer Wortman Vaughan, Rich Caruana, Duen Horng Chau

    Abstract: Machine learning (ML) recourse techniques are increasingly used in high-stakes domains, providing end users with actions to alter ML predictions, but they assume ML developers understand what input variables can be changed. However, a recourse plan's actionability is subjective and unlikely to match developers' expectations completely. We present GAM Coach, a novel open-source system that adapts i…

    Submitted 28 February, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted to CHI 2023. 20 pages, 12 figures. For a demo video, see https://youtu.be/ubacP34H9XE. For a live demo, visit https://poloclub.github.io/gam-coach/

  26. arXiv:2212.04029  [pdf, other]

    cs.CV

    Occlusion-Robust FAU Recognition by Mining Latent Space of Masked Autoencoders

    Authors: Minyang Jiang, Yongwei Wang, Martin J. McKeown, Z. Jane Wang

    Abstract: Facial action units (FAUs) are critical for fine-grained facial expression analysis. Although FAU detection has been actively studied using ideally high-quality images, it has not been thoroughly studied under heavily occluded conditions. In this paper, we propose the first occlusion-robust FAU recognition method to maintain FAU detection performance under heavy occlusions. Our novel approach takes adv…

    Submitted 7 December, 2022; originally announced December 2022.

  27. arXiv:2211.04020  [pdf, other]

    q-bio.QM cs.LG q-bio.GN q-bio.TO

    Generating counterfactual explanations of tumor spatial proteomes to discover effective strategies for enhancing immune infiltration

    Authors: Zitong Jerry Wang, Alexander M. Xu, Aman Bhargava, Matt W. Thomson

    Abstract: The tumor microenvironment (TME) significantly impacts cancer prognosis due to its immune composition. While therapies for altering the immune composition, including immunotherapies, have shown exciting results for treating hematological cancers, they are less effective for immunologically-cold, solid tumors. Spatial omics technologies capture the spatial organization of the TME with unprecedented…

    Submitted 13 October, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

  28. arXiv:2210.14896  [pdf, other]

    cs.CV cs.AI cs.HC cs.LG

    DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models

    Authors: Zijie J. Wang, Evan Montoya, David Munechika, Haoyang Yang, Benjamin Hoover, Duen Horng Chau

    Abstract: With recent advancements in diffusion models, users can generate high-quality images by writing text prompts in natural language. However, generating images with desired details requires proper prompts, and it is often unclear how a model reacts to different prompts or what the best prompts are. To help researchers tackle these critical challenges, we introduce DiffusionDB, the first large-scale t…

    Submitted 6 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted to ACL 2023 (nominated for best paper, top 1.6% of submissions, oral presentation). 17 pages, 11 figures. The dataset is available at https://huggingface.co/datasets/poloclub/diffusiondb. The code is at https://github.com/poloclub/diffusiondb. The interactive visualization demo is at https://poloclub.github.io/diffusiondb/explorer/

  29. arXiv:2210.00160  [pdf, other]

    cs.SI cs.CR cs.CY cs.HC

    Explaining Website Reliability by Visualizing Hyperlink Connectivity

    Authors: Seongmin Lee, Sadia Afroz, Haekyu Park, Zijie J. Wang, Omar Shaikh, Vibhor Sehgal, Ankit Peshin, Duen Horng Chau

    Abstract: As the information on the Internet continues growing exponentially, understanding and assessing the reliability of a website is becoming increasingly important. Misinformation has far-ranging repercussions, from sowing mistrust in media to undermining democratic elections. While some research investigates how to alert people to misinformation on the web, much less research has been conducted on ex…

    Submitted 30 September, 2022; originally announced October 2022.

    Comments: Accepted at IEEE VIS 2022, 5 pages, 4 figures, For a live demo, visit https://poloclub.github.io/MisVis

  30. TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive Visualization

    Authors: Zijie J. Wang, Chudi Zhong, Rui Xin, Takuya Takagi, Zhi Chen, Duen Horng Chau, Cynthia Rudin, Margo Seltzer

    Abstract: Given thousands of equally accurate machine learning (ML) models, how can users choose among them? A recent ML technique enables domain experts and data scientists to generate a complete Rashomon set for sparse decision trees--a huge set of almost-optimal interpretable ML models. To help ML practitioners identify models with desirable properties from this Rashomon set, we develop TimberTrek, the f…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted at IEEE VIS 2022. 5 pages, 6 figures. For a demo video, see https://youtu.be/3eGqTmsStJM. For a live demo, visit https://poloclub.github.io/timbertrek

  31. arXiv:2209.04966  [pdf, other]

    cs.CV cs.RO

    Multi-modal Streaming 3D Object Detection

    Authors: Mazen Abdelfattah, Kaiwen Yuan, Z. Jane Wang, Rabab Ward

    Abstract: Modern autonomous vehicles rely heavily on mechanical LiDARs for perception. Current perception methods generally require 360° point clouds, collected sequentially as the LiDAR scans the azimuth and acquires consecutive wedge-shaped slices. The acquisition latency of a full scan (~ 100ms) may lead to outdated perception which is detrimental to safe operation. Recent streaming perception works prop…

    Submitted 11 September, 2022; originally announced September 2022.

  32. arXiv:2207.02321  [pdf, ps, other]

    math.DS

    Local rigidity for hyperbolic toral automorphisms

    Authors: Boris Kalinin, Victoria Sadovskaya, Zhenqi Jenny Wang

    Abstract: We consider a hyperbolic toral automorphism $L$ and its $C^1$-small perturbation $f$. It is well-known that $f$ is Anosov and topologically conjugate to $L$, but a conjugacy $H$ is only Hölder continuous in general. We discuss conditions for smoothness of $H$, such as conjugacy of the periodic data of $f$ and $L$, coincidence of their Lyapunov exponents, and weaker regularity of $H$, and we summar…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 13 pages. arXiv admin note: text overlap with arXiv:2111.01309

    MSC Class: 37D20; 37C15

  33. arXiv:2206.15465  [pdf, other]

    cs.LG cs.AI cs.HC

    Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values

    Authors: Zijie J. Wang, Alex Kale, Harsha Nori, Peter Stella, Mark E. Nunnally, Duen Horng Chau, Mihaela Vorvoreanu, Jennifer Wortman Vaughan, Rich Caruana

    Abstract: Machine learning (ML) interpretability techniques can reveal undesirable patterns in data that models exploit to make predictions--potentially causing harms once deployed. However, how to take action to address these patterns is not always clear. In a collaboration between ML and human-computer interaction researchers, physicians, and data scientists, we develop GAM Changer, the first interactive…

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Accepted at KDD 2022. 11 pages, 19 figures. For a demo video, see https://youtu.be/D6whtfInqTc. For a live demo, visit https://interpret.ml/gam-changer

  34. arXiv:2206.13801  [pdf, other]

    cs.IT eess.SP

    Joint Precoding for Active Intelligent Transmitting Surface Empowered Outdoor-to-Indoor Communication in mmWave Cellular Networks

    Authors: Xie Xie, Chen He, Feifei Gao, Zhu Han, Z. Jane Wang

    Abstract: Outdoor-to-indoor communications in millimeter-wave (mmWave) cellular networks have been a challenging research problem due to the severe attenuation and the high penetration loss caused by the propagation characteristics of mmWave signals. We propose a viable solution to implement the outdoor-to-indoor mmWave communication system with the aid of an active intelligent transmitting surface (activ…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 30 pages, 8 figures

  35. arXiv:2206.12540  [pdf, other]

    cs.HC cs.LG

    Visual Auditor: Interactive Visualization for Detection and Summarization of Model Biases

    Authors: David Munechika, Zijie J. Wang, Jack Reidy, Josh Rubin, Krishna Gade, Krishnaram Kenthapadi, Duen Horng Chau

    Abstract: As machine learning (ML) systems become increasingly widespread, it is necessary to audit these systems for biases prior to their deployment. Recent research has developed algorithms for effectively identifying intersectional bias in the form of interpretable, underperforming subsets (or slices) of the data. However, these solutions and their insights are limited without a tool for visually unders…

    Submitted 24 June, 2022; originally announced June 2022.

  36. arXiv:2206.05375  [pdf, other]

    cs.CV

    Generalizable Neural Radiance Fields for Novel View Synthesis with Transformer

    Authors: Dan Wang, Xinrui Cui, Septimiu Salcudean, Z. Jane Wang

    Abstract: We propose a Transformer-based NeRF (TransNeRF) to learn a generic neural radiance field conditioned on observed-view images for the novel view synthesis task. By contrast, existing MLP-based NeRFs cannot directly receive an arbitrary number of observed views and require an auxiliary pooling-based operation to fuse source-view information, resulting in the missing of complicated relatio…

    Submitted 10 June, 2022; originally announced June 2022.

  37. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  38. arXiv:2205.09744  [pdf, other]

    cs.LG cs.CY cs.MM

    Overcoming Language Disparity in Online Content Classification with Multimodal Learning

    Authors: Gaurav Verma, Rohit Mujumdar, Zijie J. Wang, Munmun De Choudhury, Srijan Kumar

    Abstract: Advances in Natural Language Processing (NLP) have revolutionized the way researchers and practitioners address crucial societal problems. Large language models are now the standard to develop state-of-the-art solutions for text detection and classification tasks. However, the development of advanced computational techniques and resources is disproportionately focused on the English language, side…

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at ICWSM 2022 as a full paper

  39. arXiv:2205.03963  [pdf, other]

    cs.HC

    NOVA: A Practical Method for Creating Notebook-Ready Visual Analytics

    Authors: Zijie J. Wang, David Munechika, Seongmin Lee, Duen Horng Chau

    Abstract: How can we develop visual analytics (VA) tools that can be easily adopted? Visualization researchers have developed a large number of web-based VA tools to help data scientists in a wide range of tasks. However, adopting these standalone systems can be challenging, as they require data scientists to create new workflows to streamline the VA processes. Recent surveys suggest computational notebooks…

    Submitted 15 May, 2023; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: Accepted to IEEE VIS 2022 (poster). 2 pages, 1 figure. For a live demo, visit https://poloclub.github.io/nova. For method application examples, see https://github.com/poloclub/nova

  40. arXiv:2204.05899  [pdf, other]

    cs.CV cs.HC cs.LG

    VisCUIT: Visual Auditor for Bias in CNN Image Classifier

    Authors: Seongmin Lee, Zijie J. Wang, Judy Hoffman, Duen Horng Chau

    Abstract: CNN image classifiers are widely used, thanks to their efficiency and accuracy. However, they can suffer from biases that impede their practical applications. Most existing bias investigation techniques are either inapplicable to general image classification tasks or require significant user efforts in perusing all data subgroups to manually specify which data attributes to inspect. We present Vis…

    Submitted 13 April, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: 9 pages, 4 figures

  41. arXiv:2203.11490  [pdf, other]

    cs.CV

    SSD-KD: A Self-supervised Diverse Knowledge Distillation Method for Lightweight Skin Lesion Classification Using Dermoscopic Images

    Authors: Yongwei Wang, Yuheng Wang, Tim K. Lee, Chunyan Miao, Z. Jane Wang

    Abstract: Skin cancer is one of the most common types of malignancy, affecting a large population and causing a heavy economic burden worldwide. Over the last few years, computer-aided diagnosis has developed rapidly and made great progress in healthcare and medical practices due to the advances in artificial intelligence. However, most studies in skin cancer detection keep pursuing high prediction acc…

    Submitted 29 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: 14 pages, 5 figures

  42. arXiv:2203.08176  [pdf, other]

    cs.LG cs.AI cs.CR cs.DC

    SemiPFL: Personalized Semi-Supervised Federated Learning Framework for Edge Intelligence

    Authors: Arvin Tashakori, Wenwen Zhang, Z. Jane Wang, Peyman Servati

    Abstract: Recent advances in wearable devices and Internet-of-Things (IoT) have led to massive growth in sensor data generated in edge devices. Labeling such massive data for classification tasks has proven to be challenging. In addition, data generated by different users bear various personal attributes and edge heterogeneity, rendering it impractical to develop a global model that adapts well to all users…

    Submitted 19 November, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  43. StickyLand: Breaking the Linear Presentation of Computational Notebooks

    Authors: Zijie J. Wang, Katie Dai, W. Keith Edwards

    Abstract: How can we better organize code in computational notebooks? Notebooks have become a popular tool among data scientists, as they seamlessly weave text and code together, supporting users to rapidly iterate and document code experiments. However, it is often challenging to organize code in notebooks, partially because there is a mismatch between the linear presentation of code and the non-linear pro…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted at CHI 2022 (Late-Breaking Work). 7 pages, 6 figures. For a demo video, see https://youtu.be/OKaPmEBzEX0. For a live demo, visit https://zijie.wang/#stickyland-demo

  44. arXiv:2201.09685  [pdf, other]

    cs.IT eess.SP

    Robust Joint Design for Intelligent Reflecting Surfaces Assisted Cell-Free Networks

    Authors: Xie Xie, Chen He, Xiaoya Li, Zhu Han, Z. Jane Wang

    Abstract: Intelligent reflecting surfaces (IRSs) have emerged as a promising economical solution to implement cell-free networks. However, the performance gains achieved by IRSs critically depend on smartly tuned passive beamforming based on the assumption that the accurate channel state information (CSI) knowledge is available, which is practically impossible. Thus, in this paper, we investigate the impact…

    Submitted 20 February, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: 30 pages

  45. arXiv:2112.11593  [pdf, other]

    cs.CV

    AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation

    Authors: Mohsen Gholami, Bastian Wandt, Helge Rhodin, Rabab Ward, Z. Jane Wang

    Abstract: This paper addresses the problem of cross-dataset generalization of 3D human pose estimation models. Testing a pre-trained 3D pose estimator on a new dataset results in a major performance drop. Previous methods have mainly addressed this problem by improving the diversity of the training data. We argue that diversity alone is not sufficient and that the characteristics of the training data need t…

    Submitted 15 March, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

  46. arXiv:2112.06654  [pdf, other]

    eess.SP cs.HC cs.LG

    Toward Open-World Electroencephalogram Decoding Via Deep Learning: A Comprehensive Survey

    Authors: Xun Chen, Chang Li, Aiping Liu, Martin J. McKeown, Ruobing Qian, Z. Jane Wang

    Abstract: Electroencephalogram (EEG) decoding aims to identify the perceptual, semantic, and cognitive content of neural processing based on non-invasively measured brain activity. Traditional EEG decoding methods have achieved moderate success when applied to data acquired in static, well-controlled lab environments. However, an open-world environment is a more realistic setting, where situations affecting…

    Submitted 16 December, 2021; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted by the IEEE Signal Processing Magazine

  47. arXiv:2112.03245  [pdf, other]

    cs.LG cs.AI cs.HC

    GAM Changer: Editing Generalized Additive Models with Interactive Visualization

    Authors: Zijie J. Wang, Alex Kale, Harsha Nori, Peter Stella, Mark Nunnally, Duen Horng Chau, Mihaela Vorvoreanu, Jennifer Wortman Vaughan, Rich Caruana

    Abstract: Recent strides in interpretable machine learning (ML) research reveal that models exploit undesirable patterns in the data to make predictions, which potentially causes harms in deployment. However, it is unclear how we can fix these models. We present our ongoing work, GAM Changer, an open-source interactive system to help data scientists and domain experts easily and responsibly edit their Gener…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 7 pages, 15 figures, accepted to the Research2Clinics workshop at NeurIPS 2021. For a demo video, see https://youtu.be/2gVSoPoSeJ8. For a live demo, visit https://interpret.ml/gam-changer/

  48. arXiv:2112.02721  [pdf, other]

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo, et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split…

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  49. arXiv:2111.01309  [pdf, ps, other]

    math.DS

    Smooth local rigidity for hyperbolic toral automorphisms

    Authors: Boris Kalinin, Victoria Sadovskaya, Zhenqi Jenny Wang

    Abstract: We study the regularity of a conjugacy $H$ between a hyperbolic toral automorphism $A$ and its smooth perturbation $f$. We show that if $H$ is weakly differentiable then it is $C^{1+Hölder}$ and, if $A$ is also weakly irreducible, then $H$ is $C^\infty$. As a part of the proof, we establish results of independent interest on Hölder continuity of a measurable conjugacy between linear cocycles over a…

    Submitted 5 July, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 43 pages

    MSC Class: 37D20; 37C15

  50. Rethinking Crowdsourcing Annotation: Partial Annotation with Salient Labels for Multi-Label Image Classification

    Authors: Jianzhe Lin, Tianze Yu, Z. Jane Wang

    Abstract: Annotated images are required for both supervised model training and evaluation in image classification. Manually annotating images is arduous and expensive, especially for multi-labeled images. A recent trend for conducting such laboursome annotation tasks is through crowdsourcing, where images are annotated by volunteers or paid workers online (e.g., workers of Amazon Mechanical Turk) from scrat…

    Submitted 6 September, 2021; originally announced September 2021.