Skip to main content

Showing 1–50 of 95 results for author: Wu, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.04674  [pdf, other

    cs.DB

    Towards Accurate and Efficient Document Analytics with Large Language Models

    Authors: Yiming Lin, Madelon Hulsebos, Ruiying Ma, Shreya Shankar, Sepanta Zeigham, Aditya G. Parameswaran, Eugene Wu

    Abstract: Unstructured data formats account for over 80% of the data currently stored, and extracting value from such formats remains a considerable challenge. In particular, current approaches for managing unstructured documents do not support ad-hoc analytical queries on document collections. Moreover, Large Language Models (LLMs) directly applied to the documents themselves, or on portions of documents t… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  2. arXiv:2405.04042  [pdf, other

    cs.CV cs.AI

    Space-time Reinforcement Network for Video Object Segmentation

    Authors: Yadang Chen, Wentao Zhu, Zhi-Xin Yang, Enhua Wu

    Abstract: Recently, video object segmentation (VOS) networks typically use memory-based methods: for each query frame, the mask is predicted by space-time matching to memory frames. Despite these methods having superior performance, they suffer from two issues: 1) Challenging data can destroy the space-time coherence between adjacent video frames. 2) Pixel-level matching will lead to undesired mismatching c… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by ICME 2024. 6 pages, 10 figures

  3. arXiv:2404.12552  [pdf, other

    cs.DB

    Cocoon: Semantic Table Profiling Using Large Language Models

    Authors: Zezhou Huang, Eugene Wu

    Abstract: Data profilers play a crucial role in the preprocessing phase of data analysis by identifying quality issues such as missing, extreme, or erroneous values. Traditionally, profilers have relied solely on statistical methods, which lead to high false positives and false negatives. For example, they may incorrectly flag missing values where such absences are expected and normal based on the data's se… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  4. arXiv:2404.10198  [pdf, other

    cs.CL cs.AI

    ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence

    Authors: Kevin Wu, Eric Wu, James Zou

    Abstract: Retrieval augmented generation (RAG) is frequently used to mitigate hallucinations and provide up-to-date knowledge for large language models (LLMs). However, given that document retrieval is an imprecise task and sometimes results in erroneous or even harmful content being presented in context, this raises the question of how LLMs handle retrieved information: If the provided content is incorrect… ▽ More

    Submitted 10 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Revised June 9 2024

  5. arXiv:2403.04261  [pdf

    cs.AI cs.CL cs.LG

    Advancing Biomedical Text Mining with Community Challenges

    Authors: Hui Zong, Rongrong Wu, Jiaxue Cha, Erman Wu, Jiakun Li, Liang Tao, Zuofeng Li, Buzhou Tang, Bairong Shen

    Abstract: The field of biomedical research has witnessed a significant increase in the accumulation of vast amounts of textual data from various sources such as scientific literatures, electronic health records, clinical trial reports, and social media. However, manually processing and analyzing these extensive and complex resources is time-consuming and inefficient. To address this challenge, biomedical te… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  6. arXiv:2402.05160  [pdf, other

    cs.SE cs.AI cs.LG

    What's documented in AI? Systematic Analysis of 32K AI Model Cards

    Authors: Weixin Liang, Nazneen Rajani, Xinyu Yang, Ezinwanne Ozoani, Eric Wu, Yiqun Chen, Daniel Scott Smith, James Zou

    Abstract: The rapid proliferation of AI models has underscored the importance of thorough documentation, as it enables users to understand, trust, and effectively utilize these models in various applications. Although developers are encouraged to produce model cards, it's not clear how much information or what information these cards contain. In this study, we conduct a comprehensive analysis of 32,111 AI m… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  7. arXiv:2402.02008  [pdf, other

    cs.CL cs.AI

    How well do LLMs cite relevant medical references? An evaluation framework and analyses

    Authors: Kevin Wu, Eric Wu, Ally Cassasola, Angela Zhang, Kevin Wei, Teresa Nguyen, Sith Riantawan, Patricia Shi Riantawan, Daniel E. Ho, James Zou

    Abstract: Large language models (LLMs) are currently being used to answer medical questions across a variety of clinical domains. Recent top-performing commercial LLMs, in particular, are also capable of citing sources to support their responses. In this paper, we ask: do the sources that LLMs generate actually support the claims that they make? To answer this, we propose three contributions. First, as expe… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  8. arXiv:2401.03038  [pdf, other

    cs.DB cs.SE

    SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines

    Authors: Shreya Shankar, Haotian Li, Parth Asawa, Madelon Hulsebos, Yiming Lin, J. D. Zamfirescu-Pereira, Harrison Chase, Will Fu-Hinthorn, Aditya G. Parameswaran, Eugene Wu

    Abstract: Large language models (LLMs) are being increasingly deployed as part of pipelines that repeatedly process or generate data of some sort. However, a common barrier to deployment are the frequent and often unpredictable errors that plague LLMs. Acknowledging the inevitability of these errors, we propose {\em data quality assertions} to identify when LLMs may be making mistakes. We present SPADE, a m… ▽ More

    Submitted 31 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 17 pages, 6 figures

  9. arXiv:2401.01456  [pdf, other

    cs.CV

    ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text

    Authors: Dingkun Yan, Liang Yuan, Erwin Wu, Yuma Nishioka, Issei Fujishiro, Suguru Saito

    Abstract: Diffusion models have recently demonstrated their effectiveness in generating extremely high-quality images and are now utilized in a wide range of applications, including automatic sketch colorization. Although many methods have been developed for guided sketch colorization, there has been limited exploration of the potential conflicts between image prompts and sketch inputs, which can lead to se… ▽ More

    Submitted 2 July, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  10. arXiv:2312.14943  [pdf, other

    cs.IR cs.CL cs.LG

    Flood Event Extraction from News Media to Support Satellite-Based Flood Insurance

    Authors: Tejit Pabari, Beth Tellman, Giannis Karamanolakis, Mitchell Thomas, Max Mauerman, Eugene Wu, Upmanu Lall, Marco Tedesco, Michael S Steckler, Paolo Colosio, Daniel E Osgood, Melody Braun, Jens de Bruijn, Shammun Islam

    Abstract: Floods cause large losses to property, life, and livelihoods across the world every year, hindering sustainable development. Safety nets to help absorb financial shocks in disasters, such as insurance, are often unavailable in regions of the world most vulnerable to floods, like Bangladesh. Index-based insurance has emerged as an affordable solution, which considers weather data or information fro… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  11. arXiv:2310.18742  [pdf, other

    cs.DB

    Data Ambiguity Strikes Back: How Documentation Improves GPT's Text-to-SQL

    Authors: Zezhou Huang, Pavan Kalyan Damalapati, Eugene Wu

    Abstract: Text-to-SQL allows experts to use databases without in-depth knowledge of them. However, real-world tasks have both query and data ambiguities. Most works on Text-to-SQL focused on query ambiguities and designed chat interfaces for experts to provide clarifications. In contrast, the data management community has long studied data ambiguities, but mainly addresses error detection and correction, ra… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  12. arXiv:2310.00902  [pdf, other

    cs.LG stat.ML

    DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

    Authors: Yongchan Kwon, Eric Wu, Kevin Wu, James Zou

    Abstract: Quantifying the impact of training data points is crucial for understanding the outputs of machine learning models and for improving the transparency of the AI pipeline. The influence function is a principled and popular data attribution method, but its computational cost often makes it challenging to use. This issue becomes more pronounced in the setting of large language models and text-to-image… ▽ More

    Submitted 13 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  13. arXiv:2308.12480  [pdf, other

    cs.DB

    Lightweight Materialization for Fast Dashboards Over Joins

    Authors: Zezhou Huang, Eugene Wu

    Abstract: Dashboards are vital in modern business intelligence tools, providing non-technical users with an interface to access comprehensive business data. With the rise of cloud technology, there is an increased number of data sources to provide enriched contexts for various analytical tasks, leading to a demand for interactive dashboards over a large number of joins. Nevertheless, joins are among the mos… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Journal ref: SIGMOD 2024

  14. arXiv:2308.05637  [pdf, other

    cs.DB

    The Fast and the Private: Task-based Dataset Search

    Authors: Zezhou Huang, Jiaxiang Liu, Haonan Wang, Eugene Wu

    Abstract: Modern dataset search platforms employ ML task-based utility metrics instead of relying on metadata-based keywords to comb through extensive dataset repositories. In this setup, requesters provide an initial dataset, and the platform identifies complementary datasets to augment (join or union) the requester's dataset such that the ML model (e.g., linear regression) performance is improved most. Al… ▽ More

    Submitted 20 August, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

  15. DIG: The Data Interface Grammar

    Authors: Yiru Chen, Jeffery Tao, Eugene Wu

    Abstract: Building interactive data interfaces is hard because the design of an interface depends on the data processing needs for the underlying analysis task, yet we do not have a good representation for analysis tasks. To fill this gap, this paper advocates for a Data Interface Grammar (DIG) as an intermediate representation of analysis tasks. We show that DIG is compatible with existing data engineering… ▽ More

    Submitted 16 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 7 pages, Workshop on Human-In-the-Loop Data Analytics(HILDA) at SIGMOD 2023

    ACM Class: H.2; H.5.2; H.2.3

  16. arXiv:2307.00432  [pdf, other

    cs.DB cs.CR

    Saibot: A Differentially Private Data Search Platform

    Authors: Zezhou Huang, Jiaxiang Liu, Daniel Alabi, Raul Castro Fernandez, Eugene Wu

    Abstract: Recent data search platforms use ML task-based utility measures rather than metadata-based keywords, to search large dataset corpora. Requesters submit a training dataset and these platforms search for augmentations (join or union compatible datasets) that, when used to augment the requester's dataset, most improve model (e.g., linear regression) performance. Although effective, providers that man… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Journal ref: VLDB 2023

  17. arXiv:2307.00422  [pdf, other

    cs.DB cs.LG

    JoinBoost: Grow Trees Over Normalized Data Using Only SQL

    Authors: Zezhou Huang, Rathijit Sen, Jiaxiang Liu, Eugene Wu

    Abstract: Although dominant for tabular data, ML libraries that train tree models over normalized databases (e.g., LightGBM, XGBoost) require the data to be denormalized as a single table, materialized, and exported. This process is not scalable, slow, and poses security risks. In-DB ML aims to train models within DBMSes to avoid data movement and provide data governance. Rather than modify a DBMS to suppor… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Journal ref: VLDB 2023

  18. Aggregation Consistency Errors in Semantic Layers and How to Avoid Them

    Authors: Zezhou Huang, Pavan Kalyan Damalapati, Eugene Wu

    Abstract: Analysts often struggle with analyzing data from multiple tables in a database due to their lack of knowledge on how to join and aggregate the data. To address this, data engineers pre-specify "semantic layers" which include the join conditions and "metrics" of interest with aggregation functions and expressions. However, joins can cause "aggregation consistency issues". For example, analysts may… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Journal ref: Proceedings of the Workshop on Human-In-the-Loop Data Analytics 2023

  19. arXiv:2306.14525  [pdf, other

    cs.CV

    ParameterNet: Parameters Are All You Need

    Authors: Kai Han, Yunhe Wang, Jianyuan Guo, Enhua Wu

    Abstract: The large-scale visual pretraining has significantly improve the performance of large vision models. However, we observe the \emph{low FLOPs pitfall} that the existing low-FLOPs models cannot benefit from large-scale pretraining. In this paper, we introduce a novel design principle, termed ParameterNet, aimed at augmenting the number of parameters in large-scale visual pretraining models while min… ▽ More

    Submitted 14 January, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: https://parameternet.github.io/

  20. arXiv:2305.10419  [pdf, other

    cs.DB

    Kitana: Efficient Data Augmentation Search for AutoML

    Authors: Zezhou Huang, Pranav Subramaniam, Raul Castro Fernandez, Eugene Wu

    Abstract: AutoML services provide a way for non-expert users to benefit from high-quality ML models without worrying about model design and deployment, in exchange for a charge per hour ($21.252 for VertexAI). However, existing AutoML services are model-centric, in that they are limited to extracting features and searching for models from initial training data-they are only as effective as the initial train… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  21. arXiv:2304.11840  [pdf, other

    cs.CV cs.MM

    Robust and Efficient Memory Network for Video Object Segmentation

    Authors: Yadang Chen, Dingwei Zhang, Zhi-xin Yang, Enhua Wu

    Abstract: This paper proposes a Robust and Efficient Memory Network, referred to as REMN, for studying semi-supervised video object segmentation (VOS). Memory-based methods have recently achieved outstanding VOS performance by performing non-local pixel-wise matching between the query and memory. However, these methods have two limitations. 1) Non-local matching could cause distractor objects in the backgro… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: Accepted by ICME 2023. 6 pages, 6 figures

  22. arXiv:2304.02819  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    GPT detectors are biased against non-native English writers

    Authors: Weixin Liang, Mert Yuksekgonul, Yining Mao, Eric Wu, James Zou

    Abstract: The rapid adoption of generative language models has brought about substantial advancements in digital communication, while simultaneously raising concerns regarding the potential misuse of AI-generated content. Although numerous detection methods have been proposed to differentiate between AI and human-generated content, the fairness and robustness of these detectors remain underexplored. In this… ▽ More

    Submitted 10 July, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  23. arXiv:2303.07080  [pdf, other

    cs.CV

    Bag of Tricks with Quantized Convolutional Neural Networks for image classification

    Authors: Jie Hu, Mengze Zeng, Enhua Wu

    Abstract: Deep neural networks have been proven effective in a wide range of tasks. However, their high computational and memory costs make them impractical to deploy on resource-constrained devices. To address this issue, quantization schemes have been proposed to reduce the memory footprint and improve inference speed. While numerous quantization methods have been proposed, they lack systematic analysis f… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023

  24. arXiv:2302.08760  [pdf, other

    cs.CV cs.AI cs.LG

    3D Human Pose Lifting with Grid Convolution

    Authors: Yangyuxuan Kang, Yuyang Liu, Anbang Yao, Shandong Wang, Enhua Wu

    Abstract: Existing lifting networks for regressing 3D human poses from 2D single-view poses are typically constructed with linear layers based on graph-structured representation learning. In sharp contrast to them, this paper presents Grid Convolution (GridConv), mimicking the wisdom of regular convolution operations in image space. GridConv is based on a novel Semantic Grid Transformation (SGT) which lever… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: Oral paper at AAAI 2023. Project website: https://github.com/OSVAI/GridConv

  25. arXiv:2301.01367  [pdf, other

    cs.GT math.OC

    The Price of Anarchy of the Asymmetric One-Sided Allocation Problem

    Authors: Sissi Jiang, Ndiame Ndiaye, Adrian Vetta, Eggie Wu

    Abstract: We study fair mechanisms for the (asymmetric) one-sided allocation problem with m items and n multi-unit demand agents with additive, unit-sum valuations. The symmetric case (m=n), the one-sided matching problem, has been studied extensively for the class of unit demand agents, in particular with respect to the folklore Random Priority mechanism and the Probabilistic Serial mechanism, introduced b… ▽ More

    Submitted 13 May, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

    Comments: 32 pages, 4 figures

  26. arXiv:2212.10200  [pdf, other

    cs.CV

    Redistribution of Weights and Activations for AdderNet Quantization

    Authors: Ying Nie, Kai Han, Haikang Diao, Chuanjian Liu, Enhua Wu, Yunhe Wang

    Abstract: Adder Neural Network (AdderNet) provides a new way for develo** energy-efficient neural networks by replacing the expensive multiplications in convolution with cheaper additions (i.e.l1-norm). To achieve higher hardware efficiency, it is necessary to further study the low-bit quantization of AdderNet. Due to the limitation that the commutative law in multiplication does not hold in l1-norm, the… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  27. arXiv:2210.03851  [pdf, other

    cs.DB

    Calibration: A Simple Trick for Wide-table Delta Analytics

    Authors: Zezhou Huang, Eugene Wu

    Abstract: Data analytics over normalized databases typically requires computing and materializing expensive joins (wide-tables). Factorized query execution models execution as message passing between relations in the join graph and pushes aggregations through joins to reduce intermediate result sizes. Although this accelerates query execution, it only optimizes a single wide-table query. In contrast, wide-t… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  28. arXiv:2209.15611  [pdf, other

    q-bio.BM cs.AI

    Protein structure generation via folding diffusion

    Authors: Kevin E. Wu, Kevin K. Yang, Rianne van den Berg, James Y. Zou, Alex X. Lu, Ava P. Amini

    Abstract: The ability to computationally generate novel yet physically foldable protein structures could lead to new biological discoveries and new treatments targeting yet incurable diseases. Despite recent advances in protein structure prediction, directly generating diverse, novel protein structures from neural networks remains difficult. In this work, we present a new diffusion-based generative model th… ▽ More

    Submitted 23 November, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    ACM Class: I.2.0; J.3

  29. arXiv:2209.15278  [pdf, other

    cs.LG cs.AI cs.NE

    Rethinking skip connection model as a learnable Markov chain

    Authors: Dengsheng Chen, Jie Hu, Wenwen Qiang, Xiaoming Wei, Enhua Wu

    Abstract: Over past few years afterward the birth of ResNet, skip connection has become the defacto standard for the design of modern architectures due to its widespread adoption, easy optimization and proven performance. Prior work has explained the effectiveness of the skip connection mechanism from different perspectives. In this work, we deep dive into the model's behaviors with skip connections which c… ▽ More

    Submitted 2 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 12 pages, 4 figures

  30. arXiv:2209.08834  [pdf, other

    cs.HC cs.AI cs.CL cs.DB

    NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries

    Authors: Yiru Chen, Ryan Li, Austin Mac, Tianbao Xie, Tao Yu, Eugene Wu

    Abstract: We develop NL2INTERFACE to explore the potential of generating usable interactive multi-visualization interfaces from natural language queries. With NL2INTERFACE, users can directly write natural language queries to automatically generate a fully interactive multi-visualization interface without any extra effort of learning a tool or programming language. Further, users can interact with the inter… ▽ More

    Submitted 24 September, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: 5 pages, IEEE Visualization Conference NLVIZ Workshop 2022

    ACM Class: H.5.2; H.2; I.2.7

    Journal ref: IEEE Visualization Conference NLVIZ Workshop 2022

  31. arXiv:2206.11448  [pdf, ps, other

    cs.LG cs.AI

    Efficient Adaptive Federated Optimization of Federated Learning for IoT

    Authors: Zunming Chen, Hongyan Cui, Ensen Wu, Yu Xi

    Abstract: The proliferation of the Internet of Things (IoT) and widespread use of devices with sensing, computing, and communication capabilities have motivated intelligent applications empowered by artificial intelligence. The classical artificial intelligence algorithms require centralized data collection and processing which are challenging in realistic intelligent IoT applications due to growing data pr… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  32. arXiv:2206.00272  [pdf, other

    cs.CV

    Vision GNN: An Image is Worth Graph of Nodes

    Authors: Kai Han, Yunhe Wang, Jianyuan Guo, Yehui Tang, Enhua Wu

    Abstract: Network architecture plays a key role in the deep learning-based computer vision system. The widely-used convolutional neural network and transformer treat the image as a grid or sequence structure, which is not flexible to capture irregular and complex objects. In this paper, we propose to represent the image as a graph structure and introduce a new Vision GNN (ViG) architecture to extract graph-… ▽ More

    Submitted 4 November, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022

  33. arXiv:2205.04148  [pdf, other

    cs.DC

    Productive Performance Engineering for Weather and Climate Modeling with Python

    Authors: Tal Ben-Nun, Linus Groner, Florian Deconinck, Tobias Wicky, Eddie Davis, Johann Dahm, Oliver D. Elbert, Rhea George, Jeremy McGibbon, Lukas Trümper, Elynn Wu, Oliver Fuhrer, Thomas Schulthess, Torsten Hoefler

    Abstract: Earth system models are developed with a tight coupling to target hardware, often containing specialized code predicated on processor characteristics. This coupling stems from using imperative languages that hard-code computation schedules and layout. We present a detailed account of optimizing the Finite Volume Cubed-Sphere Dynamical Core (FV3), improving productivity and performance. By using a… ▽ More

    Submitted 25 August, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

  34. arXiv:2205.01283  [pdf, other

    cs.DB cs.HC

    Extending the View Composition Algebra to Hierarchical Data

    Authors: Eugene Wu

    Abstract: Comparison is a core task in visual analysis. Although there are numerous guidelines to help users design effective visualizations to aid known comparison tasks, there are few formalisms that define the semantics of comparison operations in a way that can serve as the basis for a grammar of comparison interactions. Recent work proposed a formalism called View Composition Algebra (VCA) that enables… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  35. arXiv:2205.01263  [pdf, other

    cs.HC

    How Do Captions Affect Visualization Reading?

    Authors: Hanxiu 'Hazel' Zhu, Shelly Shiying Cheng, Eugene Wu

    Abstract: Captions help readers better understand visualizations. However, if the visualization is intended to communicate specific features, should the caption be statistical, and focus on specific values, or perceptual, and focus on general patterns? Prior work has shown that when captions mention visually salient features, readers tend to recall those features. Still, we lack explicit guidelines for how… ▽ More

    Submitted 26 September, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    ACM Class: H.5.2

  36. arXiv:2204.14267  [pdf, other

    cs.HC cs.DB cs.FL

    A Grammar of Hypotheses for Visualization, Data, and Analysis

    Authors: Ashley Suh, Ab Mosca, Eugene Wu, Remco Chang

    Abstract: We present a grammar for expressing hypotheses in visual data analysis to formalize the previously abstract notion of "analysis tasks." Through the lens of our grammar, we lay the groundwork for how a user's data analysis questions can be operationalized and automated as a set of hypotheses (a hypothesis space). We demonstrate that our grammar-based approach for analysis tasks can provide a system… ▽ More

    Submitted 3 April, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

  37. arXiv:2202.07836  [pdf, other

    cs.HC cs.DB

    View Composition Algebra for Ad Hoc Comparison

    Authors: Eugene Wu

    Abstract: Comparison is a core task in visual analysis. Although there are numerous guidelines to help users design effective visualizations to aid known comparison tasks, there are few techniques available when users want to make ad hoc comparisons between marks, trends, or charts during data exploration and visual analysis. For instance, to compare voting count maps from different years, two stock trends… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  38. Demonstration of PI2: Interactive Visualization Interface Generation for SQL Analysis in Notebook

    Authors: Jeffrey Tao, Yiru Chen, Eugene Wu

    Abstract: We demonstrate PI2, the first notebook extension that can automatically generate interactive visualization interfaces during SQL-based analyses.

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:2107.08203

    ACM Class: H.2; H.5.2

    Journal ref: SIGMOD '22: Proceedings of the 2022 International Conference on Management of Data

  39. GhostNets on Heterogeneous Devices via Cheap Operations

    Authors: Kai Han, Yunhe Wang, Chang Xu, Jianyuan Guo, Chun**g Xu, Enhua Wu, Qi Tian

    Abstract: Deploying convolutional neural networks (CNNs) on mobile devices is difficult due to the limited memory and computation resources. We aim to design efficient neural networks for heterogeneous devices including CPU and GPU, by exploiting the redundancy in feature maps, which has rarely been investigated in neural architecture design. For CPU-like devices, we propose a novel CPU-efficient Ghost (C-G… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: Accepted by IJCV 2022. Extension of GhostNet CVPR2020 paper (arXiv:1911.11907). arXiv admin note: substantial text overlap with arXiv:1911.11907

  40. A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level

    Authors: Iddo Drori, Sarah Zhang, Reece Shuttleworth, Leonard Tang, Albert Lu, Elizabeth Ke, Kevin Liu, Linda Chen, Sunny Tran, Newman Cheng, Roman Wang, Nikhil Singh, Taylor L. Patti, Jayson Lynch, Avi Shporer, Nakul Verma, Eugene Wu, Gilbert Strang

    Abstract: We demonstrate that a neural network pre-trained on text and fine-tuned on code solves mathematics course problems, explains solutions, and generates new questions at a human level. We automatically synthesize programs using few-shot learning and OpenAI's Codex transformer and execute them to solve course problems at 81% automatic accuracy. We curate a new dataset of questions from MIT's largest m… ▽ More

    Submitted 30 May, 2022; v1 submitted 31 December, 2021; originally announced December 2021.

    Comments: 181 pages, 8 figures, 280 tables

  41. arXiv:2112.10149  [pdf, other

    cs.CV

    Elastic-Link for Binarized Neural Network

    Authors: Jie Hu, Ziheng Wu, Vince Tan, Zhilin Lu, Mengze Zeng, Enhua Wu

    Abstract: Recent work has shown that Binarized Neural Networks (BNNs) are able to greatly reduce computational costs and memory footprints, facilitating model deployment on resource-constrained devices. However, in comparison to their full-precision counterparts, BNNs suffer from severe accuracy degradation. Research aiming to reduce this accuracy gap has thus far largely focused on specific network archite… ▽ More

    Submitted 17 February, 2023; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: AAAI2022

  42. arXiv:2111.08168  [pdf, other

    cs.LG cs.AI

    Explaining medical AI performance disparities across sites with confounder Shapley value analysis

    Authors: Eric Wu, Kevin Wu, James Zou

    Abstract: Medical AI algorithms can often experience degraded performance when evaluated on previously unseen sites. Addressing cross-site performance disparities is key to ensuring that AI is equitable and effective when deployed on diverse patient populations. Multi-site evaluations are key to diagnosing such disparities as they can test algorithms across a broader range of potential biases such as patien… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  43. arXiv:2109.09618  [pdf, other

    cs.HC

    Automatic Y-axis Rescaling in Dynamic Visualizations

    Authors: Jacob Fisher, Remco Chang, Eugene Wu

    Abstract: Animated and interactive data visualizations dynamically change the data rendered in a visualization (e.g., bar chart). As the data changes, the y-axis may need to be rescaled as the domain of the data changes. Each axis rescaling potentially improves the readability of the current chart, but may also disorient the user. In contrast to static visualizations, where there is considerable literature… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 5 pages, 7 figures, to be published in IEEE VIS 2021

  44. Learning Versatile Convolution Filters for Efficient Visual Recognition

    Authors: Kai Han, Yunhe Wang, Chang Xu, Chun**g Xu, Enhua Wu, Dacheng Tao

    Abstract: This paper introduces versatile filters to construct efficient convolutional neural networks that are widely used in various visual recognition tasks. Considering the demands of efficient deep learning techniques running on cost-effective hardware, a number of methods have been developed to learn compact neural networks. Most of these works aim to slim down filters in different ways, \eg,~investig… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: Accepted by TPAMI. Extended version of NeurIPS 2018 paper

  45. arXiv:2108.11884  [pdf

    cs.LG cs.CR cs.DB

    Enabling SQL-based Training Data Debugging for Federated Learning

    Authors: Yejia Liu, Weiyuan Wu, Lampros Flokas, Jiannan Wang, Eugene Wu

    Abstract: How can we debug a logistical regression model in a federated learning setting when seeing the model behave unexpectedly (e.g., the model rejects all high-income customers' loan applications)? The SQL-based training data debugging framework has proved effective to fix this kind of issue in a non-federated learning setting. Given an unexpected query result over model predictions, this framework aut… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

  46. arXiv:2108.08202  [pdf, other

    eess.IV cs.CV

    Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation

    Authors: Jiaming Liu, Ming Lu, Kaixin Chen, Xiaoqi Li, Shizun Wang, Zhaoqing Wang, Enhua Wu, Yurong Chen, Chuang Zhang, Ming Wu

    Abstract: Internet video delivery has undergone a tremendous explosion of growth over the past few years. However, the quality of video delivery system greatly depends on the Internet bandwidth. Deep Neural Networks (DNNs) are utilized to improve the quality of video delivery recently. These methods divide a video into chunks, and stream LR video chunks and corresponding content-aware models to the client.… ▽ More

    Submitted 17 September, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV 2021

  47. PI2: End-to-end Interactive Visualization Interface Generation from Queries

    Authors: Yiru Chen, Eugene Wu

    Abstract: Interactive visual analysis interfaces are critical in nearly every data task. However, creating new interfaces is deeply challenging, as it requires the developer to understand the queries needed to express the desired analysis task, design the appropriate interface to express those queries for the task, and implement the interface using a combination of visualization, browser, server, and databa… ▽ More

    Submitted 19 September, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: 16 pages

    ACM Class: H.2; H.5.2

  48. arXiv:2106.02898  [pdf, other

    cs.CV

    Dynamic Resolution Network

    Authors: Mingjian Zhu, Kai Han, Enhua Wu, Qiulin Zhang, Ying Nie, Zhenzhong Lan, Yunhe Wang

    Abstract: Deep convolutional neural networks (CNNs) are often of sophisticated design with numerous learnable parameters for the accuracy reason. To alleviate the expensive costs of deploying them on mobile devices, recent works have made huge efforts for excavating redundancy in pre-defined architectures. Nevertheless, the redundancy on the input resolution of modern CNNs has not been fully investigated, i… ▽ More

    Submitted 6 November, 2021; v1 submitted 5 June, 2021; originally announced June 2021.

    Comments: Accepted by NeurIPS 2021

  49. arXiv:2103.07037  [pdf, other

    cs.DB

    Reptile: Aggregation-level Explanations for Hierarchical Data

    Authors: Zezhou Huang, Eugene Wu

    Abstract: Recent query explanation systems help users understand anomalies in aggregation results by proposing predicates that describe input records that, if deleted, would resolve the anomalies. However, it can be difficult for users to understand how a predicate was chosen, and these approaches are limited to errors that can be resolved through deletion. In contrast, data errors may be due to group-wise… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

  50. arXiv:2103.00112  [pdf, other

    cs.CV cs.AI

    Transformer in Transformer

    Authors: Kai Han, An Xiao, Enhua Wu, Jianyuan Guo, Chun**g Xu, Yunhe Wang

    Abstract: Transformer is a new kind of neural architecture which encodes the input data as powerful features via the attention mechanism. Basically, the visual transformers first divide the input images into several local patches and then calculate both representations and their relationship. Since natural images are of high complexity with abundant detail and color information, the granularity of the patch… ▽ More

    Submitted 25 October, 2021; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: Accepted by NeurIPS 2021