Skip to main content

Showing 1–50 of 55 results for author: Shu, M

.
  1. arXiv:2406.15352  [pdf, other

    cs.CL

    A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick

    Authors: Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan Boyd-Graber

    Abstract: Keyword mnemonics are memorable explanations that link new terms to simpler keywords. Prior works generate mnemonics for students, but they do not guide models toward mnemonics students prefer and aid learning. We build SMART, a mnemonic generator trained on feedback from real students learning new terms. To train SMART, we first fine-tune LLaMA-2 on a curated set of user-written mnemonics. We the… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: In-Progress Preprint

  2. arXiv:2406.11271  [pdf, other

    cs.CV cs.LG

    MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

    Authors: Anas Awadalla, Le Xue, Oscar Lo, Manli Shu, Hannah Lee, Etash Kumar Guha, Matt Jordan, Sheng Shen, Mohamed Awadalla, Silvio Savarese, Caiming Xiong, Ran Xu, Ye** Choi, Ludwig Schmidt

    Abstract: Multimodal interleaved datasets featuring free-form interleaved sequences of images and text are crucial for training frontier large multimodal models (LMMs). Despite the rapid progression of open-source LMMs, there remains a pronounced scarcity of large-scale, diverse open-source multimodal interleaved datasets. In response, we introduce MINT-1T, the most extensive and diverse open-source Multimo… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2405.13628  [pdf

    cond-mat.str-el cond-mat.mtrl-sci

    Spinons in a new Shastry-Sutherland lattice magnet Pr$_2$Ga$_2$BeO$_7$

    Authors: N. Li, A. Brassington, M. F. Shu, Y. Y. Wang, H. Liang, Q. J. Li, X. Zhao, P. J. Baker, H. Kikuchi, T. Masuda, G. Duan, C. Liu, H. Wang, W. Xie, R. Zhong, J. Ma, R. Yu, H. D. Zhou, X. F. Sun

    Abstract: Identifying the elusive spinon excitations in quantum spin liquid (QSL) materials is what scientists have long sought for. Recently, thermal conductivity ($κ$) has emerged to be a decisive probe because the fermionic nature of spinons leads to a characteristic nonzero linear $κ_0/T$ term while approaching zero Kelvin. So far, only a few systems have been reported to exhibit such term. Here, we rep… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 20 pages, 6 figures, with Supplementary Information

  4. arXiv:2404.03145  [pdf, other

    cs.CV

    DreamWalk: Style Space Exploration using Diffusion Guidance

    Authors: Michelle Shu, Charles Herrmann, Richard Strong Bowen, Forrester Cole, Ramin Zabih

    Abstract: Text-conditioned diffusion models can generate impressive images, but fall short when it comes to fine-grained control. Unlike direct-editing tools like Photoshop, text conditioned models require the artist to perform "prompt engineering," constructing special text sentences to control the style or amount of a particular subject present in the output image. Our goal is to provide fine-grained cont… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  5. arXiv:2402.14020  [pdf, other

    cs.LG cs.CL cs.CR

    Coercing LLMs to do and reveal (almost) anything

    Authors: Jonas Gei**, Alex Stein, Manli Shu, Khalid Saifullah, Yuxin Wen, Tom Goldstein

    Abstract: It has recently been shown that adversarial attacks on large language models (LLMs) can "jailbreak" the model into making harmful statements. In this work, we argue that the spectrum of adversarial attacks on LLMs is much larger than merely jailbreaking. We provide a broad overview of possible attack surfaces and attack goals. Based on a series of concrete examples, we discuss, categorize and syst… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 32 pages. Implementation available at https://github.com/JonasGei**/carving

  6. arXiv:2402.12291  [pdf, other

    cs.CL

    KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students

    Authors: Matthew Shu, Nishant Balepur, Shi Feng, Jordan Boyd-Graber

    Abstract: Flashcard schedulers are tools that rely on 1) student models to predict the flashcards a student knows; and 2) teaching policies to schedule cards based on these predictions. Existing student models, however, only use flashcard-level features, like the student's past responses, ignoring the semantic ties of flashcards. Deep Knowledge Tracing (DKT) models can capture semantic relations with langua… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: In-progress preprint

  7. arXiv:2402.06659  [pdf, other

    cs.CR cs.AI cs.LG

    Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models

    Authors: Yuancheng Xu, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, Furong Huang

    Abstract: Vision-Language Models (VLMs) excel in generating textual responses from visual inputs, yet their versatility raises significant security concerns. This study takes the first step in exposing VLMs' susceptibility to data poisoning attacks that can manipulate responses to innocuous, everyday prompts. We introduce Shadowcast, a stealthy data poisoning attack method where poison samples are visually… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  8. arXiv:2402.02775  [pdf

    physics.optics eess.IV physics.bio-ph

    Instant square lattice structured illumination microscopy: an optimal strategy towards photon-saving and real-time super-resolution observation

    Authors: Tianyu Zhao, Zhaojun Wang, Manming Shu, **gxiang Zhang, Yansheng Liang, Shaowei Wang, Ming Lei

    Abstract: Over the past decade, structured illumination microscopy (SIM) has found its niche in super-resolution (SR) microscopy due to its fast imaging speed and low excitation intensity. However, due to the significantly higher light dose compared to wide-field microscopy and the time-consuming post-processing procedures, long-term, real-time, super-resolution observation of living cells is still out of r… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  9. arXiv:2401.16545  [pdf

    cs.DC

    Leveraging Public Cloud Infrastructure for Real-time Connected Vehicle Speed Advisory at a Signalized Corridor

    Authors: Hsien-Wen Deng, M Sabbir Salek, Mizanur Rahman, Mashrur Chowdhury, Mitch Shue, Amy W. Apon

    Abstract: In this study, we developed a real-time connected vehicle (CV) speed advisory application that uses public cloud services and tested it on a simulated signalized corridor for different roadway traffic conditions. First, we developed a scalable serverless cloud computing architecture leveraging public cloud services offered by Amazon Web Services (AWS) to support the requirements of a real-time CV… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  10. arXiv:2312.06284  [pdf

    cond-mat.str-el

    Static magnetic order with strong quantum fluctuations in spin-1/2 honeycomb magnet Na2Co2TeO6

    Authors: Gaoting Lin, **long Jiao, Xiyang Li, Mingfang Shu, Oksana Zaharko, Toni Shiroka, Tao Hong, Alexander I. Kolesnikov, Guochu Deng, Sarah Dunsiger, Haidong Zhou, Tian Shang, Jie Ma

    Abstract: Kitaev interactions, arising from the interplay of frustration and bond anisotropy, can lead to strong quantum fluctuations and, in an ideal case, to a quantum-spin-liquid state. However, in many nonideal materials, spurious non-Kitaev interactions typically promote a zigzag antiferromagnetic order in the d-orbital transition metal compounds. By combining neutron scattering with muon-spin rotation… ▽ More

    Submitted 20 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 28 pages, 11 figures, and 1 lable

  11. arXiv:2312.01073  [pdf

    physics.optics physics.bio-ph

    High-speed image reconstruction for nonlinear structured illumination microscopy

    Authors: **gxiang Zhang, Tianyu Zhao, Xiangda Fu, Manming Shu, Jia**g Yan, **xiao Chen, Yansheng Liang, Shaowei Wang, Ming Lei

    Abstract: By exploiting the nonlinear responses of the fluorescent probes, the spatial resolution of structured illumination microscopy(SIM) can be further increased. However, due to the complex reconstruction process, the traditional reconstruction method of nonlinear structured illumination microscopy (NL-SIM) is relatively slow, which brings a great challenge to realizing real-time display of super-resol… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  12. arXiv:2310.19909  [pdf, other

    cs.CV cs.LG

    Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks

    Authors: Micah Goldblum, Hossein Souri, Renkun Ni, Manli Shu, Viraj Prabhu, Gowthami Somepalli, Prithvijit Chattopadhyay, Mark Ibrahim, Adrien Bardes, Judy Hoffman, Rama Chellappa, Andrew Gordon Wilson, Tom Goldstein

    Abstract: Neural network based computer vision systems are typically built on a backbone, a pretrained or randomly initialized feature extractor. Several years ago, the default option was an ImageNet-trained convolutional neural network. However, the recent past has seen the emergence of countless backbones pretrained using various algorithms and datasets. While this abundance of choice has led to performan… ▽ More

    Submitted 19 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  13. arXiv:2309.04169  [pdf, other

    cs.CV

    Grou** Boundary Proposals for Fast Interactive Image Segmentation

    Authors: Li Liu, Da Chen, Minglei Shu, Laurent D. Cohen

    Abstract: Geodesic models are known as an efficient tool for solving various image segmentation problems. Most of existing approaches only exploit local pointwise image features to track geodesic paths for delineating the objective boundaries. However, such a segmentation strategy cannot take into account the connectivity of the image edge features, increasing the risk of shortcut problem, especially in the… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  14. arXiv:2309.01579  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Direct observation of topological surface states in the layered kagome lattice with broken time-reversal symmetry

    Authors: Zhicheng Jiang, Tongrui Li, Jian Yuan, Zhengtai Liu, Zhipeng Cao, Soohyun Cho, Mingfang Shu, Yichen Yang, Jianyang Ding, Zhikai Li, Jiayu Liu, Zhonghao Liu, Jishan Liu, Jie Ma, Zhe Sun, Yanfeng Guo, Dawei Shen

    Abstract: Magnetic topological quantum materials display a diverse range of fascinating physical properties which arise from their intrinsic magnetism and the breaking of time-reversal symmetry. However, so far, few examples of intrinsic magnetic topological materials have been confirmed experimentally, which significantly hinder our comprehensive understanding of the abundant physical properties in this sy… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 9 pages, 4 figures

  15. arXiv:2308.15729  [pdf, other

    cs.CG math.NA

    Computing Geodesic Paths Encoding a Curvature Prior

    Authors: Da Chen, Jean-Marie Mirebeau, Minglei Shu, Laurent D. Cohen

    Abstract: In this paper, we introduce an efficient method for computing curves minimizing a variant of the Euler-Mumford elastica energy, with fixed endpoints and tangents at these endpoints, where the bending energy is enhanced with a user defined and data-driven scalar-valued term referred to as the curvature prior. In order to guarantee that the globally optimal curve is extracted, the proposed method in… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  16. arXiv:2306.17194  [pdf, other

    cs.CR cs.CL cs.LG

    On the Exploitability of Instruction Tuning

    Authors: Manli Shu, Jiongxiao Wang, Chen Zhu, Jonas Gei**, Chaowei Xiao, Tom Goldstein

    Abstract: Instruction tuning is an effective technique to align large language models (LLMs) with human intents. In this work, we investigate how an adversary can exploit instruction tuning by injecting specific instruction-following examples into the training data that intentionally changes the model's behavior. For example, an adversary can achieve content injection by injecting training examples that men… ▽ More

    Submitted 28 October, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera-ready (21 pages, 10 figures)

  17. arXiv:2306.13651  [pdf, other

    cs.CL cs.LG

    Bring Your Own Data! Self-Supervised Evaluation for Large Language Models

    Authors: Neel Jain, Khalid Saifullah, Yuxin Wen, John Kirchenbauer, Manli Shu, Aniruddha Saha, Micah Goldblum, Jonas Gei**, Tom Goldstein

    Abstract: With the rise of Large Language Models (LLMs) and their ubiquitous deployment in diverse domains, measuring language model behavior on realistic data is imperative. For example, a company deploying a client-facing chatbot must ensure that the model will not respond to client requests with profanity. Current evaluations approach this problem using small, domain-specific datasets with human-curated… ▽ More

    Submitted 29 June, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: Code is available at https://github.com/neelsjain/BYOD. First two authors contributed equally. 21 pages, 22 figures

  18. arXiv:2306.05802  [pdf, other

    cond-mat.str-el

    Static and dynamical properties of the spin-5/2 nearly ideal triangular lattice antiferromagnet Ba3MnSb2O9

    Authors: Mingfang Shu, Weicen Dong, **long Jiao, Jiangtao Wu, Gaoting lin, Tao Hong, Huibo Cao, Masaaki Matsuda, Wei Tian, Songxue Chi, Georg Ehlers, Zhongwen Ouyang, Hongwei Chen, Youming Zou, Zhe Qu, Qing Huang, Haidong Zhou, Yoshitomo Kamiya, Jie Ma

    Abstract: We study the ground state and spin excitations in Ba3MnSb2O9, an easy-plane S = 5/2 triangular lattice antiferromagnet. By combining single-crystal neutron scattering, electric spin resonance (ESR), and spin wave calculations, we determine the frustrated quasi-two-dimensional spin Hamiltonian parameters describing the material. While the material has a slight monoclinic structural distortion, whic… ▽ More

    Submitted 7 September, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  19. arXiv:2306.04634  [pdf, other

    cs.LG cs.CL cs.CR

    On the Reliability of Watermarks for Large Language Models

    Authors: John Kirchenbauer, Jonas Gei**, Yuxin Wen, Manli Shu, Khalid Saifullah, Kezhi Kong, Kasun Fernando, Aniruddha Saha, Micah Goldblum, Tom Goldstein

    Abstract: As LLMs become commonplace, machine-generated text has the potential to flood the internet with spam, social media bots, and valueless content. Watermarking is a simple and effective strategy for mitigating such harms by enabling the detection and documentation of LLM-generated text. Yet a crucial question remains: How reliable is watermarking in realistic settings in the wild? There, watermarked… ▽ More

    Submitted 1 May, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 9 pages in the main body. Published at ICLR 2024. Code is available at https://github.com/jwkirchenbauer/lm-watermarking

  20. arXiv:2304.09391  [pdf

    cs.CV cs.AI

    Inferring High-level Geographical Concepts via Knowledge Graph and Multi-scale Data Integration: A Case Study of C-shaped Building Pattern Recognition

    Authors: Zhiwei Wei, Yi Xiao, Wenjia Xu, Mi Shu, Lu Cheng, Yang Wang, Chunbo Liu

    Abstract: Effective building pattern recognition is critical for understanding urban form, automating map generalization, and visualizing 3D city models. Most existing studies use object-independent methods based on visual perception rules and proximity graph models to extract patterns. However, because human vision is a part-based system, pattern recognition may require decomposing shapes into parts or gro… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  21. arXiv:2301.02650  [pdf, other

    cs.CV

    Hierarchical Point Attention for Indoor 3D Object Detection

    Authors: Manli Shu, Le Xue, Ning Yu, Roberto Martín-Martín, Caiming Xiong, Tom Goldstein, Juan Carlos Niebles, Ran Xu

    Abstract: 3D object detection is an essential vision technique for various robotic systems, such as augmented reality and domestic robots. Transformers as versatile network architectures have recently seen great success in 3D point cloud object detection. However, the lack of hierarchy in a plain transformer restrains its ability to learn features at different scales. Such limitation makes transformer detec… ▽ More

    Submitted 8 May, 2024; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: ICRA 2024 camera-ready (7 pages, 5 figures)

  22. arXiv:2212.06727  [pdf, other

    cs.CV

    What do Vision Transformers Learn? A Visual Exploration

    Authors: Amin Ghiasi, Hamid Kazemi, Eitan Borgnia, Steven Reich, Manli Shu, Micah Goldblum, Andrew Gordon Wilson, Tom Goldstein

    Abstract: Vision transformers (ViTs) are quickly becoming the de-facto architecture for computer vision, yet we understand very little about why they work and what they learn. While existing studies visually analyze the mechanisms of convolutional neural networks, an analogous exploration of ViTs remains challenging. In this paper, we first address the obstacles to performing visualizations on ViTs. Assiste… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

  23. arXiv:2209.07511  [pdf, other

    cs.CV

    Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

    Authors: Manli Shu, Weili Nie, De-An Huang, Zhiding Yu, Tom Goldstein, Anima Anandkumar, Chaowei Xiao

    Abstract: Pre-trained vision-language models (e.g., CLIP) have shown promising zero-shot generalization in many downstream tasks with properly designed text prompts. Instead of relying on hand-engineered prompts, recent works learn prompts using the training data from downstream tasks. While effective, training on domain-specific data reduces a model's generalization capability to unseen new domains. In thi… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  24. arXiv:2208.07237  [pdf, ps, other

    cs.LG cs.AI

    Energy and Spectrum Efficient Federated Learning via High-Precision Over-the-Air Computation

    Authors: Liang Li, Chenpei Huang, Dian Shi, Hao Wang, Xiangwei Zhou, Minglei Shu, Miao Pan

    Abstract: Federated learning (FL) enables mobile devices to collaboratively learn a shared prediction model while kee** data locally. However, there are two major research challenges to practically deploy FL over mobile devices: (i) frequent wireless updates of huge size gradients v.s. limited spectrum resources, and (ii) energy-hungry FL communication and local computing during training v.s. battery-cons… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

  25. arXiv:2204.05575  [pdf, other

    cs.CV cs.AI

    DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection

    Authors: Haibao Yu, Yizhen Luo, Mao Shu, Yiyi Huo, Zebang Yang, Yifeng Shi, Zhenglong Guo, Hanyu Li, Xing Hu, Jirui Yuan, Zaiqing Nie

    Abstract: Autonomous driving faces great safety challenges for a lack of global perspective and the limitation of long-range perception capabilities. It has been widely agreed that vehicle-infrastructure cooperation is required to achieve Level 5 autonomy. However, there is still NO dataset from real scenarios available for computer vision researchers to work on vehicle-infrastructure cooperation-related pr… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: CVPR2022

  26. arXiv:2203.13608  [pdf, other

    cs.CV

    Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task

    Authors: Xiaoqing Ye, Mao Shu, Hanyu Li, Yifeng Shi, Yingying Li, Guangjie Wang, Xiao Tan, Errui Ding

    Abstract: Concurrent perception datasets for autonomous driving are mainly limited to frontal view with sensors mounted on the vehicle. None of them is designed for the overlooked roadside perception tasks. On the other hand, the data captured from roadside cameras have strengths over frontal-view data, which is believed to facilitate a safer and more intelligent autonomous driving system. To accelerate the… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: To appear in CVPR2022

  27. arXiv:2111.11180  [pdf

    cond-mat.mtrl-sci quant-ph

    Regulate the direct-indirect electronic band gap transition by electron-phonon interaction in BaSnO3

    Authors: Binru Zhao, Qing Huang, Jiangtao Wu, **long Jiao, Mingfang Shu, Gaoting Lin, Qiyang Sun, Ranran Zhang, Masato Hagihala, Shuki Torri, Guohua Wang, Qingyong Ren, Chen Li, Zhe Qu, Haidong Zhou, Jie Ma

    Abstract: The neutron powder diffraction, specific heat, thermal conductivity, and Raman scattering measurements were presented to study the interplays of lattice, phonons and electrons of the Sr-do** Ba1-xSrxSnO3 (x was less than or equal to 0.1). Although Ba1-xSrxSnO3 kept the cubic lattice, the Raman spectra suggested a dynamic distortion at low temperature. The density functional theory was applied to… ▽ More

    Submitted 11 April, 2022; v1 submitted 13 November, 2021; originally announced November 2021.

    Comments: 23 pages, 7 figures, 3 tables, no supplemental materials

  28. arXiv:2111.00794  [pdf, other

    cs.CV

    Geodesic Models with Convexity Shape Prior

    Authors: Da Chen, Jean-Marie Mirebeau, Minglei Shu, Xuecheng Tai, Laurent D. Cohen

    Abstract: The minimal geodesic models based on the Eikonal equations are capable of finding suitable solutions in various image segmentation scenarios. Existing geodesic-based segmentation approaches usually exploit image features in conjunction with geometric regularization terms, such as Euclidean curve length or curvature-penalized length, for computing geodesic curves. In this paper, we take into accoun… ▽ More

    Submitted 25 November, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: This paper has been accepted by TPAMI

  29. arXiv:2111.00637  [pdf, other

    cs.LG cs.DC

    To Talk or to Work: Delay Efficient Federated Learning over Mobile Edge Devices

    Authors: Pavana Prakash, Jiahao Ding, Maoqiang Wu, Minglei Shu, Rong Yu, Miao Pan

    Abstract: Federated learning (FL), an emerging distributed machine learning paradigm, in conflux with edge computing is a promising area with novel applications over mobile edge devices. In FL, since mobile devices collaborate to train a model based on their own data under the coordination of a central server by sharing just the model updates, training data is maintained private. However, without the centra… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: Accepted for publication in Globecom'21

  30. arXiv:2108.09641  [pdf, other

    eess.IV cs.CV

    Deep survival analysis with longitudinal X-rays for COVID-19

    Authors: Michelle Shu, Richard Strong Bowen, Charles Herrmann, Gengmo Qi, Michele Santacatterina, Ramin Zabih

    Abstract: Time-to-event analysis is an important statistical tool for allocating clinical resources such as ICU beds. However, classical techniques like the Cox model cannot directly incorporate images due to their high dimensionality. We propose a deep learning approach that naturally incorporates multiple, time-dependent imaging studies as well as non-imaging data into time-to-event analysis. Our techniqu… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

  31. arXiv:2108.04430  [pdf, other

    cs.CY cs.LG

    Enhancing Knowledge Tracing via Adversarial Training

    Authors: Xiaopeng Guo, Zhijie Huang, Jie Gao, Mingyu Shang, Mao**g Shu, Jun Sun

    Abstract: We study the problem of knowledge tracing (KT) where the goal is to trace the students' knowledge mastery over time so as to make predictions on their future performance. Owing to the good representation capacity of deep neural networks (DNNs), recent advances on KT have increasingly concentrated on exploring DNNs to improve the performance of KT. However, we empirically reveal that the DNNs based… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: Accepted by ACM MM 2021

  32. arXiv:2108.01335  [pdf, other

    cs.CV cs.LG

    Where do Models go Wrong? Parameter-Space Saliency Maps for Explainability

    Authors: Roman Levin, Manli Shu, Eitan Borgnia, Furong Huang, Micah Goldblum, Tom Goldstein

    Abstract: Conventional saliency maps highlight input features to which neural network predictions are highly sensitive. We take a different approach to saliency, in which we identify and analyze the network parameters, rather than inputs, which are responsible for erroneous decisions. We find that samples which cause similar parameters to malfunction are semantically similar. We also show that pruning the m… ▽ More

    Submitted 9 October, 2022; v1 submitted 3 August, 2021; originally announced August 2021.

  33. arXiv:2102.13262  [pdf, other

    cs.CV cs.LG cs.RO

    Improving Robustness of Learning-based Autonomous Steering Using Adversarial Images

    Authors: Yu Shen, Laura Zheng, Manli Shu, Weizi Li, Tom Goldstein, Ming C. Lin

    Abstract: For safety of autonomous driving, vehicles need to be able to drive under various lighting, weather, and visibility conditions in different environments. These external and environmental factors, along with internal factors associated with sensors, can pose significant challenges to perceptual data processing, hence affecting the decision-making and control of the vehicle. In this work, we address… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

  34. arXiv:2101.03625  [pdf

    q-fin.RM econ.GN q-fin.ST

    The 'COVID' Crash of the 2020 U.S. Stock Market

    Authors: Min Shu, Ruiqiang Song, Wei Zhu

    Abstract: We employed the log-periodic power law singularity (LPPLS) methodology to systematically investigate the 2020 stock market crash in the U.S. equities sectors with different levels of total market capitalizations through four major U.S. stock market indexes, including the Wilshire 5000 Total Market index, the S&P 500 index, the S&P MidCap 400 index, and the Russell 2000 index, representing the stoc… ▽ More

    Submitted 10 January, 2021; originally announced January 2021.

    Comments: 19 pages, 3 figures. arXiv admin note: text overlap with arXiv:2101.00327

  35. arXiv:2101.00327  [pdf

    q-fin.RM econ.GN q-fin.GN

    The 2020 Global Stock Market Crash: Endogenous or Exogenous?

    Authors: Ruiqiang Song, Min Shu, Wei Zhu

    Abstract: Starting on February 20, 2020, the global stock markets began to suffer the worst decline since the Great Recession in 2008, and the COVID-19 has been widely blamed on the stock market crashes. In this study, we applied the log-periodic power law singularity (LPPLS) methodology based on multilevel time series to unravel the underlying mechanisms of the 2020 global stock market crash by analyzing t… ▽ More

    Submitted 1 January, 2021; originally announced January 2021.

    Comments: 25 pages, 4 figures

  36. arXiv:2010.07334  [pdf, other

    cs.LG cs.CV

    Towards Accurate Quantization and Pruning via Data-free Knowledge Transfer

    Authors: Chen Zhu, Zheng Xu, Ali Shafahi, Manli Shu, Amin Ghiasi, Tom Goldstein

    Abstract: When large scale training data is available, one can obtain compact and accurate networks to be deployed in resource-constrained environments effectively through quantization and pruning. However, training data are often protected due to privacy concerns and it is challenging to obtain compact networks without data. We study data-free quantization and pruning by transferring knowledge from trained… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

  37. arXiv:2010.05210  [pdf, other

    cs.CV

    Generalized Few-shot Semantic Segmentation

    Authors: Zhuotao Tian, Xin Lai, Li Jiang, Shu Liu, Michelle Shu, Hengshuang Zhao, Jiaya Jia

    Abstract: Training semantic segmentation models requires a large amount of finely annotated data, making it hard to quickly adapt to novel classes not satisfying this condition. Few-Shot Segmentation (FS-Seg) tackles this problem with many constraints. In this paper, we introduce a new benchmark, called Generalized Few-Shot Semantic Segmentation (GFS-Seg), to analyze the generalization ability of simultaneo… ▽ More

    Submitted 31 May, 2022; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted to CVPR 2022

  38. arXiv:2009.08965  [pdf, other

    cs.CV cs.LG

    Encoding Robustness to Image Style via Adversarial Feature Perturbations

    Authors: Manli Shu, Zuxuan Wu, Micah Goldblum, Tom Goldstein

    Abstract: Adversarial training is the industry standard for producing models that are robust to small adversarial perturbations. However, machine learning practitioners need models that are robust to other kinds of changes that occur naturally, such as changes in the style or illumination of input images. Such changes in input distribution have been effectively modeled as shifts in the mean and variance of… ▽ More

    Submitted 31 October, 2021; v1 submitted 18 September, 2020; originally announced September 2020.

    Comments: NeurIPS 2021

  39. arXiv:2008.07290  [pdf

    cs.DC

    Commercial Cloud Computing for Connected Vehicle Applications in Transportation Cyber-Physical Systems

    Authors: Hsien-Wen Deng, Mizanur Rahman, Mashrur Chowdhury, M Sabbir Salek, Mitch Shue

    Abstract: This study focuses on the feasibility of commercial cloud services for connected vehicle (CV) applications in a Transportation Cyber-Physical Systems (TCPS) environment. TCPS implies that CVs, in addition to being connected with each other, communicates with the transportation and computing infrastructure to fulfill application requirements. The motivation of this study is to accelerate commercial… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: 15 pages, 9 figures

  40. Geodesic Paths for Image Segmentation with Implicit Region-based Homogeneity Enhancement

    Authors: Da Chen, Jian Zhu, Xinxin Zhang, Minglei Shu, Laurent D. Cohen

    Abstract: Minimal paths are regarded as a powerful and efficient tool for boundary detection and image segmentation due to its global optimality and the well-established numerical solutions such as fast marching method. In this paper, we introduce a flexible interactive image segmentation model based on the Eikonal partial differential equation (PDE) framework in conjunction with region-based homogeneity en… ▽ More

    Submitted 6 May, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: Published in IEEE Trans. Image Processing

  41. arXiv:2008.01449  [pdf, other

    cs.CV

    Prior Guided Feature Enrichment Network for Few-Shot Segmentation

    Authors: Zhuotao Tian, Hengshuang Zhao, Michelle Shu, Zhicheng Yang, Ruiyu Li, Jiaya Jia

    Abstract: State-of-the-art semantic segmentation methods require sufficient labeled data to achieve good results and hardly work on unseen classes without fine-tuning. Few-shot segmentation is thus proposed to tackle this problem by learning a model that quickly adapts to new classes with a few labeled support samples. Theses frameworks still face the challenge of generalization ability reduction on unseen… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: 16 pages. To appear in TPAMI

  42. A Generalized Asymmetric Dual-front Model for Active Contours and Image Segmentation

    Authors: Da Chen, Jack Spencer, Jean-Marie Mirebeau, Ke Chen, Minglei Shu, Laurent D. Cohen

    Abstract: The Voronoi diagram-based dual-front active contour models are known as a powerful and efficient way for addressing the image segmentation and domain partitioning problems. In the basic formulation of the dual-front models, the evolving contours can be considered as the interfaces of adjacent Voronoi regions. Among these dual-front models, a crucial ingredient is regarded as the geodesic metrics b… ▽ More

    Submitted 4 May, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Published in IEEE Transactions on Image Processing

  43. arXiv:2006.06669  [pdf, other

    cs.CV

    Understanding Human Hands in Contact at Internet Scale

    Authors: Dandan Shan, Jiaqi Geng, Michelle Shu, David F. Fouhey

    Abstract: Hands are the central means by which humans manipulate their world and being able to reliably extract hand state information from Internet videos of humans engaged in their hands has the potential to pave the way to systems that can learn from petabytes of video data. This paper proposes steps towards this by inferring a rich representation of hands engaged in interaction method that includes: han… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: To appear at CVPR 2020 (Oral). Project and dataset webpage: http://fouheylab.eecs.umich.edu/~dandans/projects/100DOH/

  44. arXiv:2005.07343  [pdf, other

    eess.IV cs.CV

    Visual Perception Model for Rapid and Adaptive Low-light Image Enhancement

    Authors: Xiaoxiao Li, Xiaopeng Guo, Liye Mei, Mingyu Shang, Jie Gao, Mao**g Shu, Xiang Wang

    Abstract: Low-light image enhancement is a promising solution to tackle the problem of insufficient sensitivity of human vision system (HVS) to perceive information in low light environments. Previous Retinex-based works always accomplish enhancement task by estimating light intensity. Unfortunately, single light intensity modelling is hard to accurately simulate visual perception information, leading to th… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: Due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract here is shorter than that in the PDF file

  45. Headless Horseman: Adversarial Attacks on Transfer Learning Models

    Authors: Ahmed Abdelkader, Michael J. Curry, Liam Fowl, Tom Goldstein, Avi Schwarzschild, Manli Shu, Christoph Studer, Chen Zhu

    Abstract: Transfer learning facilitates the training of task-specific classifiers using pre-trained models as feature extractors. We present a family of transferable adversarial attacks against such classifiers, generated without access to the classification head; we call these \emph{headless attacks}. We first demonstrate successful transfer attacks against a victim network using \textit{only} its feature… ▽ More

    Submitted 19 April, 2020; originally announced April 2020.

    Comments: 5 pages, 2 figures. Accepted in ICASSP 2020. Code available on https://github.com/zhuchen03/headless-attack.git

  46. arXiv:2003.03710  [pdf, other

    cs.CV

    Trajectory Grou** with Curvature Regularization for Tubular Structure Tracking

    Authors: Li Liu, Da Chen, Minglei Shu, Baosheng Li, Huazhong Shu, Michel Paques, Laurent D. Cohen

    Abstract: Tubular structure tracking is a crucial task in the fields of computer vision and medical image analysis. The minimal paths-based approaches have exhibited their strong ability in tracing tubular structures, by which a tubular structure can be naturally modeled as a minimal geodesic path computed with a suitable geodesic metric. However, existing minimal paths-based tracing approaches still suffer… ▽ More

    Submitted 8 December, 2021; v1 submitted 7 March, 2020; originally announced March 2020.

  47. arXiv:1911.11230  [pdf, other

    cs.CV cs.LG

    Identifying Model Weakness with Adversarial Examiner

    Authors: Michelle Shu, Chenxi Liu, Weichao Qiu, Alan Yuille

    Abstract: Machine learning models are usually evaluated according to the average case performance on the test set. However, this is not always ideal, because in some sensitive domains (e.g. autonomous driving), it is the worst case performance that matters more. In this paper, we are interested in systematic exploration of the input data space to identify the weakness of the model to be evaluated. We propos… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: To appear in AAAI-20

  48. arXiv:1906.11443  [pdf, other

    cs.CV

    Region Refinement Network for Salient Object Detection

    Authors: Zhuotao Tian, Hengshuang Zhao, Michelle Shu, Jiaze Wang, Ruiyu Li, Xiaoyong Shen, Jiaya Jia

    Abstract: Albeit intensively studied, false prediction and unclear boundaries are still major issues of salient object detection. In this paper, we propose a Region Refinement Network (RRN), which recurrently filters redundant information and explicitly models boundary information for saliency detection. Different from existing refinement methods, we propose a Region Refinement Module (RRM) that optimizes s… ▽ More

    Submitted 9 October, 2022; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: Tech report

  49. arXiv:1906.03337  [pdf

    cs.AI

    Extension of Rough Set Based on Positive Transitive Relation

    Authors: Min Shu, Wei Zhu

    Abstract: The application of rough set theory in incomplete information systems is a key problem in practice since missing values almost always occur in knowledge acquisition due to the error of data measuring, the limitation of data collection, or the limitation of data comprehension, etc. An incomplete information system is mainly processed by compressing the indiscernibility relation. The existing rough… ▽ More

    Submitted 13 June, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: 9 pages

  50. arXiv:1905.09647  [pdf

    q-fin.ST q-fin.RM stat.AP

    Real-time Prediction of Bitcoin Bubble Crashes

    Authors: Min Shu, Wei Zhu

    Abstract: In the past decade, Bitcoin as an emerging asset class has gained widespread public attention because of their extraordinary returns in phases of extreme price growth and their unpredictable massive crashes. We apply the log-periodic power law singularity (LPPLS) confidence indicator as a diagnostic tool for identifying bubbles using the daily data on Bitcoin price in the past two years. We find t… ▽ More

    Submitted 13 June, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: 25 pages, 5 figures

    MSC Class: 91G70