Skip to main content

Showing 251–300 of 1,301 results for author: Wang, A

.
  1. arXiv:2307.06873  [pdf, other

    math.OC

    Sharpness and well-conditioning of nonsmooth convex formulations in statistical signal recovery

    Authors: Lijun Ding, Alex L. Wang

    Abstract: We study a sample complexity vs. conditioning tradeoff in modern signal recovery problems where convex optimization problems are built from sampled observations. We begin by introducing a set of condition numbers related to sharpness in $\ell_p$ or Schatten-p norms ($p\in[1,2]$) based on nonsmooth reformulations of a class of convex optimization problems, including sparse recovery, low-rank matrix… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  2. arXiv:2307.06567  [pdf, other

    cond-mat.mes-hall

    A Versatile Method of Engineering the Electron Wavefunction of Hybrid Quantum Devices

    Authors: Guoan Li, Guang Yang, Ting Lin, M. Rossi, G. Badawy, Zhiyuan Zhang, Xiaofan Shi, Jiayu Shi, Degui Qian, Fang Lu, Lin Gu, An-Qi Wang, Zhaozheng Lyu, Guangtong Liu, Fanming Qu, Ziwei Dou, Qinghua Zhang, E. P. A. M. Bakkers, M. P. Nowak, P. Wójcik, Li Lu, Jie Shen

    Abstract: With the development of quantum technology, hybrid devices that combine superconductors (S) and semiconductors (Sm) have attracted great attention due to the possibility of engineering structures that benefit from the integration of the properties of both materials. However, until now, none of the experiments have reported good control of band alignment at the interface, which determines the stren… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 18 pages, 9 figures

  3. arXiv:2307.05898  [pdf, other

    cs.CV

    Rectifying Noisy Labels with Sequential Prior: Multi-Scale Temporal Feature Affinity Learning for Robust Video Segmentation

    Authors: Beilei Cui, Minqing Zhang, Mengya Xu, An Wang, Wu Yuan, Hongliang Ren

    Abstract: Noisy label problems are inevitably in existence within medical image segmentation causing severe performance degradation. Previous segmentation methods for noisy label problems only utilize a single image while the potential of leveraging the correlation between images has been overlooked. Especially for video segmentation, adjacent frames contain rich contextual information beneficial in cognizi… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI 2023

  4. arXiv:2307.05468  [pdf, other

    cs.CV

    My3DGen: A Scalable Personalized 3D Generative Model

    Authors: Luchao Qi, Jiaye Wu, Annie N. Wang, Shengze Wang, Roni Sengupta

    Abstract: In recent years, generative 3D face models (e.g., EG3D) have been developed to tackle the problem of synthesizing photo-realistic faces. However, these models are often unable to capture facial features unique to each individual, highlighting the importance of personalization. Some prior works have shown promise in personalizing generative face models, but these studies primarily focus on 2D setti… ▽ More

    Submitted 20 May, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Project page: https://luchaoqi.com/my3dgen/

  5. arXiv:2307.03012  [pdf, other

    gr-qc astro-ph.HE hep-th

    A Kerr-Newman-MOG black hole's impact on the magnetic reconnection

    Authors: Sanjar Shaymatov, Mirzabek Alloqulov, Bobomurat Ahmedov, Anzhong Wang

    Abstract: In this paper, we study the magnetic reconnection process of energy extraction from a rapidly rotating Kerr-Newman-MOG black hole by investigating the combined effect of black hole charge and the MOG parameter. We explore the energy efficiency of energy extraction and power by applying the new energy extraction mechanism proposed by Comisso and Asenjo. Based on an attractive gravitational charge o… ▽ More

    Submitted 12 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 15 pages, one table, 7 captioned figures. Some inaccurate statements corrected, the result remains unaltered

  6. arXiv:2307.02626  [pdf, ps, other

    cs.DB cs.AI

    Real-time Workload Pattern Analysis for Large-scale Cloud Databases

    Authors: Jiaqi Wang, Tianyi Li, Anni Wang, Xiaoze Liu, Lu Chen, Jie Chen, Jianye Liu, Junyang Wu, Feifei Li, Yunjun Gao

    Abstract: Hosting database services on cloud systems has become a common practice. This has led to the increasing volume of database workloads, which provides the opportunity for pattern analysis. Discovering workload patterns from a business logic perspective is conducive to better understanding the trends and characteristics of the database system. However, existing workload pattern discovery systems are… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Proceedings of the VLDB Volume 16 (VLDB 2023)

  7. arXiv:2307.02452  [pdf, other

    eess.IV cs.CV cs.RO

    LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion

    Authors: Long Bai, Tong Chen, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren

    Abstract: Wireless capsule endoscopy (WCE) is a painless and non-invasive diagnostic tool for gastrointestinal (GI) diseases. However, due to GI anatomical constraints and hardware manufacturing limitations, WCE vision signals may suffer from insufficient illumination, leading to a complicated screening and examination procedure. Deep learning-based low-light image enhancement (LLIE) in the medical field gr… ▽ More

    Submitted 22 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: To appear in MICCAI 2023. Code availability: https://github.com/longbai1006/LLCaps

  8. arXiv:2307.02052  [pdf

    stat.ME

    Replicability of Simulation Studies for the Investigation of Statistical Methods: The RepliSims Project

    Authors: K. Luijken, A. Lohmann, U. Alter, J. Claramunt Gonzalez, F. J. Clouth, J. L. Fossum, L. Hesen, A. H. J. Huizing, J. Ketelaar, A. K. Montoya, L. Nab, R. C. C. Nijman, B. B. L. Penning de Vries, T. D. Tibbe, Y. A. Wang, R. H. H. Groenwold

    Abstract: Results of simulation studies evaluating the performance of statistical methods are often considered actionable and thus can have a major impact on the way empirical research is implemented. However, so far there is limited evidence about the reproducibility and replicability of statistical simulation studies. Therefore, eight highly cited statistical simulation studies were selected, and their re… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 36 pages, 0 figures

  9. arXiv:2306.16752  [pdf, other

    astro-ph.IM

    Rapid FRD determination for multiplexed fibre systems -- I. The quasi-near field model and its uncertainties

    Authors: Weimin Sun, Xudong Chen, Jiabin Wang, Hang Jiang, Anzhi Wang, Qi Yan, Zhenyu Ma, Shengjia Wang, Tao Geng, Yue Zhong, Zhongquan Qu, Yunxiang Yan

    Abstract: Focal Ratio Degradation (FRD) in fibres is a crucial factor to control in astronomical instruments in order to minimize light loss. As astronomical instrumentation has advanced, the integration of large populations of fibres has become common. However, determining FRD in multiplexed fibre systems has become a challenging and time-consuming task. The Integral Field Unit for the Fiber Arrayed Solar… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 10 pages, 12 figures, submitted to MNRAS

  10. arXiv:2306.16285  [pdf, other

    eess.IV cs.CV

    Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Hongliang Ren

    Abstract: Despite their impressive performance in various surgical scene understanding tasks, deep learning-based methods are frequently hindered from deploying to real-world surgical applications for various causes. Particularly, data collection, annotation, and domain shift in-between sites and patients are the most common obstacles. In this work, we mitigate data-related issues by efficiently leveraging… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: First two authors contributed equally. Accepted by IROS2023

  11. arXiv:2306.15691  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Clean BN encapsulated 2D FETs with lithography compatible contacts

    Authors: Binxi Liang, Anjian Wang, Jian Zhou, Shihao Ju, Jian Chen, Kenji Watanabe, Takashi Taniguchi, Yi Shi, Songlin Li

    Abstract: Device passivation through ultraclean hexagonal BN encapsulation is proven one of the most effective ways for constructing high-quality devices with atomically thin semiconductors that preserves the ultraclean interface quality and intrinsic charge transport behavior. However, it remains challenging to integrate lithography compatible contact electrodes with flexible distributions and patterns. He… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 17 pages, 4 figures

    Journal ref: ACS Applied Materials & Interfaces, 14, 18697 (2022)

  12. arXiv:2306.13794  [pdf, other

    stat.ML cs.LG

    Tensor Dirichlet Process Multinomial Mixture Model for Passenger Trajectory Clustering

    Authors: Ziyue Li, Hao Yan, Chen Zhang, Andi Wang, Wolfgang Ketter, Lijun Sun, Fugee Tsung

    Abstract: Passenger clustering based on travel records is essential for transportation operators. However, existing methods cannot easily cluster the passengers due to the hierarchical structure of the passenger trip information, namely: each passenger has multiple trips, and each trip contains multi-dimensional multi-mode information. Furthermore, existing approaches rely on an accurate specification of th… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: Under Review of Transportation Research Part C: Emerging Technologies

  13. arXiv:2306.12109  [pdf, other

    eess.IV cs.CV

    DiffuseIR:Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images

    Authors: Mingjie Pan, Yulu Gan, Fangxu Zhou, Jiaming Liu, Aimin Wang, Shanghang Zhang, Dawei Li

    Abstract: Three-dimensional microscopy is often limited by anisotropic spatial resolution, resulting in lower axial resolution than lateral resolution. Current State-of-The-Art (SoTA) isotropic reconstruction methods utilizing deep neural networks can achieve impressive super-resolution performance in fixed imaging settings. However, their generality in practical use is limited by degraded performance cause… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  14. arXiv:2306.11565  [pdf, other

    cs.RO cs.AI cs.CV

    HomeRobot: Open-Vocabulary Mobile Manipulation

    Authors: Sriram Yenamandra, Arun Ramachandran, Karmesh Yadav, Austin Wang, Mukul Khanna, Theophile Gervet, Tsung-Yen Yang, Vidhi Jain, Alexander William Clegg, John Turner, Zsolt Kira, Manolis Savva, Angel Chang, Devendra Singh Chaplot, Dhruv Batra, Roozbeh Mottaghi, Yonatan Bisk, Chris Paxton

    Abstract: HomeRobot (noun): An affordable compliant robot that navigates homes and manipulates a wide range of objects in order to complete everyday tasks. Open-Vocabulary Mobile Manipulation (OVMM) is the problem of picking any object in any unseen environment, and placing it in a commanded location. This is a foundational challenge for robots to be useful assistants in human environments, because it invol… ▽ More

    Submitted 10 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 37 pages, 22 figures, 8 tables

  15. arXiv:2306.09285  [pdf

    cond-mat.mes-hall cond-mat.other

    Quantum metric-induced nonlinear transport in a topological antiferromagnet

    Authors: Naizhou Wang, Daniel Kaplan, Zhaowei Zhang, Tobias Holder, Ning Cao, Aifeng Wang, Xiaoyuan Zhou, Feifei Zhou, Zhengzhi Jiang, Chusheng Zhang, Shihao Ru, Hongbing Cai, Kenji Watanabe, Takashi Taniguchi, Binghai Yan, Weibo Gao

    Abstract: The Berry curvature and quantum metric are the imaginary part and real part, respectively, of the quantum geometric tensor which characterizes the topology of quantum states. The former is known to generate a zoo of important discoveries such as quantum Hall effect and anomalous Hall effect (AHE), while the consequences of the quantum metric have rarely been probed by transport. In this work, we o… ▽ More

    Submitted 1 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 23 pages, 6 figures for the manuscript; Supplementary information included

    Journal ref: Nature (2023)

  16. arXiv:2306.08997   

    cs.CL cs.AI cs.LG

    Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models

    Authors: Sarah J. Zhang, Samuel Florin, Ariel N. Lee, Eamon Niknafs, Andrei Marginean, Annie Wang, Keith Tyser, Zad Chin, Yann Hicke, Nikhil Singh, Madeleine Udell, Yoon Kim, Tonio Buonassisi, Armando Solar-Lezama, Iddo Drori

    Abstract: We curate a comprehensive dataset of 4,550 questions and solutions from problem sets, midterm exams, and final exams across all MIT Mathematics and Electrical Engineering and Computer Science (EECS) courses required for obtaining a degree. We evaluate the ability of large language models to fulfill the graduation requirements for any MIT major in Mathematics and EECS. Our results demonstrate that… ▽ More

    Submitted 24 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Did not receive permission to release the data or model fine-tuned on the data

  17. arXiv:2306.08478  [pdf, other

    cond-mat.mes-hall

    Interfering Josephson diode effect and magnetochiral anisotropy in Ta2Pd3Te5 asymmetric edge interferometer

    Authors: Yupeng Li, Dayu Yan, Yu Hong, Haohao Sheng, Anqi Wang, Ziwei Dou, Xingchen Guo, Xiaofan Shi, Zikang Su, Zhaozheng Lyu, Tian Qian, Guangtong Liu, Fanming Qu, Kun Jiang, Zhijun Wang, Youguo Shi, Zhu-An Xu, Jiang** Hu, Li Lu, Jie Shen

    Abstract: Edge states in topological systems have attracted great interest due to their robustness and linear dispersions. Here a superconducting-proximitized edge interferometer is engineered on a topological insulator Ta2Pd3Te5 with asymmetric edges to realize the interfering Josephson diode effect (JDE), which hosts many advantages, such as the high efficiency as much as 73% at tiny applied magnetic fiel… ▽ More

    Submitted 2 June, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 29 pages,21 figures

  18. arXiv:2306.08343  [pdf

    stat.AP

    A Unified Probabilistic Framework for Spatiotemporal Passenger Crowdedness Inference within Urban Rail Transit Network

    Authors: Min Jiang, Andi Wang, Ziyue Li, Fugee Tsung

    Abstract: This paper proposes the Spatio-Temporal Crowdedness Inference Model (STCIM), a framework to infer the passenger distribution inside the whole urban rail transit (URT) system in real-time. Our model is practical since the model is designed in a probabilistic manner and only based on the entry and exit timestamps information collected by the automatic fare collection (AFC) system. Firstly, the entir… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted to IEEE CASE 2023

  19. arXiv:2306.08339  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Fermi Surface Evolution and Anomalous Hall Effect in an Ideal Type-II Weyl Semimetal

    Authors: Qianni Jiang, Johanna C. Palmstrom, John Singleton, Shalinee Chikara, David Graf, Chong Wang, Yue Shi, Paul Malinowski, Aaron Wang, Zhong Lin, Lingnan Shen, Xiaodong Xu, Di Xiao, Jiun-Haw Chu

    Abstract: Weyl semimetals (WSMs) are three-dimensional topological materials that exhibit fascinating properties due to the presence of Weyl nodes in their band structure. However, existing WSMs discovered so far often possess multiple pairs of Weyl nodes, posing a challenge in disentangling the contributions to transport phenomena from different energy bands. To overcome this challenge, we have identified… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  20. arXiv:2306.08103  [pdf, other

    cs.CV

    Generating Images with 3D Annotations Using Diffusion Models

    Authors: Wufei Ma, Qihao Liu, Jiahao Wang, Angtian Wang, Xiaoding Yuan, Yi Zhang, Zihao Xiao, Guofeng Zhang, Beijia Lu, Ruxiao Duan, Yongrui Qi, Adam Kortylewski, Yaoyao Liu, Alan Yuille

    Abstract: Diffusion models have emerged as a powerful generative method, capable of producing stunning photo-realistic images from natural language descriptions. However, these models lack explicit control over the 3D structure in the generated images. Consequently, this hinders our ability to obtain detailed 3D annotations for the generated images or to craft instances with specific poses and distances. In… ▽ More

    Submitted 3 April, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: ICLR 2024 Spotlight. Code: https://ccvl.jhu.edu/3D-DST/

  21. arXiv:2306.05984  [pdf

    q-bio.PE

    Noncoding RNAs evolutionarily extend animal lifespan

    Authors: Anyou Wang

    Abstract: The mechanisms underlying lifespan evolution in organisms have long been mysterious. However, recent studies have demonstrated that organisms evolutionarily gain noncoding RNAs (ncRNAs) that carry endogenous profound functions in higher organisms, including lifespan. This study unveils ncRNAs as crucial drivers driving animal lifespan evolution. Species in the animal kingdom evolutionarily increas… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 13 pages and 4 figures

  22. arXiv:2306.03622  [pdf, other

    cs.DC

    FaaSwap: SLO-Aware, GPU-Efficient Serverless Inference via Model Swap**

    Authors: Minchen Yu, Ao Wang, Dong Chen, Haoxuan Yu, Xiaonan Luo, Zhuohao Li, Wei Wang, Ruichuan Chen, Dapeng Nie, Haoran Yang

    Abstract: Serverless computing has become increasingly popular for machine learning inference. However, current serverless platforms lack efficient support for GPUs, limiting their ability to deliver low-latency inference. In this paper, we propose FaaSwap, a GPU-efficient serverless inference platform. FaaSwap employs a holistic approach to system and algorithm design. It maintains models in main memory an… ▽ More

    Submitted 8 February, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

  23. arXiv:2306.03511  [pdf, other

    eess.IV cs.CV

    Curriculum-Based Augmented Fourier Domain Adaptation for Robust Medical Image Segmentation

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Hongliang Ren

    Abstract: Accurate and robust medical image segmentation is fundamental and crucial for enhancing the autonomy of computer-aided diagnosis and intervention systems. Medical data collection normally involves different scanners, protocols, and populations, making domain adaptation (DA) a highly demanding research field to alleviate model degradation in the deployment site. To preserve the model performance ac… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Work under review. First three authors contributed equally

  24. Combined analysis of the $γn \to K^0Σ^0$ and $γn \to K^+Σ^-$ reactions

    Authors: Neng-Chang Wei, Ai-Chao Wang, Fei Huang

    Abstract: The recently released data on differential cross sections for $γn \to K^0Σ^0$ from the A2 and BGOOD Collaborations are used to examine the theoretical model constructed in our previous work [Phys. Rev. D \textbf{105}, 094017 (2022)] for $γn \to K^+Σ^-$, and it is found that the model predictions are able to qualitatively reproduce the A2 data but fail to describe the BGOOD data. Then, a combined a… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 14 pages,17 figures; Accepted for publication in Physical Review D

  25. arXiv:2306.00451  [pdf, other

    eess.IV cs.CV

    S$^2$ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation

    Authors: An Wang, Mengya Xu, Yang Zhang, Mobarakol Islam, Hongliang Ren

    Abstract: Fully-supervised polyp segmentation has accomplished significant triumphs over the years in advancing the early diagnosis of colorectal cancer. However, label-efficient solutions from weak supervision like scribbles are rarely explored yet primarily meaningful and demanding in medical practice due to the expensiveness and scarcity of densely-annotated polyp data. Besides, various deployment issues… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: MICCAI 2023 Early Acceptance

  26. arXiv:2306.00118  [pdf, other

    cs.CV

    Neural Textured Deformable Meshes for Robust Analysis-by-Synthesis

    Authors: Angtian Wang, Wufei Ma, Alan Yuille, Adam Kortylewski

    Abstract: Human vision demonstrates higher robustness than current AI algorithms under out-of-distribution scenarios. It has been conjectured such robustness benefits from performing analysis-by-synthesis. Our paper formulates triple vision tasks in a consistent manner using approximate analysis-by-synthesis by render-and-compare algorithms on neural features. In this work, we introduce Neural Textured Defo… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  27. arXiv:2305.20087  [pdf, other

    cs.CV

    Too Large; Data Reduction for Vision-Language Pre-Training

    Authors: Alex **peng Wang, Kevin Qinghong Lin, David Junhao Zhang, Stan Weixian Lei, Mike Zheng Shou

    Abstract: This paper examines the problems of severe image-text misalignment and high redundancy in the widely-used large-scale Vision-Language Pre-Training (VLP) datasets. To address these issues, we propose an efficient and straightforward Vision-Language learning algorithm called TL;DR, which aims to compress the existing large VLP data into a small, high-quality set. Our approach consists of two major s… ▽ More

    Submitted 18 August, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: ICCV2023. Code: https://github.com/showlab/datacentric.vlp

  28. arXiv:2305.17663  [pdf, other

    cs.CL

    Lexical Retrieval Hypothesis in Multimodal Context

    Authors: Po-Ya Angela Wang, Pin-Er Chen, Hsin-Yu Chou, Yu-Hsiang Tseng, Shu-Kai Hsieh

    Abstract: Multimodal corpora have become an essential language resource for language science and grounded natural language processing (NLP) systems due to the growing need to understand and interpret human communication across various channels. In this paper, we first present our efforts in building the first Multimodal Corpus for Languages in Taiwan (MultiMoco). Based on the corpus, we conduct a case study… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  29. arXiv:2305.16124  [pdf, other

    cs.CV

    Robust Category-Level 3D Pose Estimation from Synthetic Data

    Authors: Jiahao Yang, Wufei Ma, Angtian Wang, Xiaoding Yuan, Alan Yuille, Adam Kortylewski

    Abstract: Obtaining accurate 3D object poses is vital for numerous computer vision applications, such as 3D reconstruction and scene understanding. However, annotating real-world objects is time-consuming and challenging. While synthetically generated training data is a viable alternative, the domain shift between real and synthetic data is a significant challenge. In this work, we aim to narrow the perform… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  30. Black hole scalarizations induced by parity violations

    Authors: Hao-Jie Lin, Tao Zhu, Shao-Jun Zhang, Anzhong Wang

    Abstract: It is well-known that parity symmetry is broken in the weak interaction but conserved for Einstein's general relativity and Maxwell's electromagnetic theory. Nevertheless, parity symmetry could also be violated in the gravitational/electromagnetic sectors if a fundamental scalar field couples to the parity-violating gravitational/electromagnetic curvature terms. Such parity-violating terms, which… ▽ More

    Submitted 27 July, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 9 pages, 3 figures, 1 table

    Journal ref: Phys. Rev. D 108, 044005 (2023)

  31. arXiv:2305.14767  [pdf

    stat.ME

    Interpretation and visualization of distance covariance through additive decomposition of correlations formula

    Authors: Andi Wang, Hao Yan, Juan Du

    Abstract: Distance covariance is a widely used statistical methodology for testing the dependency between two groups of variables. Despite the appealing properties of consistency and superior testing power, the testing results of distance covariance are often hard to be interpreted. This paper presents an elementary interpretation of the mechanism of distance covariance through an additive decomposition of… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  32. arXiv:2305.14668  [pdf, other

    cs.CV

    Robust 3D-aware Object Classification via Discriminative Render-and-Compare

    Authors: Artur Jesslen, Guofeng Zhang, Angtian Wang, Alan Yuille, Adam Kortylewski

    Abstract: In real-world applications, it is essential to jointly estimate the 3D object pose and class label of objects, i.e., to perform 3D-aware classification.While current approaches for either image classification or pose estimation can be extended to 3D-aware classification, we observe that they are inherently limited: 1) Their performance is much lower compared to the respective single-task models, a… ▽ More

    Submitted 5 June, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  33. arXiv:2305.14616  [pdf, other

    cs.CL cs.CV

    Exploring Affordance and Situated Meaning in Image Captions: A Multimodal Analysis

    Authors: Pin-Er Chen, Po-Ya Angela Wang, Hsin-Yu Chou, Yu-Hsiang Tseng, Shu-Kai Hsieh

    Abstract: This paper explores the grounding issue regarding multimodal semantic representation from a computational cognitive-linguistic view. We annotate images from the Flickr30k dataset with five perceptual properties: Affordance, Perceptual Salience, Object Number, Gaze Cueing, and Ecological Niche Association (ENA), and examine their association with textual elements in the image captions. Our findings… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 10 pages, 9 figures

  34. arXiv:2305.13268  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Spin-phonon scattering-induced low thermal conductivity in a van der Waals layered ferromagnet Cr$_2$Si$_2$Te$_6$

    Authors: Kunya Yang, Hong Wu, Zefang Li, Chen Ran, Xiao Wang, Fengfeng Zhu, Xiangnan Gong, Yan Liu, Guiwen Wang, Long Zhang, Xinrun Mi, Aifeng Wang, Yisheng Chai, Yixi Su, Wenhong Wang, Mingquan He, Xiaolong Yang, Xiaoyuan Zhou

    Abstract: Layered van der Waals (vdW) magnets are prominent playgrounds for develo** magnetoelectric, magneto-optic and spintronic devices. In spintronics, particularly in spincaloritronic applications, low thermal conductivity ($κ$) is highly desired. Here, by combining thermal transport measurements with density functional theory calculations, we demonstrate low $κ$ down to 1 W m$^{-1}$ K$^{-1}$ in a ty… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 14 pages, 6 figures, accepted for publication in Advanced Functional Materials

    Journal ref: Adv. Funct. Mater. 2302191 (2023)

  35. arXiv:2305.12726  [pdf, other

    cs.CV cs.CL cs.MM

    Towards Explainable In-the-Wild Video Quality Assessment: A Database and a Language-Prompted Approach

    Authors: Haoning Wu, Erli Zhang, Liang Liao, Chaofeng Chen, **gwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

    Abstract: The proliferation of in-the-wild videos has greatly expanded the Video Quality Assessment (VQA) problem. Unlike early definitions that usually focus on limited distortion types, VQA on in-the-wild videos is especially challenging as it could be affected by complicated factors, including various distortions and diverse contents. Though subjective studies have collected overall quality scores for th… ▽ More

    Submitted 3 August, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Proceedings of the 31st ACM International Conference on Multimedia (MM '23)

  36. arXiv:2305.09617  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Expert-Level Medical Question Answering with Large Language Models

    Authors: Karan Singhal, Tao Tu, Juraj Gottweis, Rory Sayres, Ellery Wulczyn, Le Hou, Kevin Clark, Stephen Pfohl, Heather Cole-Lewis, Darlene Neal, Mike Schaekermann, Amy Wang, Mohamed Amin, Sami Lachgar, Philip Mansfield, Sushant Prakash, Bradley Green, Ewa Dominowska, Blaise Aguera y Arcas, Nenad Tomasev, Yun Liu, Renee Wong, Christopher Semturs, S. Sara Mahdavi, Joelle Barral , et al. (6 additional authors not shown)

    Abstract: Recent artificial intelligence (AI) systems have reached milestones in "grand challenges" ranging from Go to protein-folding. The capability to retrieve medical knowledge, reason over it, and answer medical questions comparably to physicians has long been viewed as one such grand challenge. Large language models (LLMs) have catalyzed significant progress in medical question answering; Med-PaLM w… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  37. arXiv:2305.07152  [pdf, other

    cs.CV

    Surgical tool classification and localization: results and methods from the MICCAI 2022 SurgToolLoc challenge

    Authors: Aneeq Zia, Kiran Bhattacharyya, Xi Liu, Max Berniker, Ziheng Wang, Rogerio Nespolo, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, Bo Liu, David Austin, Yiheng Wang, Michal Futrega, Jean-Francois Puget, Zhenqiang Li, Yoichi Sato, Ryo Fujii, Ryo Hachiuma, Mana Masuda, Hideo Saito, An Wang, Mengya Xu, Mobarakol Islam, Long Bai, Winnie Pang , et al. (46 additional authors not shown)

    Abstract: The ability to automatically detect and track surgical instruments in endoscopic videos can enable transformational interventions. Assessing surgical performance and efficiency, identifying skilled tool use and choreography, and planning operational and logistical aspects of OR resources are just a few of the applications that could benefit. Unfortunately, obtaining the annotations needed to train… ▽ More

    Submitted 31 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  38. arXiv:2305.05861  [pdf

    q-bio.GN

    Template-based eukaryotic genome editing directed by SviCas3

    Authors: Wang-Yu Tong, Yong Li, Shou-Dong Ye, An-**g Wang, Yan-Yan Tang, Mei-Li Li, Zhong-Fan Yu, Ting-Ting Xia, Qing-Yang Liu, Si-Qi Zhu

    Abstract: RNA-guided gene editing based on the CRISPR-Cas system is currently the most effective genome editing technique. Here, we report that the SviCas3 from the subtype I-B-Svi Cas system in Streptomyces virginiae IBL14 is an RNA-guided and DNA-guided DNA endonuclease suitable for the HDR-directed gene and/or base editing of eukaryotic cell genomes. The genome editing efficiency of SviCas3 guided by DNA… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 113 pages, 12 figures and 4 tables

  39. arXiv:2305.03925  [pdf, other

    q-bio.MN

    Structure-Function Dynamics Hybrid Modeling: RNA Degradation

    Authors: Hua Zheng, Wei Xie, Paul Whitford, Ailun Wang, Chunsheng Fang, Wandi Xu

    Abstract: RNA structure and functional dynamics play fundamental roles in controlling biological systems. Molecular dynamics simulation, which can characterize interactions at an atomistic level, can advance the understanding on new drug discovery, manufacturing, and delivery mechanisms. However, it is computationally unattainable to support the development of a digital twin for enzymatic reaction network m… ▽ More

    Submitted 17 June, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 12 pages, 5 figures

  40. arXiv:2305.03470  [pdf, other

    astro-ph.GA astro-ph.HE

    Mildly Relativistic Motion in the Radio Quiet Quasar PG 1351+640

    Authors: Ailing Wang, Tao An, Shaoguang Guo, Luis C. Ho, Willem A. Baan, Robert Braun, Sina Chen, Xiaopeng Cheng, Philippa Hartley, Jun Yang, Yingkang Zhang

    Abstract: Measuring the proper motion of the emission component in radio-quiet quasars (RQQs) could help to distinguish between the origins of the radio emission and to understand whether the jet production mechanism is the same in radio-loud quasars (RLQs) and RQQs. PG 1351+640 is one of the few RQQs suitable for proper motion studies: it has two compact components on milli-arcsecond scales, a flat-spectru… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: The article has been published by Oxford University Press: https://academic.oup.com/mnrasl/advance-article-abstract/doi/10.1093/mnrasl/slad051/7146829?utm_source=advanceaccess&utm_campaign=mnrasl&utm_medium=email

  41. arXiv:2305.01776  [pdf, other

    cs.CY

    Taxonomizing and Measuring Representational Harms: A Look at Image Tagging

    Authors: Jared Katzman, Angelina Wang, Morgan Scheuerman, Su Lin Blodgett, Kristen Laird, Hanna Wallach, Solon Barocas

    Abstract: In this paper, we examine computational approaches for measuring the "fairness" of image tagging systems, finding that they cluster into five distinct categories, each with its own analytic foundation. We also identify a range of normative concerns that are often collapsed under the terms "unfairness," "bias," or even "discrimination" when discussing problematic cases of image tagging. Specificall… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: AAAI-23 Special Track on AI for Social Impact

    Journal ref: Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023)

  42. arXiv:2305.01638  [pdf, other

    cs.LG cs.CV stat.ML

    Sequence Modeling with Multiresolution Convolutional Memory

    Authors: Jiaxin Shi, Ke Alexander Wang, Emily B. Fox

    Abstract: Efficiently capturing the long-range patterns in sequential data sources salient to a given task -- such as classification and generative modeling -- poses a fundamental challenge. Popular approaches in the space tradeoff between the memory burden of brute-force enumeration and comparison, as in transformers, the computational burden of complicated sequential dependencies, as in recurrent neural n… ▽ More

    Submitted 1 November, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: ICML 2023, Source code: https://github.com/thjashin/multires-conv

  43. arXiv:2305.01271  [pdf, other

    cond-mat.soft

    Lipid exchange promotes fusion of model protocells

    Authors: Ziyan Fan, Yaam Deckel, Lauren A. Lowe, Daniel W. K. Loo, Tetsuya Yomo, Jack W. Szostak, Collin Nisler, Anna Wang

    Abstract: Vesicle fusion is an important process underlying cell division, transport, and membrane trafficking. In phospholipid systems, a range of fusogens including divalent cations and depletants have been shown to induce adhesion, hemifusion, and then full content fusion between vesicles. This works shows that these fusogens do not perform the same function for fatty acid vesicles, which are used as mod… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 15 pages, 7 figures

  44. arXiv:2305.00510  [pdf, other

    cs.HC cs.CV cs.LG

    Towards AI-Architecture Liberty: A Comprehensive Survey on Designing and Collaborating Virtual Architecture by Deep Learning in the Metaverse

    Authors: Anqi Wang, Jiahua Dong, Lik-Hang Lee, Jiachuan Shen, Pan Hui

    Abstract: 3D shape generation techniques leveraging deep learning have garnered significant interest from both the computer vision and architectural design communities, promising to enrich the content of the future metaverse. However, research on virtual architectural design remains limited, particularly regarding human-AI collaboration and deep learning-assisted design. We first illuminate the principles,… ▽ More

    Submitted 7 April, 2024; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: 37 pages, 9 figures, and 5 tables

    ACM Class: I.2.1; J.5; J.6; I.3.7

  45. arXiv:2304.14674  [pdf, other

    eess.IV cs.CV cs.RO

    SAM Meets Robotic Surgery: An Empirical Study in Robustness Perspective

    Authors: An Wang, Mobarakol Islam, Mengya Xu, Yang Zhang, Hongliang Ren

    Abstract: Segment Anything Model (SAM) is a foundation model for semantic segmentation and shows excellent generalization capability with the prompts. In this empirical study, we investigate the robustness and zero-shot generalizability of the SAM in the domain of robotic surgery in various settings of (i) prompted vs. unprompted; (ii) bounding box vs. points-based prompt; (iii) generalization under corrupt… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Work under active progress

  46. arXiv:2304.14672  [pdf, other

    cs.CV

    Towards Robust Text-Prompted Semantic Criterion for In-the-Wild Video Quality Assessment

    Authors: Haoning Wu, Liang Liao, Annan Wang, Chaofeng Chen, **gwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin

    Abstract: The proliferation of videos collected during in-the-wild natural settings has pushed the development of effective Video Quality Assessment (VQA) methodologies. Contemporary supervised opinion-driven VQA strategies predominantly hinge on training from expensive human annotations for quality scores, which limited the scale and distribution of VQA datasets and consequently led to unsatisfactory gener… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: 13 pages, 10 figures, under review

  47. arXiv:2304.14300  [pdf, other

    cs.LG math.DS q-bio.QM

    Learning Absorption Rates in Glucose-Insulin Dynamics from Meal Covariates

    Authors: Ke Alexander Wang, Matthew E. Levine, Jiaxin Shi, Emily B. Fox

    Abstract: Traditional models of glucose-insulin dynamics rely on heuristic parameterizations chosen to fit observations within a laboratory setting. However, these models cannot describe glucose dynamics in daily life. One source of failure is in their descriptions of glucose absorption rates after meal events. A meal's macronutritional content has nuanced effects on the absorption profile, which is difficu… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: Work presented at NeurIPS 2022 Workshop on Learning from Time Series for Health (TS4H). arXiv admin note: substantial text overlap with arXiv:2302.11939

  48. Periodic orbits and their gravitational wave radiations in a polymer black hole in loop quantum gravity

    Authors: Ze-Yi Tu, Tao Zhu, Anzhong Wang

    Abstract: This article provides a detailed investigation into the motion of the surrounding particles around a polymer black hole in loop quantum gravity (LQG). Using effective potential, the critical bound orbits and innermost stable circular orbits (ISCO) are analyzed. The study finds that the radii and angular momentum of the critical bound orbits decrease with an increase in the parameter $A_λ$ which la… ▽ More

    Submitted 20 July, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: 14 pages, 10 figures, 2 tables;v2:version appeared in PRD

    Journal ref: Phys Rev D 108 (2023) 2, 024035

  49. arXiv:2304.13138  [pdf, other

    cs.AI cs.LG

    The Update-Equivalence Framework for Decision-Time Planning

    Authors: Samuel Sokota, Gabriele Farina, David J. Wu, Hengyuan Hu, Kevin A. Wang, J. Zico Kolter, Noam Brown

    Abstract: The process of revising (or constructing) a policy at execution time -- known as decision-time planning -- has been key to achieving superhuman performance in perfect-information games like chess and Go. A recent line of work has extended decision-time planning to imperfect-information games, leading to superhuman performance in poker. However, these methods involve solving subgames whose sizes gr… ▽ More

    Submitted 13 May, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

  50. arXiv:2304.12164  [pdf, other

    cs.RO cs.AI

    USA-Net: Unified Semantic and Affordance Representations for Robot Memory

    Authors: Benjamin Bolte, Austin Wang, Jimmy Yang, Mustafa Mukadam, Mrinal Kalakrishnan, Chris Paxton

    Abstract: In order for robots to follow open-ended instructions like "go open the brown cabinet over the sink", they require an understanding of both the scene geometry and the semantics of their environment. Robotic systems often handle these through separate pipelines, sometimes using very different representation spaces, which can be suboptimal when the two objectives conflict. In this work, we present U… ▽ More

    Submitted 24 April, 2023; v1 submitted 24 April, 2023; originally announced April 2023.