Skip to main content

Showing 51–100 of 610 results for author: Ma, K

.
  1. arXiv:2403.06406  [pdf, other

    cs.CV

    Comparison of No-Reference Image Quality Models via MAP Estimation in Diffusion Latents

    Authors: Weixia Zhang, Dingquan Li, Guangtao Zhai, Xiaokang Yang, Kede Ma

    Abstract: Contemporary no-reference image quality assessment (NR-IQA) models can effectively quantify the perceived image quality, with high correlations between model predictions and human perceptual scores on fixed test sets. However, little progress has been made in comparing NR-IQA models from a perceptual optimization perspective. Here, for the first time, we demonstrate that NR-IQA models can be plugg… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  2. arXiv:2403.04437  [pdf, other

    cs.CV

    StableDrag: Stable Dragging for Point-based Image Editing

    Authors: Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, Limin Wang

    Abstract: Point-based image editing has attracted remarkable attention since the emergence of DragGAN. Recently, DragDiffusion further pushes forward the generative quality via adapting this dragging technique to diffusion models. Despite these great success, this dragging scheme exhibits two major drawbacks, namely inaccurate point tracking and incomplete motion supervision, which may result in unsatisfact… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  3. arXiv:2403.02752  [pdf, other

    cs.HC

    HINTs: Sensemaking on large collections of documents with Hypergraph visualization and INTelligent agents

    Authors: Sam Yu-Te Lee, Kwan-Liu Ma

    Abstract: Sensemaking on a large collection of documents (corpus) is a challenging task often found in fields such as market research, legal studies, intelligence analysis, political science, computational linguistics, etc. Previous works approach this problem either from a topic- or entity-based perspective, but they lack interpretability and trust due to poor model alignment. In this paper, we present HIN… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  4. arXiv:2403.00862  [pdf, other

    cs.CL cs.AI

    NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism

    Authors: Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Peng Cheng, Yi Luo

    Abstract: We present NewsBench, a novel evaluation framework to systematically assess the capabilities of Large Language Models (LLMs) for editorial capabilities in Chinese journalism. Our constructed benchmark dataset is focused on four facets of writing proficiency and six facets of safety adherence, and it comprises manually and carefully designed 1,267 test samples in the types of multiple choice questi… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Long paper, ACL 2024 Main

  5. arXiv:2403.00334  [pdf, other

    cs.HC

    NOVA: A visual interface for assessing polarizing media coverage

    Authors: Keshav Dasu, Sam Yu-Te Lee, Ying-Cheng Chen, Kwan-Liu Ma

    Abstract: Within the United States, the majority of the populace receives their news online. U.S mainstream media outlets both generate and influence the news consumed by U.S citizens. Many of these citizens have their personal beliefs about these outlets and question the fairness of their reporting. We offer an interactive visualization system for the public to assess their perception of the mainstream med… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  6. arXiv:2402.19276  [pdf, other

    eess.IV cs.CV

    Modular Blind Video Quality Assessment

    Authors: Wen Wen, Mu Li, Yabin Zhang, Yiting Liao, Junlin Li, Li Zhang, Kede Ma

    Abstract: Blind video quality assessment (BVQA) plays a pivotal role in evaluating and improving the viewing experience of end-users across a wide range of video-based platforms and services. Contemporary deep learning-based models primarily analyze video content in its aggressively subsampled format, while being blind to the impact of the actual spatial resolution and frame rate on video quality. In this p… ▽ More

    Submitted 31 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted by CVPR 2024; Camera-ready version

  7. arXiv:2402.17766  [pdf, other

    cs.CV

    ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

    Authors: Zekun Qi, Runpei Dong, Shaochen Zhang, Haoran Geng, Chunrui Han, Zheng Ge, He Wang, Li Yi, Kaisheng Ma

    Abstract: This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM) designed for embodied interaction, exploring a universal 3D object understanding with 3D point clouds and languages. ShapeLLM is built upon an improved 3D encoder by extending ReCon to ReCon++ that benefits from multi-view image distillation for enhanced geometry understanding. By utilizing ReCon++ as the 3D point clo… ▽ More

    Submitted 6 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Project page: https://qizekun.github.io/shapellm/

  8. arXiv:2402.15678  [pdf, other

    cs.DC

    Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding

    Authors: Siqi Wang, Hailong Yang, Xuezhu Wang, Tongxuan Liu, Pengbo Wang, Xuning Liang, Kejie Ma, Tianyu Feng, Xin You, Yongjun Bao, Yi Liu, Zhongzhi Luan, Depei Qian

    Abstract: Large language models (LLM) have recently attracted surging interest due to their outstanding capabilities across various domains. However, enabling efficient LLM inference is challenging due to its autoregressive decoding that generates tokens only one at a time. Although research works apply pruning or quantization to speed up LLM inference, they typically require fine-tuning the LLM, incurring… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  9. Automating Psychological Hypothesis Generation with AI: Large Language Models Meet Causal Graph

    Authors: Song Tong, Kai Mao, Zhen Huang, Yukun Zhao, Kai** Peng

    Abstract: Leveraging the synergy between causal knowledge graphs and a large language model (LLM), our study introduces a groundbreaking approach for computational hypothesis generation in psychology. We analyzed 43,312 psychology articles using a LLM to extract causal relation pairs. This analysis produced a specialized causal graph for psychology. Applying link prediction algorithms, we generated 130 pote… ▽ More

    Submitted 17 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  10. arXiv:2402.14354  [pdf, other

    cs.CV

    GAM-Depth: Self-Supervised Indoor Depth Estimation Leveraging a Gradient-Aware Mask and Semantic Constraints

    Authors: Anqi Cheng, Zhiyuan Yang, Haiyue Zhu, Kezhi Mao

    Abstract: Self-supervised depth estimation has evolved into an image reconstruction task that minimizes a photometric loss. While recent methods have made strides in indoor depth estimation, they often produce inconsistent depth estimation in textureless areas and unsatisfactory depth discrepancies at object boundaries. To address these issues, in this work, we propose GAM-Depth, developed upon two novel co… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: To be published in 2024 IEEE International Conference on Robotics and Automation (ICRA)

  11. arXiv:2402.12774  [pdf, other

    cs.IR

    Interpreting Conversational Dense Retrieval by Rewriting-Enhanced Inversion of Session Embedding

    Authors: Yiruo Cheng, Kelong Mao, Zhicheng Dou

    Abstract: Conversational dense retrieval has shown to be effective in conversational search. However, a major limitation of conversational dense retrieval is their lack of interpretability, hindering intuitive understanding of model behaviors for targeted improvements. This paper presents CONVINV, a simple yet effective approach to shed light on interpretable conversational dense retrieval models. CONVINV t… ▽ More

    Submitted 1 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024. Repo: https://github.com/Ariya12138/ConvInv

  12. arXiv:2402.11480  [pdf, other

    cs.IR

    Pattern-wise Transparent Sequential Recommendation

    Authors: Kun Ma, Cong Xu, Zeyuan Chen, Wei Zhang

    Abstract: A transparent decision-making process is essential for develo** reliable and trustworthy recommender systems. For sequential recommendation, it means that the model can identify critical items asthe justifications for its recommendation results. However, achieving both model transparency and recommendation performance simultaneously is challenging, especially for models that take the entire sequ… ▽ More

    Submitted 9 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  13. arXiv:2402.11419  [pdf, other

    eess.SP

    A Self-Healing Magnetic-Array-Type Current Sensor with Data-Driven Identification of Abnormal Magnetic Measurement Units

    Authors: Xiaohu Liu, Wei Zhao, Kang Ma, Jian Liu, Lisha Peng, Songling Huang, Shisong Li

    Abstract: Magnetic-array-type current sensors have garnered increasing popularity owing to their notable advantages, including broadband functionality, a large dynamic range, cost-effectiveness, and compact dimensions. However, the susceptibility of the measurement error of one or more magnetic measurement units (MMUs) within the current sensor to drift significantly from the nominal value due to environmen… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 11 pages, 10 figures

  14. arXiv:2402.11250  [pdf, other

    eess.IV cs.CV cs.MM

    Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression

    Authors: Dingquan Li, Kede Ma, **g Wang, Ge Li

    Abstract: The Geometry-based Point Cloud Compression (G-PCC) has been developed by the Moving Picture Experts Group to compress point clouds. In its lossy mode, the reconstructed point cloud by G-PCC often suffers from noticeable distortions due to the naïve geometry quantization (i.e., grid downsampling). This paper proposes a hierarchical prior-based super resolution method for point cloud geometry compre… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  15. arXiv:2402.09760  [pdf, other

    cs.CL cs.AI cs.IR

    Grounding Language Model with Chunking-Free In-Context Retrieval

    Authors: Hong** Qian, Zheng Liu, Kelong Mao, Yujia Zhou, Zhicheng Dou

    Abstract: This paper presents a novel Chunking-Free In-Context (CFIC) retrieval approach, specifically tailored for Retrieval-Augmented Generation (RAG) systems. Traditional RAG systems often struggle with grounding responses using precise evidence text due to the challenges of processing lengthy documents and filtering out irrelevant content. Commonly employed solutions, such as document chunking and adapt… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  16. arXiv:2402.07431  [pdf, other

    cs.CL cs.CY

    SALAD: Smart AI Language Assistant Daily

    Authors: Ragib Amin Nihal, Tran Dong Huu Quoc, Lin Zirui, Xu Yimimg, Liu Haoran, An Zhaoyi, Kyou Ma

    Abstract: SALAD is an AI-driven language-learning application designed to help foreigners learn Japanese. It offers translations in Kanji-Kana-Romaji, speech recognition, translated audio, vocabulary tracking, grammar explanations, and songs generated from newly learned words. The app targets beginners and intermediate learners, aiming to make language acquisition more accessible and enjoyable. SALAD uses d… ▽ More

    Submitted 13 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  17. arXiv:2402.07236  [pdf

    stat.AP

    Extending Inferences from Randomized Clinical Trials to Target Populations: A Sco** Review of Transportability Methods

    Authors: Guanbo Wang, Ting-Wei Ernie Liao, David Furfaro, Leo Anthony Celi, Kevin Sheng-Kai Ma

    Abstract: Objective: Randomized controlled trial (RCT) results often inform clinical decision-making, but the highly curated populations of trials and the care provided during the trial are often not reflective of real-world practice. The objective of this sco** review is to identify the ability of methods to transport findings from RCTs to target populations. Study design: A sco** review was conducted… ▽ More

    Submitted 23 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  18. arXiv:2402.07092  [pdf, other

    cs.CL cs.IR

    Generalizing Conversational Dense Retrieval via LLM-Cognition Data Augmentation

    Authors: Haonan Chen, Zhicheng Dou, Kelong Mao, Jiongnan Liu, Ziliang Zhao

    Abstract: Conversational search utilizes muli-turn natural language contexts to retrieve relevant passages. Existing conversational dense retrieval models mostly view a conversation as a fixed sequence of questions and responses, overlooking the severe data sparsity problem -- that is, users can perform a conversation in various ways, and these alternate conversations are unrecorded. Consequently, they ofte… ▽ More

    Submitted 3 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  19. arXiv:2402.05817  [pdf

    eess.IV cs.CV cs.LG

    Using YOLO v7 to Detect Kidney in Magnetic Resonance Imaging

    Authors: Pouria Yazdian Anari, Fiona Obiezu, Nathan Lay, Fatemeh Dehghani Firouzabadi, Aditi Chaurasia, Mahshid Golagha, Shiva Singh, Fatemeh Homayounieh, Aryan Zahergivar, Stephanie Harmon, Evrim Turkbey, Rabindra Gautam, Kevin Ma, Maria Merino, Elizabeth C. Jones, Mark W. Ball, W. Marston Linehan, Baris Turkbey, Ashkan A. Malayeri

    Abstract: Introduction This study explores the use of the latest You Only Look Once (YOLO V7) object detection method to enhance kidney detection in medical imaging by training and testing a modified YOLO V7 on medical image formats. Methods Study includes 878 patients with various subtypes of renal cell carcinoma (RCC) and 206 patients with normal kidneys. A total of 5657 MRI scans for 1084 patients were r… ▽ More

    Submitted 12 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  20. arXiv:2402.05009  [pdf, other

    stat.AP

    A Review on Trajectory Datasets on Advanced Driver Assistance System

    Authors: Hang Zhou, Ke Ma, Xiaopeng Li

    Abstract: This paper presents a comprehensive review of trajectory data of Advanced Driver Assistance System equipped-vehicle, with the aim of precisely model of Autonomous Vehicles (AVs) behavior. This study emphasizes the importance of trajectory data in the development of AV models, especially in car-following scenarios. We introduce and evaluate several datasets: the OpenACC Dataset, the Connected & Aut… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 figures

  21. arXiv:2402.00629  [pdf, other

    cs.AR

    Cocco: Hardware-Map** Co-Exploration towards Memory Capacity-Communication Optimization

    Authors: Zhanhong Tan, Zijian Zhu, Kaisheng Ma

    Abstract: Memory is a critical design consideration in current data-intensive DNN accelerators, as it profoundly determines energy consumption, bandwidth requirements, and area costs. As DNN structures become more complex, a larger on-chip memory capacity is required to reduce data movement overhead, but at the expense of silicon costs. Some previous works have proposed memory-oriented optimizations, such a… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'24)

  22. arXiv:2401.16659  [pdf, other

    cs.IR cs.CL

    History-Aware Conversational Dense Retrieval

    Authors: Fengran Mo, Chen Qu, Kelong Mao, Tianyu Zhu, Zhan Su, Kaiyu Huang, Jian-Yun Nie

    Abstract: Conversational search facilitates complex information retrieval by enabling multi-turn interactions between users and the system. Supporting such interactions requires a comprehensive understanding of the conversational inputs to formulate a good search query based on historical information. In particular, the search query should include the relevant information from the previous conversation turn… ▽ More

    Submitted 28 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted to Findings of ACL 2024

  23. arXiv:2401.13919  [pdf, other

    cs.CL cs.AI

    WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

    Authors: Hongliang He, Wenlin Yao, Kaixin Ma, Wenhao Yu, Yong Dai, Hongming Zhang, Zhenzhong Lan, Dong Yu

    Abstract: The rapid advancement of large language models (LLMs) has led to a new era marked by the development of autonomous applications in real-world scenarios, which drives innovation in creating advanced web agents. Existing web agents typically only handle one input modality and are evaluated only in simplified web simulators or static web snapshots, greatly limiting their applicability in real-world s… ▽ More

    Submitted 6 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024 (main). Code and data is released at https://github.com/MinorJerry/WebVoyager

  24. arXiv:2401.13485  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci

    Ti4Ir2O a time-reversal-invariant fully gapped unconventional superconductor

    Authors: Debarchan Das, KeYuan Ma, Jan Jaroszynski, Vahid Sazgari, Tomasz Klimczuk, Fabian O. von Rohr, Zurab Guguchia

    Abstract: Here we report muon spin rotation (muSR) experiments on the temperature and field dependence of the effective magnetic penetration depth (lambda) in the eta-carbide-type suboxide Ti4Ir2O, a superconductor with an considerably high upper critical field. Temperature dependence of penetration depth, obtained from transverse-field (TF)-muSR measurements, is in perfect agreement with an isotropic fully… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 7 pages, 3 figures. The methodology employed in this paper bears resemblance to that described in arXiv:2209.03187

  25. arXiv:2401.13478  [pdf, other

    cs.IR cs.CL cs.CV cs.MM

    SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval

    Authors: Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kai**g Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin

    Abstract: Multi-modal information retrieval (MMIR) is a rapidly evolving field, where significant progress, particularly in image-text pairing, has been made through advanced representation learning and cross-modality alignment research. However, current benchmarks for evaluating MMIR performance in image-text pairing within the scientific domain show a notable gap, where chart and table images described in… ▽ More

    Submitted 11 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: camera-ready version for ACL 2024 Findings

  26. GI-PIP: Do We Require Impractical Auxiliary Dataset for Gradient Inversion Attacks?

    Authors: Yu Sun, Gaojian Xiong, Xianxun Yao, Kailang Ma, Jian Cui

    Abstract: Deep gradient inversion attacks expose a serious threat to Federated Learning (FL) by accurately recovering private data from shared gradients. However, the state-of-the-art heavily relies on impractical assumptions to access excessive auxiliary data, which violates the basic data partitioning principle of FL. In this paper, a novel method, Gradient Inversion Attack using Practical Image Prior (GI… ▽ More

    Submitted 1 April, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  27. arXiv:2401.06462  [pdf, other

    cs.CV cs.HC

    AttributionScanner: A Visual Analytics System for Model Validation with Metadata-Free Slice Finding

    Authors: Xiwei Xuan, Jorge Piazentin Ono, Liang Gou, Kwan-Liu Ma, Liu Ren

    Abstract: Data slice finding is an emerging technique for validating machine learning (ML) models by identifying and analyzing subgroups in a dataset that exhibit poor performance, often characterized by distinct feature sets or descriptive metadata. However, in the context of validating vision models involving unstructured image data, this approach faces significant challenges, including the laborious and… ▽ More

    Submitted 4 May, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: 12 pages, 12 figures, 3 tables. This manuscript is under review by the IEEE Transactions on Visualization and Computer Graphics (TVCG)

  28. arXiv:2401.05960  [pdf, other

    cs.AI

    Machine Learning Insides OptVerse AI Solver: Design Principles and Applications

    Authors: Xijun Li, Fangzhou Zhu, Hui-Ling Zhen, Weilin Luo, Meng Lu, Yimin Huang, Zhenan Fan, Zirui Zhou, Yufei Kuang, Zhihai Wang, Zijie Geng, Yang Li, Haoyang Liu, Zhiwu An, Muming Yang, Jianshu Li, Jie Wang, Junchi Yan, Defeng Sun, Tao Zhong, Yong Zhang, Jia Zeng, Mingxuan Yuan, Jianye Hao, Jun Yao , et al. (1 additional authors not shown)

    Abstract: In an era of digital ubiquity, efficient resource management and decision-making are paramount across numerous industries. To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional opt… ▽ More

    Submitted 17 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  29. arXiv:2401.05011  [pdf, other

    cs.CV

    Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection

    Authors: Yucheng Han, Na Zhao, Weiling Chen, Keng Teck Ma, Hanwang Zhang

    Abstract: Semi-supervised 3D object detection is a promising yet under-explored direction to reduce data annotation costs, especially for cluttered indoor scenes. A few prior works, such as SESS and 3DIoUMatch, attempt to solve this task by utilizing a teacher model to generate pseudo-labels for unlabeled samples. However, the availability of unlabeled samples in the 3D domain is relatively limited compared… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Code is available at https://github.com/tingxueronghua/DPKE

  30. arXiv:2401.04662  [pdf, other

    cs.CR

    The Devil Behind the Mirror: Tracking the Campaigns of Cryptocurrency Abuses on the Dark Web

    Authors: Pengcheng Xia, Zhou Yu, Kailong Wang, Kai Ma, Shuo Chen, Xiapu Luo, Ya** Zhou, Lei Wu, Guangdong Bai

    Abstract: The dark web has emerged as the state-of-the-art solution for enhanced anonymity. Just like a double-edged sword, it also inadvertently becomes the safety net and breeding ground for illicit activities. Among them, cryptocurrencies have been prevalently abused to receive illicit income while evading regulations. Despite the continuing efforts to combat illicit activities, there is still a lack of… ▽ More

    Submitted 7 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  31. arXiv:2401.03700  [pdf, other

    cs.SI cs.HC cs.LG

    A Visual Analytics Design for Connecting Healthcare Team Communication to Patient Outcomes

    Authors: Hsiao-Ying Lu, Yiran Li, Kwan-Liu Ma

    Abstract: Communication among healthcare professionals (HCPs) is crucial for the quality of patient treatment. Surrounding each patient's treatment, communication among HCPs can be examined as temporal networks, constructed from Electronic Health Record (EHR) access logs. This paper introduces a visual analytics system designed to study the effectiveness and efficiency of temporal communication networks med… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  32. arXiv:2401.03206  [pdf, ps, other

    cs.LG math.NA math.OC math.PR math.ST stat.ME stat.ML

    A Robbins--Monro Sequence That Can Exploit Prior Information For Faster Convergence

    Authors: Siwei Liu, Ke Ma, Stephan M. Goetz

    Abstract: We propose a new method to improve the convergence speed of the Robbins-Monro algorithm by introducing prior information about the target point into the Robbins-Monro iteration. We achieve the incorporation of prior information without the need of a -- potentially wrong -- regression model, which would also entail additional constraints. We show that this prior-information Robbins-Monro sequence i… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 26 pages, 5 figures

    MSC Class: 62L20; 62L05; 62L10; 60G99; 60-08; 65B99; 65C99; 90C15

  33. arXiv:2401.01518  [pdf, other

    quant-ph

    Highly Scalable Quantum Router with Frequency-Independent Scattering Spectra

    Authors: Yue Cai, Kang-Jie Ma, Jie Liu, Gang-Feng Guo, Lei Tan, Wu-Ming Liu

    Abstract: Optical quantum routers which play a crucial role in quantum networks, have been extensively studied in both theory and experiment, resulting in significant advancements in their performance. However, these routers impose stringent requirements for achieving optimal routing performance, where the incident photon frequency must be in strict resonance with one or several specific frequencies. To add… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  34. arXiv:2401.00896  [pdf, other

    cs.CV

    TrailBlazer: Trajectory Control for Diffusion-Based Video Generation

    Authors: Wan-Duo Kurt Ma, J. P. Lewis, W. Bastiaan Kleijn

    Abstract: Within recent approaches to text-to-video (T2V) generation, achieving controllability in the synthesized video is often a challenge. Typically, this issue is addressed by providing low-level per-frame guidance in the form of edge maps, depth maps, or an existing video to be altered. However, the process of obtaining such guidance can be labor-intensive. This paper focuses on enhancing controllabil… ▽ More

    Submitted 8 April, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: 14 pages, 18 figures, Project Page: https://hohonu-vicml.github.io/Trailblazer.Page/

  35. arXiv:2312.16679  [pdf

    cond-mat.mtrl-sci

    Square Moiré Superlattices in Twisted Two-Dimensional Halide Perovskites

    Authors: Shuchen Zhang, Linrui **, Yuan Lu, Linghai Zhang, Jiaqi Yang, Qiuchen Zhao, Dewei Sun, Joshua J. P. Thompson, Biao Yuan, Ke Ma, Akriti, Jee Yung Park, Yoon Ho Lee, Zitang Wei, Blake P. Finkenauer, Daria D. Blach, Sarath Kumar, Hailin Peng, Arun Mannodi-Kanakkithodi, Yi Yu, Ermin Malic, Gang Lu, Letian Dou, Libai Huang

    Abstract: Moiré superlattices have emerged as a new platform for studying strongly correlated quantum phenomena, but these systems have been largely limited to van der Waals layer two-dimensional (2D) materials. Here we introduce moiré superlattices leveraging ultra-thin, ligand-free halide perovskites, facilitated by ionic interactions. Square moiré superlattices with varying periodic lengths are clearly v… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  36. arXiv:2312.16436  [pdf, other

    cs.AR

    Gemini: Map** and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators

    Authors: **gwei Cai, Zuotong Wu, Sen Peng, Yuchen Wei, Zhanhong Tan, Guiming Shi, Mingyu Gao, Kaisheng Ma

    Abstract: Chiplet technology enables the integration of an increasing number of transistors on a single accelerator with higher yield in the post-Moore era, addressing the immense computational demands arising from rapid AI advancements. However, it also introduces more expensive packaging costs and costly Die-to-Die (D2D) interfaces, which require more area, consume higher power, and offer lower bandwidth… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: Accepted by 2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA)

  37. arXiv:2312.15903  [pdf, other

    cs.IR

    An Incremental Update Framework for Online Recommenders with Data-Driven Prior

    Authors: Chen Yang, ** Chen, Qian Yu, Xiangdong Wu, Kui Ma, Zihao Zhao, Zhiwei Fang, Wenlong Chen, Chaosheng Fan, Jie He, Chang** Peng, Zhangang Lin, **g** Shao

    Abstract: Online recommenders have attained growing interest and created great revenue for businesses. Given numerous users and items, incremental update becomes a mainstream paradigm for learning large-scale models in industrial scenarios, where only newly arrived data within a sliding window is fed into the model, meeting the strict requirements of quick response. However, this strategy would be prone to… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  38. Joint Trading and Scheduling among Coupled Carbon-Electricity-Heat-Gas Industrial Clusters

    Authors: Dafeng Zhu, Bo Yang, Yu Wu, Haoran Deng, Zhaoyang Dong, Kai Ma, ** Guan

    Abstract: This paper presents a carbon-energy coupling management framework for an industrial park, where the carbon flow model accompanying multi-energy flows is adopted to track and suppress carbon emissions on the user side. To deal with the quadratic constraint of gas flows, a bound tightening algorithm for constraints relaxation is adopted. The synergies among the carbon capture, energy storage, power-… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Journal ref: IEEE Transactions on Smart Grid, 2023

  39. arXiv:2312.08000  [pdf, other

    cs.CR

    SoK: On the Security of Non-Fungible Tokens

    Authors: Kai Ma, **tao Huang, Ningyu He, Zhuo Wang, Haoyu Wang

    Abstract: Non-fungible tokens (NFTs) drive the prosperity of the Web3 ecosystem. By November 2023, the total market value of NFT projects reached approximately 16 billion USD. Accompanying the success of NFTs are various security issues, i.e., attacks and scams are prevalent in the ecosystem. While NFTs have attracted significant attentions from both industry and academia, there is a lack of understanding o… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  40. arXiv:2312.07750  [pdf, other

    astro-ph.GA

    A Galactic Eclipse: The Small Magellanic Cloud is Forming Stars in Two, Superimposed Systems

    Authors: Claire E. Murray, Sten Hasselquist, Joshua E. G. Peek, Christina Willecke Lindberg, Andres Almeida, Yumi Choi, Jessica E. M. Craig, Helga Denes, John M. Dickey, Enrico M. Di Teodoro, Christoph Federrath, Isabella A. Gerrard, Steven J. Gibson, Denis Leahy, Min-Young Lee, Callum Lynn, Yik Ki Ma, Antoine Marchal, N. M. McClure-Griffiths, David Nidever, Hiep Nguyen, Nickolas M. **el, Elizabeth Tarantino, Lucero Uscanga, Jacco Th. van Loon

    Abstract: The structure and dynamics of the star-forming disk of the Small Magellanic Cloud (SMC) have long confounded us. The SMC is widely used as a prototype for galactic physics at low metallicity, and yet we fundamentally lack an understanding of the structure of its interstellar medium (ISM). In this work, we present a new model for the SMC by comparing the kinematics of young, massive stars with the… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: ApJ accepted. 20 pages, 18 figures

  41. arXiv:2312.06648  [pdf, other

    cs.CL cs.AI cs.IR

    Dense X Retrieval: What Retrieval Granularity Should We Use?

    Authors: Tong Chen, Hongwei Wang, Sihao Chen, Wenhao Yu, Kaixin Ma, Xinran Zhao, Hongming Zhang, Dong Yu

    Abstract: Dense retrieval has become a prominent method to obtain relevant context or world knowledge in open-domain NLP tasks. When we use a learned dense retriever on a retrieval corpus at inference time, an often-overlooked design choice is the retrieval unit in which the corpus is indexed, e.g. document, passage, or sentence. We discover that the retrieval unit choice significantly impacts the performan… ▽ More

    Submitted 11 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

  42. arXiv:2312.01679  [pdf, other

    eess.IV cs.CV cs.LG

    Adversarial Medical Image with Hierarchical Feature Hiding

    Authors: Qingsong Yao, Zecheng He, Yuexiang Li, Yi Lin, Kai Ma, Yefeng Zheng, S. Kevin Zhou

    Abstract: Deep learning based methods for medical images can be easily compromised by adversarial examples (AEs), posing a great security flaw in clinical decision-making. It has been discovered that conventional adversarial attacks like PGD which optimize the classification logits, are easy to distinguish in the feature space, resulting in accurate reactive defenses. To better understand this phenomenon an… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Our code is available at \url{https://github.com/qsyao/Hierarchical_Feature_Constraint}. arXiv admin note: text overlap with arXiv:2012.09501

  43. arXiv:2311.14288  [pdf, other

    cs.SI

    Fair Influence Maximization in Social Networks: A Community-Based Evolutionary Algorithm

    Authors: Kaicong Ma, Xinxiang Xu, Haipeng Yang, Renzhi Cao, Lei Zhang

    Abstract: Influence Maximization (IM) has been extensively studied in network science, which attempts to find a subset of users to maximize the influence spread. A new variant of IM, Fair Influence Maximization (FIM), which primarily enhances the fair propagation of information, attracts increasing attention in academic. However, existing algorithms for FIM suffer from a trade-off between fairness and runni… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  44. arXiv:2311.10745  [pdf

    cs.CY cs.OH

    "Just a little bit on the outside for the whole time": Social belonging confidence and the persistence of Machine Learning and Artificial Intelligence students

    Authors: Katherine Mao, Sharon Ferguson, James Magarian, Alison Olechowski

    Abstract: The growing field of machine learning (ML) and artificial intelligence (AI) presents a unique and unexplored case within persistence research, meaning it is unclear how past findings from engineering will apply to this develo** field. We conduct an exploratory study to gain an initial understanding of persistence in this field and identify fruitful directions for future work. One factor that has… ▽ More

    Submitted 30 October, 2023; originally announced November 2023.

    Comments: Published in the 2023 Annual Conference of the American Society for Engineering Education

    Journal ref: 2023 ASEE Annual Conference & Exposition, Baltimore , Maryland

  45. arXiv:2311.10744  [pdf

    cs.CY

    Advancing a Model of Students' Intentional Persistence in Machine Learning and Artificial Intelligence

    Authors: Sharon Ferguson, Katherine Mao, James Magarian, Alison Olechowski

    Abstract: Machine Learning (ML) and Artificial Intelligence (AI) are powering the applications we use, the decisions we make, and the decisions made about us. We have seen numerous examples of non-equitable outcomes, from facial recognition algorithms to recidivism algorithms, when they are designed without diversity in mind. Thus, we must take action to promote diversity among those in this field. A critic… ▽ More

    Submitted 30 October, 2023; originally announced November 2023.

    Comments: Presented at the 2022 Annual Conference of the American Society for Engineering Education

    Journal ref: Paper presented at 2022 ASEE Annual Conference & Exposition, Minneapolis, MN

  46. arXiv:2311.09210  [pdf, other

    cs.CL cs.AI

    Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models

    Authors: Wenhao Yu, Hongming Zhang, Xiaoman Pan, Kaixin Ma, Hongwei Wang, Dong Yu

    Abstract: Retrieval-augmented language models (RALMs) represent a substantial advancement in the capabilities of large language models, notably in reducing factual hallucination by leveraging external knowledge sources. However, the reliability of the retrieved information is not always guaranteed. The retrieval of irrelevant data can lead to misguided responses, and potentially causing the model to overloo… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Preprint

  47. arXiv:2311.06555  [pdf, other

    cs.CL cs.AI

    Heuristic-Driven Link-of-Analogy Prompting: Enhancing Large Language Models for Document-Level Event Argument Extraction

    Authors: Hanzhang Zhou, Junlang Qian, Zijian Feng, Hui Lu, Zixiao Zhu, Kezhi Mao

    Abstract: In this study, we investigate in-context learning (ICL) in document-level event argument extraction (EAE) to alleviate the dependency on large-scale labeled data for this task. We introduce the Heuristic-Driven Link-of-Analogy (HD-LoA) prompting to address the challenge of example selection and to develop a prompting strategy tailored for EAE. Specifically, we hypothesize and validate that LLMs le… ▽ More

    Submitted 19 February, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

  48. arXiv:2311.02513  [pdf

    physics.optics quant-ph

    Highly tunable room-temperature plexcitons in monolayer WSe2 /gap-plasmon nanocavities

    Authors: Thomas P. Darlington, Mahfujur Rahaman, Kevin W. C. Kwock, Emanuil Yanev, Xuehao Wu, Luke N. Holtzman, Madisen Holbrook, Gwangwoo Kim, Kyung Yeol Ma, Hyeon Suk Shin, Andrey Krayev, Matthew Strasbourg, Nicholas J. Borys, D. N. Basov, Katayun Barmak, James C. Hone, Abhay N. Pasupathy, Deep Jariwala, P. James Schuck

    Abstract: The advancement of quantum photonic technologies relies on the ability to precisely control the degrees of freedom of optically active states. Here, we realize real-time, room-temperature tunable strong plasmon-exciton coupling in 2D semiconductor monolayers enabled by a general approach that combines strain engineering plus force- and voltage-adjustable plasmonic nanocavities. We show that the ex… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 17 pages, 4 figures

  49. arXiv:2311.01016  [pdf, other

    cs.CV

    Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning

    Authors: Yiran Li, Junpeng Wang, Prince Aboagye, Michael Yeh, Yan Zheng, Liang Wang, Wei Zhang, Kwan-Liu Ma

    Abstract: Recent advancements in pre-trained large-scale language-image models have ushered in a new era of visual comprehension, offering a significant leap forward. These breakthroughs have proven particularly instrumental in addressing long-standing challenges that were previously daunting. Leveraging these innovative techniques, this paper tackles two well-known issues within the realm of visual analyti… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  50. arXiv:2310.16950  [pdf, ps, other

    math.AG

    Stability manifolds of Kuznetsov components of prime Fano threefolds

    Authors: Chang** Fan, Zhiyu Liu, Songtao Kenneth Ma

    Abstract: Let $X$ be a cubic threefold, quartic double solid or Gushel--Mukai threefold, and $\mathcal{K}u(X)\subset \mathrm{D}^b(X)$ be its Kuznetsov component. We show that a stability condition $σ$ on $\mathcal{K}u(X)$ is Serre-invariant if and only if its homological dimension is at most $2$. As a corollary, we prove that all Serre-invariant stability conditions on $\mathcal{K}u(X)$ form a contractible… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 19 pages, comments are very welcome!