Showing 1–50 of 307 results for author: Hong, M

  1. arXiv:2407.05514 [pdf, ps, other]

    math.PR

    Exact convergence rates to derivatives of local time for some self-similar Gaussian processes

    Authors: Minhao Hong

    Abstract: In this article, for some $d$-dimensional Gaussian processes \[X=\big\{X_t=(X^1_t,\cdots,X^d_t):t\ge0\big\},\] whose components are i.i.d. $1$-dimensional self-similar Gaussian processes with Hurst index $H\in(0,1)$, we consider the asymptotic behavior of approximation of its $\boldsymbol{k}$-th derivatives of local time under certain mild conditions, where $\boldsymbol{k}=(k_1,\cdots,k_d)$ an…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 21 pages

  2. arXiv:2407.02906 [pdf, other]

    cs.CV

    Single Image Rolling Shutter Removal with Diffusion Models

    Authors: Zhanglei Yang, Haipeng Li, Mingbo Hong, Bing Zeng, Shuaicheng Liu

    Abstract: We present RS-Diffusion, the first Diffusion Models-based method for single-frame Rolling Shutter (RS) correction. RS artifacts compromise visual quality of frames due to the row-wise exposure of CMOS sensors. Most previous methods have focused on multi-frame approaches, using temporal information from consecutive frames for the motion rectification. However, few approaches address the more challe…

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2407.00817 [pdf]

    cs.AR

    Multi-Objective Optimization for Common-Centroid Placement of Analog Transistors

    Authors: Supriyo Maji, Hyungjoo Park, Gi moon Hong, Souradip Poddar, David Z. Pan

    Abstract: In analog circuits, process variation can cause unpredictability in circuit performance. Common-centroid (CC) type layouts have been shown to mitigate process-induced variations and are widely used to match circuit elements. Nevertheless, selecting the most suitable CC topology necessitates careful consideration of important layout constraints. Manual handling of these constraints becomes challeng…

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2406.14017 [pdf, other]

    cs.IR

    EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration

    Authors: Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong

    Abstract: Generative retrieval has recently emerged as a promising approach to sequential recommendation, framing candidate item retrieval as an autoregressive sequence generation problem. However, existing generative methods typically focus solely on either behavioral or semantic aspects of item information, neglecting their complementary nature and thus resulting in limited effectiveness. To address this…

    Submitted 3 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024. Code available at https://reczoo.github.io/EAGER

  5. arXiv:2406.09841 [pdf, other]

    cs.LG q-bio.BM

    Learning Multi-view Molecular Representations with Structured and Unstructured Knowledge

    Authors: Yizhen Luo, Kai Yang, Massimo Hong, Xing Yi Liu, Zikun Nie, Hao Zhou, Zaiqing Nie

    Abstract: Capturing molecular knowledge with representation learning approaches holds significant potential in vast scientific fields such as chemistry and life science. An effective and generalizable molecular representation is expected to capture the consensus and complementary molecular expertise from diverse views and perspectives. However, existing works fall short in learning multi-view molecular repr…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures

  6. arXiv:2406.06874 [pdf, other]

    cs.AI cs.HC cs.RO

    Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback

    Authors: Chenliang Li, Siliang Zeng, Zeyi Liao, Jiaxiang Li, Dongyeop Kang, Alfredo Garcia, Mingyi Hong

    Abstract: Aligning human preference and value is an important requirement for building contemporary foundation models and embodied AI. However, popular approaches such as reinforcement learning with human feedback (RLHF) break down the task into successive stages, such as supervised fine-tuning (SFT), reward modeling (RM), and reinforcement learning (RL), each performing one specific learning task. Such a s…

    Submitted 19 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  7. arXiv:2406.02214 [pdf, other]

    cs.LG

    SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining

    Authors: Andi Han, Jiaxiang Li, Wei Huang, Mingyi Hong, Akiko Takeda, Pratik Jawanpuria, Bamdev Mishra

    Abstract: Large language models (LLMs) have shown impressive capabilities across various tasks. However, training LLMs from scratch requires significant computational power and extensive memory capacity. Recent studies have explored low-rank structures on weights for efficient fine-tuning in terms of parameters and memory, either through low-rank adaptation or factorization. While effective for fine-tuning,…

    Submitted 4 June, 2024; originally announced June 2024.

  8. arXiv:2405.18881 [pdf, other]

    cs.LG cs.AI

    Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization

    Authors: Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang, Mingyi Hong, Fan Wang, Tsung-Hui Chang

    Abstract: In this work, we focus on the alignment problem of diffusion models with a continuous reward function, which represents specific objectives for downstream tasks, such as improving human preference. The central goal of the alignment problem is to adjust the distribution learned by diffusion models such that the generated samples maximize the target reward function. We propose a novel alignment appr…

    Submitted 3 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  9. arXiv:2405.17888 [pdf, other]

    cs.AI

    Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment

    Authors: Jiaxiang Li, Siliang Zeng, Hoi-To Wai, Chenliang Li, Alfredo Garcia, Mingyi Hong

    Abstract: Aligning human preference and value is an important requirement for contemporary foundation models. State-of-the-art techniques such as Reinforcement Learning from Human Feedback (RLHF) often consist of two stages: 1) supervised fine-tuning (SFT), where the model is fine-tuned by learning from human demonstration data; 2) Preference learning, where preference data is used to learn a reward model,…

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  10. arXiv:2405.15234 [pdf, other]

    cs.CV cs.CR

    Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models

    Authors: Yimeng Zhang, Xin Chen, Jinghan Jia, Yihua Zhang, Chongyu Fan, Jiancheng Liu, Mingyi Hong, Ke Ding, Sijia Liu

    Abstract: Diffusion models (DMs) have achieved remarkable success in text-to-image generation, but they also pose safety risks, such as the potential generation of harmful content and copyright violations. The techniques of machine unlearning, also known as concept erasing, have been developed to address these risks. However, these techniques remain vulnerable to adversarial prompt attacks, which can prompt…

    Submitted 14 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Codes are available at https://github.com/OPTML-Group/AdvUnlearn

  11. arXiv:2405.12794 [pdf, other]

    quant-ph physics.optics

    Multiphoton Quantum Imaging using Natural Light

    Authors: Fatemeh Mostafavi, Mingyuan Hong, Riley B. Dawkins, Jannatul Ferdous, Rui-Bo Jin, Roberto de J. Leon-Montiel, Chenglong You, Omar S. Magana-Loaiza

    Abstract: It is thought that schemes for quantum imaging are fragile against realistic environments in which the background noise is often stronger than the nonclassical signal of the imaging photons. Unfortunately, it is unfeasible to produce brighter quantum light sources to alleviate this problem. Here, we overcome this paradigmatic limitation by developing a quantum imaging scheme that relies on the use…

    Submitted 21 May, 2024; originally announced May 2024.

  12. arXiv:2405.04360 [pdf, other]

    quant-ph

    Polarization-entangled photon pair source using beam displacers and thin crystals

    Authors: Minjae Hong, Rodrigo Gomez, Valerio Flavio Gili, Jorge Fuenzalida, Markus Gräfe

    Abstract: We present an experimental implementation of a polarization-entangled photon pair source based on beam displacers. The down-converted photons are emitted via spontaneous parametric down-conversion in a non-degenerate and type-0 process. We obtain a state fidelity of $F=0.975\pm0.004$ and violate a Clauser-Horne-Shimony-Holt inequality with $S=2.75\pm0.01$. Our source also uses thin crystals for ap…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures

  13. arXiv:2404.10575 [pdf, other]

    cs.LG cs.AI cs.CV math.OC

    EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence

    Authors: Chung-Yiu Yau, Hoi-To Wai, Parameswaran Raman, Soumajyoti Sarkar, Mingyi Hong

    Abstract: A key challenge in contrastive learning is to generate negative samples from a large sample set to contrast with positive samples, for learning better encoding of the data. These negative samples often follow a softmax distribution which is dynamically updated during the training process. However, sampling from this distribution is non-trivial due to the high computational costs in computing the…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 20 pages

  14. arXiv:2404.09800 [pdf, ps, other]

    math.PR

    Fractional derivatives of local times for some Gaussian processes

    Authors: Minhao Hong, Qian Yu

    Abstract: In this article, we consider fractional derivatives of local time for $d$-dimensional centered Gaussian processes satisfying a certain strong local nondeterminism property. We first give a condition for existence of fractional derivatives of the local time defined by Marchaud derivatives in $L^p$ $(p\ge1)$ and show that these derivatives are Hölder continuous with respect to both time and space variabl…

    Submitted 15 April, 2024; originally announced April 2024.

  15. arXiv:2404.01954 [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  16. arXiv:2403.18774 [pdf, other]

    cs.CV cs.CR cs.LG

    RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable Guarantees

    Authors: Xun Xian, Ganghua Wang, Xuan Bi, Jayanth Srinivasa, Ashish Kundu, Mingyi Hong, Jie Ding

    Abstract: Safeguarding intellectual property and preventing potential misuse of AI-generated images are of paramount importance. This paper introduces a robust and agile plug-and-play watermark detection framework, dubbed as RAW. As a departure from traditional encoder-decoder methods, which incorporate fixed binary codes as watermarks within latent representations, our approach introduces learnable waterma…

    Submitted 23 January, 2024; originally announced March 2024.

  17. arXiv:2403.17201 [pdf, other]

    quant-ph physics.optics

    Emergence of multiphoton quantum coherence by light propagation

    Authors: Jannatul Ferdous, Mingyuan Hong, Riley B. Dawkins, Fatemeh Mostafavi, Alina Oktyabrskaya, Chenglong You, Roberto de J. León-Montiel, Omar S. Magaña-Loaiza

    Abstract: The modification of the quantum properties of coherence of photons through their interaction with matter lies at the heart of the quantum theory of light. Indeed, the absorption and emission of photons by atoms can lead to different kinds of light with characteristic quantum statistical properties. As such, different types of light are typically associated with distinct sources. Here, we report on…

    Submitted 5 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  18. arXiv:2403.09868 [pdf, other]

    quant-ph physics.optics

    The Quantum Gaussian-Schell Model: A Link Between Classical and Quantum Optics

    Authors: Riley B. Dawkins, Mingyuan Hong, Chenglong You, Omar S. Magana-Loaiza

    Abstract: The quantum theory of the electromagnetic field uncovered that classical forms of light were indeed produced by distinct superpositions of nonclassical multiphoton wavepackets. Specifically, partially coherent light represents the most common kind of classical light. Here, for the first time, we demonstrate the extraction of the constituent multiphoton quantum systems of a partially coherent light…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 figures

  19. arXiv:2403.00282 [pdf, other]

    cs.LG

    Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

    Authors: Dohyeong Kim, Mineui Hong, Jeongho Park, Songhwai Oh

    Abstract: In many real-world applications, a reinforcement learning (RL) agent should consider multiple objectives and adhere to safety guidelines. To address these considerations, we propose a constrained multi-objective RL algorithm named Constrained Multi-Objective Gradient Aggregator (CoMOGA). In the field of multi-objective optimization, managing conflicts between the gradients of the multiple objectiv…

    Submitted 31 May, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: 25 pages

  20. arXiv:2402.18752 [pdf, other]

    cs.LG cs.CR

    Pre-training Differentially Private Models with Limited Public Data

    Authors: Zhiqi Bu, Xinwei Zhang, Mingyi Hong, Sheng Zha, George Karypis

    Abstract: The superior performance of large foundation models relies on the use of massive amounts of high-quality data, which often contain sensitive, private and copyrighted material that requires formal protection. While differential privacy (DP) is a prominent method to gauge the degree of security provided to the models, its application is commonly limited to the model fine-tuning stage, due to the per…

    Submitted 28 February, 2024; originally announced February 2024.

  21. arXiv:2402.15997 [pdf, other]

    cs.HC cs.GR cs.LG

    Cieran: Designing Sequential Colormaps via In-Situ Active Preference Learning

    Authors: Matt-Heun Hong, Zachary N. Sunberg, Danielle Albers Szafir

    Abstract: Quality colormaps can help communicate important data patterns. However, finding an aesthetically pleasing colormap that looks "just right" for a given scenario requires significant design and technical expertise. We introduce Cieran, a tool that allows any data analyst to rapidly find quality colormaps while designing charts within Jupyter Notebooks. Our system employs an active preference learni…

    Submitted 29 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: CHI 2024. 12 pages/9 figures

  22. arXiv:2402.11592 [pdf, other]

    cs.LG cs.CL

    Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

    Authors: Yihua Zhang, Pingzhi Li, Junyuan Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong, Zhangyang Wang, Sijia Liu, Tianlong Chen

    Abstract: In the evolving landscape of natural language processing (NLP), fine-tuning pre-trained Large Language Models (LLMs) with first-order (FO) optimizers like SGD and Adam has become standard. Yet, as LLMs grow in size, the substantial memory overhead from back-propagation (BP) for FO gradient computation presents a significant challenge. Addressing this issue is crucial, especially for applications…

    Submitted 27 May, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  23. arXiv:2402.11424 [pdf, other]

    cs.CV cs.AI

    Data Distribution Distilled Generative Model for Generalized Zero-Shot Recognition

    Authors: Yijie Wang, Mingjian Hong, Luwen Huangfu, Sheng Huang

    Abstract: In the realm of Zero-Shot Learning (ZSL), we address biases in Generalized Zero-Shot Learning (GZSL) models, which favor seen data. To counter this, we introduce an end-to-end generative GZSL framework called D$^3$GZSL. This framework respects seen and synthesized unseen data as in-distribution and out-of-distribution data, respectively, for a more balanced model. D$^3$GZSL comprises two core modu…

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: accepted as AAAI 2024 oral paper

  24. arXiv:2402.08821 [pdf, other]

    math.OC cs.DC

    Problem-Parameter-Free Decentralized Nonconvex Stochastic Optimization

    Authors: Jiaxiang Li, Xuxing Chen, Shiqian Ma, Mingyi Hong

    Abstract: Existing decentralized algorithms usually require knowledge of problem parameters for updating local iterates. For example, the hyperparameters (such as learning rate) usually require the knowledge of Lipschitz constant of the global gradient or topological information of the communication networks, which are usually not accessible in practice. In this paper, we propose D-NASA, the first algorithm…

    Submitted 13 February, 2024; originally announced February 2024.

  25. arXiv:2402.05358 [pdf, other]

    gr-qc astro-ph.CO hep-ph

    Starobinsky Inflation and beyond in Einstein-Cartan Gravity

    Authors: Minxi He, Muzi Hong, Kyohei Mukaida

    Abstract: We show that various types of scalaron-induced inflation, including the Starobinsky inflation, can be realized in the Einstein-Cartan gravity with the Nieh-Yan term and/or the Holst term. Einstein-Cartan $f(R)$ theory is known not to induce an additional scalar degree of freedom, the scalaron, contrary to the case in the metric formalism. However, there exist geometric quantities other than the Ri…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 24 pages, 2 figures

    Report number: KEK-TH-2598, RESCEU-3/24, CTPU-PTC-24-05

    Journal ref: JCAP 05 (2024) 107

  26. arXiv:2401.12025 [pdf, other]

    cs.IT eess.SP math.OC

    A Survey of Recent Advances in Optimization Methods for Wireless Communications

    Authors: Ya-Feng Liu, Tsung-Hui Chang, Mingyi Hong, Zheyu Wu, Anthony Man-Cho So, Eduard A. Jorswieck, Wei Yu

    Abstract: Mathematical optimization is now widely regarded as an indispensable modeling and solution tool for the design of wireless communications systems. While optimization has played a significant role in the revolutionary progress in wireless communication and networking technologies from 1G to 5G and onto the future 6G, the innovations in wireless technologies have also substantially transformed the n…

    Submitted 7 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 39 pages, 5 figures, accepted for publication in IEEE Journal on Selected Areas in Communications

  27. arXiv:2401.11380 [pdf, other]

    cs.LG math.ST stat.ME stat.ML

    MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning

    Authors: Mao Hong, Zhiyue Zhang, Yue Wu, Yanxun Xu

    Abstract: Model-based offline reinforcement learning (RL) methods have achieved state-of-the-art performance in many decision-making problems thanks to their sample efficiency and generalizability. Despite these advancements, existing model-based offline RL approaches either focus on theoretical studies without developing practical algorithms or rely on a restricted parametric policy space, thus not fully l…

    Submitted 20 January, 2024; originally announced January 2024.

  28. arXiv:2401.08893 [pdf, other]

    cs.LG math.OC

    MADA: Meta-Adaptive Optimizers through hyper-gradient Descent

    Authors: Kaan Ozkara, Can Karakus, Parameswaran Raman, Mingyi Hong, Shoham Sabach, Branislav Kveton, Volkan Cevher

    Abstract: Following the introduction of Adam, several novel adaptive optimizers for deep learning have been proposed. These optimizers typically excel in some tasks but may not outperform Adam uniformly across all tasks. In this work, we introduce Meta-Adaptive Optimizers (MADA), a unified optimizer framework that can generalize several known optimizers and dynamically learn the most suitable one during tra…

    Submitted 17 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  29. arXiv:2401.04133 [pdf, other]

    cs.LG cs.AI cs.SI

    SynHING: Synthetic Heterogeneous Information Network Generation for Graph Learning and Explanation

    Authors: Ming-Yi Hong, Yi-Hsiang Huang, Shao-En Lin, You-Chen Teng, Chih-Yu Wang, Che Lin

    Abstract: Graph Neural Networks (GNNs) excel in delineating graph structures in diverse domains, including community analysis and recommendation systems. As the interpretation of GNNs becomes increasingly important, the demand for robust baselines and expansive graph datasets is accentuated, particularly in the context of Heterogeneous Information Networks (HIN). Addressing this, we introduce SynHING, a nov…

    Submitted 29 May, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: Update figures, tables, and content

  30. arXiv:2401.03058 [pdf, other]

    math.OC cs.LG stat.ML

    Krylov Cubic Regularized Newton: A Subspace Second-Order Method with Dimension-Free Convergence Rate

    Authors: Ruichen Jiang, Parameswaran Raman, Shoham Sabach, Aryan Mokhtari, Mingyi Hong, Volkan Cevher

    Abstract: Second-order optimization methods, such as cubic regularized Newton methods, are known for their rapid convergence rates; nevertheless, they become impractical in high-dimensional problems due to their substantial memory requirements and computational costs. One promising approach is to execute second-order updates within a lower-dimensional subspace, giving rise to subspace second-order methods.…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 27 pages, 2 figures

  31. arXiv:2312.11388 [pdf, other]

    cs.HC

    BioSpark: An End-to-End Generative System for Biological-Analogical Inspirations and Ideation

    Authors: Hyeonsu B. Kang, David Chuan-En Lin, Nikolas Martelaro, Aniket Kittur, Yan-Ying Chen, Matthew K. Hong

    Abstract: Nature is often used to inspire solutions for complex engineering problems, but achieving its full potential is challenging due to difficulties in discovering relevant analogies and synthesizing from them. Here, we present an end-to-end system, BioSpark, that generates biological-analogical mechanisms and provides an interactive interface to comprehend and synthesize from them. BioSpark pipeline s…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 Workshop on Machine Learning for Creativity and Design

  32. arXiv:2312.06519 [pdf, other]

    cs.LG cs.AI cs.SI

    A GAN Approach for Node Embedding in Heterogeneous Graphs Using Subgraph Sampling

    Authors: Hung Chun Hsu, Bo-Jun Wu, Ming-Yi Hong, Che Lin, Chih-Yu Wang

    Abstract: Our research addresses class imbalance issues in heterogeneous graphs using graph neural networks (GNNs). We propose a novel method combining the strengths of Generative Adversarial Networks (GANs) with GNNs, creating synthetic nodes and edges that effectively balance the dataset. This approach directly targets and rectifies imbalances at the data level. The proposed framework resolves issues such…

    Submitted 11 December, 2023; originally announced December 2023.

  33. arXiv:2312.03395 [pdf, other]

    cs.RO cs.AI cs.LG

    Diffused Task-Agnostic Milestone Planner

    Authors: Mineui Hong, Minjae Kang, Songhwai Oh

    Abstract: Addressing decision-making problems using sequence modeling to predict future trajectories has shown promising results in recent years. In this paper, we take a step further to leverage the sequence predictive method in wider areas such as long-term planning, vision-based control, and multi-task decision-making. To this end, we propose a method to utilize a diffusion-based generative sequence model to…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 37th Conference on Neural Information Processing Systems

  34. arXiv:2311.14632 [pdf, other]

    cs.LG cs.CR

    Differentially Private SGD Without Clipping Bias: An Error-Feedback Approach

    Authors: Xinwei Zhang, Zhiqi Bu, Zhiwei Steven Wu, Mingyi Hong

    Abstract: Differentially Private Stochastic Gradient Descent with Gradient Clipping (DPSGD-GC) is a powerful tool for training deep learning models using sensitive data, providing both a solid theoretical privacy guarantee and high efficiency. However, using DPSGD-GC to ensure Differential Privacy (DP) comes at the cost of model performance degradation due to DP noise injection and gradient clipping. Existi…

    Submitted 17 April, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

  35. arXiv:2311.05590 [pdf, other]

    cs.HC cs.AI

    Conversational AI Threads for Visualizing Multidimensional Datasets

    Authors: Matt-Heun Hong, Anamaria Crisan

    Abstract: Generative Large Language Models (LLMs) show potential in data analysis, yet their full capabilities remain uncharted. Our work explores the capabilities of LLMs for creating and refining visualizations via conversational interfaces. We used an LLM to conduct a re-analysis of a prior Wizard-of-Oz study examining the use of chatbots for conducting visual analysis. We surfaced the strengths and weak…

    Submitted 9 November, 2023; originally announced November 2023.

  36. arXiv:2310.10780 [pdf, other]

    cs.CR cs.AI cs.LG

    Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

    Authors: Ganghua Wang, Xun Xian, Jayanth Srinivasa, Ashish Kundu, Xuan Bi, Mingyi Hong, Jie Ding

    Abstract: The growing dependence on machine learning in real-world applications emphasizes the importance of understanding and ensuring its safety. Backdoor attacks pose a significant security risk due to their stealthy nature and potentially serious consequences. Such attacks involve embedding triggers within a learning model with the intention of causing malicious behavior when an active trigger is presen…

    Submitted 17 October, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

  37. arXiv:2310.08782 [pdf, other]

    cs.LG cs.AI

    Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning

    Authors: Yihua Zhang, Yimeng Zhang, Aochuan Chen, Jinghan Jia, Jiancheng Liu, Gaowen Liu, Mingyi Hong, Shiyu Chang, Sijia Liu

    Abstract: Massive data is often considered essential for deep learning applications, but it also incurs significant computational and infrastructural costs. Therefore, dataset pruning (DP) has emerged as an effective way to improve data efficiency by identifying and removing redundant training samples without sacrificing performance. In this work, we aim to address the problem of DP for transfer learning, i…

    Submitted 18 November, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  38. arXiv:2309.08571 [pdf, other]

    cs.LG

    A Bayesian Approach to Robust Inverse Reinforcement Learning

    Authors: Ran Wei, Siliang Zeng, Chenliang Li, Alfredo Garcia, Anthony McDonald, Mingyi Hong

    Abstract: We consider a Bayesian approach to offline model-based inverse reinforcement learning (IRL). The proposed framework differs from existing offline model-based IRL approaches by performing simultaneous estimation of the expert's reward function and subjective model of environment dynamics. We make use of a class of prior distributions which parameterizes how accurate the expert's model of the enviro…

    Submitted 6 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  39. arXiv:2308.11207 [pdf, other]

    hep-ph astro-ph.CO hep-th

    Quartic Gradient Flow

    Authors: Muzi Hong, Ryusuke Jinno

    Abstract: Saddle-point configurations, such as the Euclidean bounce and sphalerons, are known to be difficult to find numerically. In this Letter we study a new method, Quartic Gradient Flow, to search for such configurations. The central idea is to introduce a gradient-flow-like equation in such a way that all the fluctuations around the saddle-point have eigenvalues that are the square of the eigenvalues of t…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 8 pages, 4 figures

    Report number: RESCEU-16/23

    Journal ref: Phys.Lett.B 849 (2024) 138441

  40. arXiv:2308.00788 [pdf, other]

    cs.LG math.OC

    An Introduction to Bi-level Optimization: Foundations and Applications in Signal Processing and Machine Learning

    Authors: Yihua Zhang, Prashant Khanduri, Ioannis Tsaknakis, Yuguang Yao, Mingyi Hong, Sijia Liu

    Abstract: Recently, bi-level optimization (BLO) has taken center stage in some very exciting developments in the area of signal processing (SP) and machine learning (ML). Roughly speaking, BLO is a classical optimization problem that involves two levels of hierarchy (i.e., upper and lower levels), wherein obtaining the solution to the upper-level problem requires solving the lower-level one. BLO has become…

    Submitted 20 December, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

  41. arXiv:2307.09484 [pdf, other]

    q-bio.BM cs.CE cs.LG physics.chem-ph

    MolFM: A Multimodal Molecular Foundation Model

    Authors: Yizhen Luo, Kai Yang, Massimo Hong, Xing Yi Liu, Zaiqing Nie

    Abstract: Molecular knowledge resides within three different modalities of information sources: molecular structures, biomedical documents, and knowledge bases. Effective incorporation of molecular knowledge from these modalities holds paramount significance in facilitating biomedical research. However, existing multimodal molecular foundation models exhibit limitations in capturing intricate connections be…

    Submitted 21 July, 2023; v1 submitted 6 June, 2023; originally announced July 2023.

    Comments: 31 pages, 15 figures, and 15 tables

  42. arXiv:2306.15774 [pdf]

    cs.HC cs.CL cs.CV cs.LG

    Next Steps for Human-Centered Generative AI: A Technical Perspective

    Authors: Xiang 'Anthony' Chen, Jeff Burke, Ruofei Du, Matthew K. Hong, Jennifer Jacobs, Philippe Laban, Dingzeyu Li, Nanyun Peng, Karl D. D. Willis, Chien-Sheng Wu, Bolei Zhou

    Abstract: Through iterative, cross-disciplinary discussions, we define and propose next-steps for Human-centered Generative AI (HGAI). We contribute a comprehensive research agenda that lays out future directions of Generative AI spanning three levels: aligning with human values; assimilating human intents; and augmenting human abilities. By identifying these next-steps, we intend to draw interdisciplinary…

    Submitted 22 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

  43. Illuminating all-hadronic final states with a photon: Exotic decays of the Higgs boson to four bottom quarks in vector boson fusion plus gamma at hadron colliders

    Authors: Stephen T. Roche, Benjamin T. Carlson, Christopher R. Hayes, Tae Min Hong

    Abstract: We investigate the potential to detect Higgs boson decays to four bottom quarks through a pair of pseudoscalars, a final state that is predicted by many theories beyond the Standard Model. For the first time, the signal sensitivity is evaluated for the final state using the vector boson fusion (VBF) production with and without an associated photon, for the Higgs at $m_H=125\,\textrm{GeV}$, at hadr…

    Submitted 28 June, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Report number: PITT-PACC-2313

    Journal ref: Phys. Rev. D 109 (2024) 11, 115029

  44. arXiv:2306.01217  [pdf, ps, other]

    cs.HC

    Generative AI for Product Design: Getting the Right Design and the Design Right

    Authors: Matthew K. Hong, Shabnam Hakimi, Yan-Ying Chen, Heishiro Toyoda, Charlene Wu, Matt Klenk

    Abstract: Generative AI (GenAI) models excel in their ability to recognize patterns in existing data and generate new and unexpected content. Recent advances have motivated applications of GenAI tools (e.g., Stable Diffusion, ChatGPT) to professional practice across industries, including product design. While these generative capabilities may seem enticing on the surface, certain barriers limit their practi…

    Submitted 1 June, 2023; originally announced June 2023.

  45. arXiv:2305.17083  [pdf, other]

    stat.ML cs.LG econ.EM math.ST stat.ME

    A Policy Gradient Method for Confounded POMDPs

    Authors: Mao Hong, Zhengling Qi, Yanxun Xu

    Abstract: In this paper, we propose a policy gradient method for confounded partially observable Markov decision processes (POMDPs) with continuous state and observation spaces in the offline setting. We first establish a novel identification result to non-parametrically estimate any history-dependent policy gradient under POMDPs using the offline data. The identification enables us to solve a sequence of c…

    Submitted 30 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 95 pages, 3 figures

  46. arXiv:2305.13146  [pdf, ps, other]

    math.PR

    Limit theorems for additive functionals of some self-similar Gaussian processes

    Authors: Minhao Hong, Heguang Liu, Fangjun Xu

    Abstract: Under certain mild conditions, limit theorems for additive functionals of some $d$-dimensional self-similar Gaussian processes are obtained. These limit theorems work for general Gaussian processes including fractional Brownian motions, sub-fractional Brownian motions and bi-fractional Brownian motions. To prove these results, we use the method of moments and an enhanced chaining argument. The Gau…

    Submitted 22 May, 2023; originally announced May 2023.

  47. arXiv:2305.12817  [pdf, other]

    cs.LG

    Conservative Physics-Informed Neural Networks for Non-Conservative Hyperbolic Conservation Laws Near Critical States

    Authors: Reyna Quita, Yu-Shuo Chen, Hsin-Yi Lee, Alex C. Hu, John M. Hong

    Abstract: In this paper, a modified version of conservative Physics-informed Neural Networks (cPINN for short) is provided to construct the weak solutions of Riemann problem for the hyperbolic scalar conservation laws in non-conservative form. To demonstrate the results, we use the model of generalized Buckley-Leverett equation (GBL equation for short) with discontinuous porosity in porous media. By inventi…

    Submitted 22 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 23 pages, 26 figures

    MSC Class: 35L03; 35L45; 65M99

  48. arXiv:2305.04241  [pdf, other]

    cs.CL cs.LG

    Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens

    Authors: Zhanpeng Zeng, Cole Hawkins, Mingyi Hong, Aston Zhang, Nikolaos Pappas, Vikas Singh, Shuai Zheng

    Abstract: Transformers are central in modern natural language processing and computer vision applications. Despite recent works devoted to reducing the quadratic cost of such models (as a function of the sequence length), dealing with ultra long sequences (e.g., with more than 16K tokens) remains challenging. Applications such as answering questions based on a book or summarizing a scientific article are in…

    Submitted 27 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 10 pages main text, 12 pages appendix, preprint

  49. arXiv:2305.01523  [pdf, other]

    cs.LG cs.AI cs.CE

    Towards Unified AI Drug Discovery with Multiple Knowledge Modalities

    Authors: Yizhen Luo, Xing Yi Liu, Kai Yang, Kui Huang, Massimo Hong, Jiahuan Zhang, Yushuai Wu, Zaiqing Nie

    Abstract: In recent years, AI models that mine intrinsic patterns from molecular structures and protein sequences have shown promise in accelerating drug discovery. However, these methods partly lag behind real-world pharmaceutical approaches of human experts that additionally grasp structured knowledge from knowledge bases and unstructured knowledge from biomedical literature. To bridge this gap, we propos…

    Submitted 14 October, 2023; v1 submitted 17 April, 2023; originally announced May 2023.

    Comments: 10 pages, 6 figures

  50. Baryogenesis from sphaleron decoupling

    Authors: Muzi Hong, Kohei Kamada, Jun'ichi Yokoyama

    Abstract: The electroweak sphaleron process breaks the baryon number conservation within the realms of the Standard Model of particle physics (SM). Recently, it is pointed out that its decoupling may provide the out-of-equilibrium condition required for baryogenesis. In this paper, we study such a scenario taking into account the baryon-number wash-out effect of the sphaleron itself to improve the estimate.…

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: 16 pages, 5 figures

    Report number: RESCEU-9/23

    Journal ref: Phys.Rev.D 108 (2023) 6, 063502