Skip to main content

Showing 1–14 of 14 results for author: Gong, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.01606  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    On Discrete Prompt Optimization for Diffusion Models

    Authors: Ruochen Wang, Ting Liu, Cho-Jui Hsieh, Boqing Gong

    Abstract: This paper introduces the first gradient-based framework for prompt optimization in text-to-image diffusion models. We formulate prompt engineering as a discrete optimization problem over the language space. Two major challenges arise in efficiently finding a solution to this problem: (1) Enormous Domain Space: Setting the domain to the entire language space poses significant difficulty to the opt… ▽ More

    Submitted 26 June, 2024; originally announced July 2024.

    Comments: ICML 2024. Code available at https://github.com/ruocwang/dpo-diffusion

    MSC Class: 68T01

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  2. arXiv:2207.01813  [pdf, other

    q-bio.QM stat.AP

    Stochastic Variational Methods in Generalized Hidden Semi-Markov Models to Characterize Functionality in Random Heteropolymers

    Authors: Yun Zhou, Boying Gong, Tao Jiang, Ting Xu, Haiyan Huang

    Abstract: Recent years have seen substantial advances in the development of biofunctional materials using synthetic polymers. The growing problem of elusive sequence-functionality relations for most biomaterials has driven researchers to seek more effective tools and analysis methods. In this study, statistical models are used to study sequence features of the recently reported random heteropolymers (RHP),… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  3. arXiv:2005.00570  [pdf, ps, other

    cs.LG cs.CV stat.ML

    When Ensembling Smaller Models is More Efficient than Single Large Models

    Authors: Dan Kondratyuk, Mingxing Tan, Matthew Brown, Boqing Gong

    Abstract: Ensembling is a simple and popular technique for boosting evaluation performance by training multiple models (e.g., with different initializations) and aggregating their predictions. This approach is commonly reserved for the largest models, as it is commonly held that increasing the model size provides a more substantial reduction in error than ensembling smaller models. However, we show results… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

  4. arXiv:2003.10780  [pdf, other

    cs.CV cs.LG stat.ML

    Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition from a Domain Adaptation Perspective

    Authors: Muhammad Abdullah Jamal, Matthew Brown, Ming-Hsuan Yang, Liqiang Wang, Boqing Gong

    Abstract: Object frequency in the real world often follows a power law, leading to a mismatch between datasets with long-tailed class distributions seen by a machine learning model and our expectation of the model to perform well on all classes. We analyze this mismatch from a domain adaptation point of view. First of all, we connect existing class-balanced methods for long-tailed classification to target s… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at CVPR2020

  5. arXiv:2001.02378  [pdf, other

    cs.LG cs.CR stat.ML

    MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius

    Authors: Runtian Zhai, Chen Dan, Di He, Huan Zhang, Boqing Gong, Pradeep Ravikumar, Cho-Jui Hsieh, Liwei Wang

    Abstract: Adversarial training is one of the most popular ways to learn robust models but is usually attack-dependent and time costly. In this paper, we propose the MACER algorithm, which learns robust models without using adversarial training but performs better than all existing provable l2-defenses. Recent work shows that randomized smoothing can be used to provide a certified l2 radius to smoothed class… ▽ More

    Submitted 14 March, 2022; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: Published in ICLR 2020. 20 Pages

  6. arXiv:1909.03403  [pdf, other

    cs.CV cs.LG stat.ML

    Open Compound Domain Adaptation

    Authors: Ziwei Liu, Zhongqi Miao, Xingang Pan, Xiaohang Zhan, Dahua Lin, Stella X. Yu, Boqing Gong

    Abstract: A typical domain adaptation approach is to adapt models trained on the annotated data in a source domain (e.g., sunny weather) for achieving high performance on the test data in a target domain (e.g., rainy weather). Whether the target contains a single homogeneous domain or multiple heterogeneous domains, existing works always assume that there exist clear distinctions between the domains, which… ▽ More

    Submitted 29 March, 2020; v1 submitted 8 September, 2019; originally announced September 2019.

    Comments: To appear in CVPR 2020 as an oral presentation. Code, datasets and models are available at: https://liuziwei7.github.io/projects/CompoundDomain.html

  7. arXiv:1905.00441  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks

    Authors: Yandong Li, Lijun Li, Liqiang Wang, Tong Zhang, Boqing Gong

    Abstract: Powerful adversarial attack methods are vital for understanding how to construct robust deep neural networks (DNNs) and for thoroughly testing defense techniques. In this paper, we propose a black-box adversarial attack algorithm that can defeat both vanilla DNNs and those generated by various defense techniques developed recently. Instead of searching for an "optimal" adversarial example for a be… ▽ More

    Submitted 9 December, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

  8. arXiv:1904.03276  [pdf, other

    cs.LG stat.ML

    Synthesized Policies for Transfer and Adaptation across Tasks and Environments

    Authors: Hexiang Hu, Liyu Chen, Boqing Gong, Fei Sha

    Abstract: The ability to transfer in reinforcement learning is key towards building an agent of general artificial intelligence. In this paper, we consider the problem of learning to simultaneously transfer across both environments (ENV) and tasks (TASK), probably more importantly, by learning from only sparse (ENV, TASK) pairs out of all the possible combinations. We propose a novel compositional neural ne… ▽ More

    Submitted 26 May, 2021; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: presented at NeurIPS 2018 as a Spotlight

  9. arXiv:1902.09255  [pdf, other

    cs.LG cs.SI stat.ML

    Joint Modeling of Dense and Incomplete Trajectories for Citywide Traffic Volume Inference

    Authors: Xianfeng Tang, Boqing Gong, Yanwei Yu, Huaxiu Yao, Yandong Li, Haiyong Xie, Xiaoyu Wang

    Abstract: Real-time traffic volume inference is key to an intelligent city. It is a challenging task because accurate traffic volumes on the roads can only be measured at certain locations where sensors are installed. Moreover, the traffic evolves over time due to the influences of weather, events, holidays, etc. Existing solutions to the traffic volume inference problem often rely on dense GPS trajectories… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: Accepted by The Web Conference (WWW) 2019

  10. arXiv:1807.10957  [pdf, other

    cs.LG stat.ML

    Improving Sequential Determinantal Point Processes for Supervised Video Summarization

    Authors: Aidean Sharghi, Ali Borji, Chengtao Li, Tianbao Yang, Boqing Gong

    Abstract: It is now much easier than ever before to produce videos. While the ubiquitous video data is a great source for information discovery and extraction, the computational challenges are unparalleled. Automatically summarizing the videos has become a substantial need for browsing, searching, and indexing visual content. This paper is in the vein of supervised video summarization using sequential deter… ▽ More

    Submitted 24 October, 2018; v1 submitted 28 July, 2018; originally announced July 2018.

  11. arXiv:1803.01541  [pdf, other

    cs.CV cs.LG stat.ML

    Improving the Improved Training of Wasserstein GANs: A Consistency Term and Its Dual Effect

    Authors: Xiang Wei, Boqing Gong, Zixia Liu, Wei Lu, Liqiang Wang

    Abstract: Despite being impactful on a variety of problems and applications, the generative adversarial nets (GANs) are remarkably difficult to train. This issue is formally analyzed by \cite{arjovsky2017towards}, who also propose an alternative direction to avoid the caveats in the minmax two-player training of GANs. The corresponding algorithm, called Wasserstein GAN (WGAN), hinges on the 1-Lipschitz cont… ▽ More

    Submitted 5 March, 2018; originally announced March 2018.

    Comments: Accepted as a conference paper in International Conference on Learning Representation(ICLR). Xiang Wei and Boqing Gong contributed equally in this work

  12. arXiv:1802.01549  [pdf, other

    cs.LG cs.AI stat.ML

    Blind Pre-Processing: A Robust Defense Method Against Adversarial Examples

    Authors: Adnan Siraj Rakin, Zhezhi He, Boqing Gong, Deliang Fan

    Abstract: Deep learning algorithms and networks are vulnerable to perturbed inputs which is known as the adversarial attack. Many defense methodologies have been investigated to defend against such adversarial attack. In this work, we propose a novel methodology to defend the existing powerful attack model. We for the first time introduce a new attacking scheme for the attacker and set a practical constrain… ▽ More

    Submitted 7 February, 2018; v1 submitted 5 February, 2018; originally announced February 2018.

  13. arXiv:1602.02220  [pdf, ps, other

    cs.LG stat.ML

    Improved Dropout for Shallow and Deep Learning

    Authors: Zhe Li, Boqing Gong, Tianbao Yang

    Abstract: Dropout has been witnessed with great success in training deep neural networks by independently zeroing out the outputs of neurons at random. It has also received a surge of interest for shallow learning, e.g., logistic regression. However, the independent sampling for dropout could be suboptimal for the sake of convergence. In this paper, we propose to use multinomial sampling for dropout, i.e.,… ▽ More

    Submitted 4 December, 2016; v1 submitted 6 February, 2016; originally announced February 2016.

    Comments: In NIPS 2016

  14. arXiv:1411.1537  [pdf, ps, other

    stat.ML cs.CV cs.LG

    Large-Margin Determinantal Point Processes

    Authors: Boqing Gong, Wei-lun Chao, Kristen Grauman, Fei Sha

    Abstract: Determinantal point processes (DPPs) offer a powerful approach to modeling diversity in many applications where the goal is to select a diverse subset. We study the problem of learning the parameters (the kernel matrix) of a DPP from labeled training data. We make two contributions. First, we show how to reparameterize a DPP's kernel matrix with multiple kernel functions, thus enhancing modeling f… ▽ More

    Submitted 7 November, 2014; v1 submitted 6 November, 2014; originally announced November 2014.

    Comments: 15 pages