Skip to main content

Showing 1–24 of 24 results for author: Jian, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16821  [pdf, other

    cs.LG cs.AI physics.bio-ph physics.chem-ph q-bio.BM

    General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design

    Authors: Yue Jian, Curtis Wu, Danny Reidenbach, Aditi S. Krishnapriyan

    Abstract: Structure-Based Drug Design (SBDD) focuses on generating valid ligands that strongly and specifically bind to a designated protein pocket. Several methods use machine learning for SBDD to generate these ligands in 3D space, conditioned on the structure of a desired protein pocket. Recently, diffusion models have shown success here by modeling the underlying distributions of atomic positions and ty… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.06799  [pdf, other

    cs.DC cs.CL

    LLM-dCache: Improving Tool-Augmented LLMs with GPT-Driven Localized Data Caching

    Authors: Simranjit Singh, Michael Fore, Andreas Karatzas, Chaehong Lee, Yanan Jian, Longfei Shangguan, Fuxun Yu, Iraklis Anagnostopoulos, Dimitrios Stamoulis

    Abstract: As Large Language Models (LLMs) broaden their capabilities to manage thousands of API calls, they are confronted with complex data operations across vast datasets with significant overhead to the underlying system. In this work, we introduce LLM-dCache to optimize data accesses by treating cache operations as callable API functions exposed to the tool-augmented agent. We grant LLMs the autonomy to… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2311.12345  [pdf, other

    cs.CV cs.AI cs.LG

    Stable Diffusion For Aerial Object Detection

    Authors: Yanan Jian, Fuxun Yu, Simranjit Singh, Dimitrios Stamoulis

    Abstract: Aerial object detection is a challenging task, in which one major obstacle lies in the limitations of large-scale data collection and the long-tail distribution of certain classes. Synthetic data offers a promising solution, especially with recent advances in diffusion-based methods like stable diffusion (SD). However, the direct application of diffusion methods to aerial domains poses unique chal… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023 Synthetic Data Generation with Generative AI workshop

  4. arXiv:2310.03291  [pdf, other

    cs.CV

    Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction

    Authors: Yiren Jian, Tingkai Liu, Yunzhe Tao, Chunhui Zhang, Soroush Vosoughi, Hongxia Yang

    Abstract: In this paper, we introduce $\text{EVL}_{\text{Gen}}$, a streamlined framework designed for the pre-training of visually conditioned language generation models with high computational demands, utilizing frozen pre-trained large language models (LLMs). The conventional approach in vision-language pre-training (VLP) typically involves a two-stage optimization process: an initial resource-intensive p… ▽ More

    Submitted 21 February, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  5. arXiv:2307.07063  [pdf, other

    cs.CV cs.LG

    Bootstrap** Vision-Language Learning with Decoupled Language Pre-training

    Authors: Yiren Jian, Chongyang Gao, Soroush Vosoughi

    Abstract: We present a novel methodology aimed at optimizing the application of frozen large language models (LLMs) for resource-intensive vision-language (VL) pre-training. The current paradigm uses visual features as prompts to guide language models, with a focus on determining the most relevant visual features for corresponding text. Our approach diverges by concentrating on the language component, speci… ▽ More

    Submitted 19 December, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: Accepted to NeurIPS 2023 (spotlight). The code is available at https://github.com/yiren-jian/BLIText

  6. arXiv:2306.08843  [pdf, other

    cs.AI cs.MA

    Real-Time Network-Level Traffic Signal Control: An Explicit Multiagent Coordination Method

    Authors: Wanyuan Wang, Tianchi Qiao, **ming Ma, Jiahui **, Zhibin Li, Weiwei Wu, Yichuan Jian

    Abstract: Efficient traffic signal control (TSC) has been one of the most useful ways for reducing urban road congestion. Key to the challenge of TSC includes 1) the essential of real-time signal decision, 2) the complexity in traffic dynamics, and 3) the network-level coordination. Recent efforts that applied reinforcement learning (RL) methods can query policies by map** the traffic state to the signal… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  7. arXiv:2302.06120  [pdf, other

    q-bio.QM cs.LG

    Knowledge from Large-Scale Protein Contact Prediction Models Can Be Transferred to the Data-Scarce RNA Contact Prediction Task

    Authors: Yiren Jian, Chongyang Gao, Chen Zeng, Yunjie Zhao, Soroush Vosoughi

    Abstract: RNA, whose functionality is largely determined by its structure, plays an important role in many biological activities. The prediction of pairwise structural proximity between each nucleotide of an RNA sequence can characterize the structural information of the RNA. Historically, this problem has been tackled by machine learning models using expert-engineered features and trained on scarce labeled… ▽ More

    Submitted 18 January, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: The code is available at https://github.com/yiren-jian/CoT-RNA-Transfer

  8. arXiv:2210.01120  [pdf, other

    physics.chem-ph cs.LG

    Predicting CO$_2$ Absorption in Ionic Liquids with Molecular Descriptors and Explainable Graph Neural Networks

    Authors: Yue Jian, Yuyang Wang, Amir Barati Farimani

    Abstract: Ionic Liquids (ILs) provide a promising solution for CO$_2$ capture and storage to mitigate global warming. However, identifying and designing the high-capacity IL from the giant chemical space requires expensive, and exhaustive simulations and experiments. Machine learning (ML) can accelerate the process of searching for desirable ionic molecules through accurate and efficient property prediction… ▽ More

    Submitted 9 November, 2022; v1 submitted 29 September, 2022; originally announced October 2022.

  9. arXiv:2209.09433  [pdf, other

    cs.CL

    Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings

    Authors: Yiren Jian, Chongyang Gao, Soroush Vosoughi

    Abstract: Semantic representation learning for sentences is an important and well-studied problem in NLP. The current trend for this task involves training a Transformer-based sentence encoder through a contrastive objective with text, i.e., clustering sentences with semantically similar meanings and scattering others. In this work, we find the performance of Transformer models as sentence encoders can be i… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022

  10. arXiv:2205.01308  [pdf, other

    cs.CL cs.AI

    Contrastive Learning for Prompt-Based Few-Shot Language Learners

    Authors: Yiren Jian, Chongyang Gao, Soroush Vosoughi

    Abstract: The impressive performance of GPT-3 using natural language prompts and in-context learning has inspired work on better fine-tuning of moderately-sized models under this paradigm. Following this line of work, we present a contrastive learning framework that clusters inputs from the same class for better generality of models trained with only limited examples. Specifically, we propose a supervised c… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: accepted to NAACL 2022

  11. arXiv:2205.01307  [pdf, other

    cs.CL cs.AI

    Embedding Hallucination for Few-Shot Language Fine-tuning

    Authors: Yiren Jian, Chongyang Gao, Soroush Vosoughi

    Abstract: Few-shot language learners adapt knowledge from a pre-trained model to recognize novel classes from a few-labeled sentences. In such settings, fine-tuning a pre-trained language model can cause severe over-fitting. In this paper, we propose an Embedding Hallucination (EmbedHalluc) method, which generates auxiliary embedding-label pairs to expand the fine-tuning dataset. The hallucinator is trained… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: accepted to NAACL 2022

  12. arXiv:2112.03340  [pdf, other

    cs.CV cs.LG

    Label Hallucination for Few-Shot Classification

    Authors: Yiren Jian, Lorenzo Torresani

    Abstract: Few-shot classification requires adapting knowledge learned from a large annotated base dataset to recognize novel unseen classes, each represented by few labeled examples. In such a scenario, pretraining a network with high capacity on the large dataset and then finetuning it on the few examples causes severe overfitting. At the same time, training a simple linear classifier on top of "frozen" fe… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022. Code is available: https://github.com/yiren-jian/LabelHalluc

  13. arXiv:2110.01777  [pdf, other

    cs.CV cs.AI

    MetaPix: Domain Transfer for Semantic Segmentation by Meta Pixel Weighting

    Authors: Yiren Jian, Chongyang Gao

    Abstract: Training a deep neural model for semantic segmentation requires collecting a large amount of pixel-level labeled data. To alleviate the data scarcity problem presented in the real world, one could utilize synthetic data whose label is easy to obtain. Previous work has shown that the performance of a semantic segmentation model can be improved by training jointly with real and synthetic examples wi… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Journal ref: Vision and Image Computing, 2021

  14. arXiv:2108.06180  [pdf, other

    cs.AI cs.CV

    SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environments

    Authors: Jiafei Duan, Samson Yu Bai Jian, Cheston Tan

    Abstract: Recent advancements in deep learning, computer vision, and embodied AI have given rise to synthetic causal reasoning video datasets. These datasets facilitate the development of AI algorithms that can reason about physical interactions between objects. However, datasets thus far have primarily focused on elementary physical events such as rolling or falling. There is currently a scarcity of datase… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 21, Simulation Technology for Embodied AI (SEAI) Workshop

  15. arXiv:2108.06107  [pdf, other

    cs.CL cs.AI

    Aspect Sentiment Triplet Extraction Using Reinforcement Learning

    Authors: Samson Yu Bai Jian, Tapas Nayak, Navonil Majumder, Soujanya Poria

    Abstract: Aspect Sentiment Triplet Extraction (ASTE) is the task of extracting triplets of aspect terms, their associated sentiments, and the opinion terms that provide evidence for the expressed sentiments. Previous approaches to ASTE usually simultaneously extract all three components or first identify the aspect and opinion terms, then pair them up to predict their sentiment polarities. In this work, we… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: CIKM 2021

  16. arXiv:2012.11820  [pdf, other

    cs.CL

    Recognizing Emotion Cause in Conversations

    Authors: Soujanya Poria, Navonil Majumder, Devamanyu Hazarika, Deepanway Ghosal, Rishabh Bhardwaj, Samson Yu Bai Jian, Pengfei Hong, Romila Ghosh, Abhinaba Roy, Niyati Chhaya, Alexander Gelbukh, Rada Mihalcea

    Abstract: We address the problem of recognizing emotion cause in conversations, define two novel sub-tasks of this problem, and provide a corresponding dialogue-level dataset, along with strong Transformer-based baselines. The dataset is available at https://github.com/declare-lab/RECCON. Introduction: Recognizing the cause behind emotions in text is a fundamental yet under-explored area of research in NL… ▽ More

    Submitted 28 July, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: https://github.com/declare-lab/RECCON, Accepted at Cognitive Computation

  17. arXiv:2005.14359  [pdf, other

    cs.LG stat.ML

    Unsupervised Feature Selection via Multi-step Markov Transition Probability

    Authors: Yan Min, Mao Ye, Liang Tian, Yulin Jian, Ce Zhu, Shangming Yang

    Abstract: Feature selection is a widely used dimension reduction technique to select feature subsets because of its interpretability. Many methods have been proposed and achieved good results, in which the relationships between adjacent data points are mainly concerned. But the possible associations between data pairs that are may not adjacent are always neglected. Different from previous methods, we propos… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

  18. arXiv:1911.05920  [pdf, other

    cs.LG cs.CV

    Understanding the Disharmony between Weight Normalization Family and Weight Decay: $ε-$shifted $L_2$ Regularizer

    Authors: Li Xiang, Chen Shuo, Xia Yan, Yang Jian

    Abstract: The merits of fast convergence and potentially better performance of the weight normalization family have drawn increasing attention in recent years. These methods use standardization or normalization that changes the weight $\boldsymbol{W}$ to $\boldsymbol{W}'$, which makes $\boldsymbol{W}'$ independent to the magnitude of $\boldsymbol{W}$. Surprisingly, $\boldsymbol{W}$ must be decayed during gr… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: 12 pages, 9 figures

  19. A Simple Proof of Maxwell Saturation for Coupled Scalar Recursions

    Authors: Arvind Yedla, Yung-Yih Jian, Phong S. Nguyen, Henry D. Pfister

    Abstract: Low-density parity-check (LDPC) convolutional codes (or spatially-coupled codes) were recently shown to approach capacity on the binary erasure channel (BEC) and binary-input memoryless symmetric channels. The mechanism behind this spectacular performance is now called threshold saturation via spatial coupling. This new phenomenon is characterized by the belief-propagation threshold of the spatial… ▽ More

    Submitted 11 September, 2014; v1 submitted 30 September, 2013; originally announced September 2013.

    Comments: This article is an extended journal version of arXiv:1204.5703 and has now been accepted to the IEEE Transactions on Information Theory. This version adds additional explanation for some details and also corrects a number of small typos

  20. arXiv:1208.4080  [pdf, ps, other

    cs.IT

    A Simple Proof of Threshold Saturation for Coupled Vector Recursions

    Authors: Arvind Yedla, Yung-Yih Jian, Phong S. Nguyen, Henry D. Pfister

    Abstract: Convolutional low-density parity-check (LDPC) codes (or spatially-coupled codes) have now been shown to achieve capacity on binary-input memoryless symmetric channels. The principle behind this surprising result is the threshold-saturation phenomenon, which is defined by the belief-propagation threshold of the spatially-coupled ensemble saturating to a fundamental threshold defined by the uncouple… ▽ More

    Submitted 24 January, 2013; v1 submitted 20 August, 2012; originally announced August 2012.

    Comments: 7 pages, a slightly extended version of the paper with that appears in the proceedings of ITW 2012

  21. arXiv:1204.5703  [pdf, ps, other

    cs.IT

    A Simple Proof of Threshold Saturation for Coupled Scalar Recursions

    Authors: Arvind Yedla, Yung-Yih Jian, Phong S. Nguyen, Henry D. Pfister

    Abstract: Low-density parity-check (LDPC) convolutional codes (or spatially-coupled codes) have been shown to approach capacity on the binary erasure channel (BEC) and binary-input memoryless symmetric channels. The mechanism behind this spectacular performance is the threshold saturation phenomenon, which is characterized by the belief-propagation threshold of the spatially-coupled ensemble increasing to a… ▽ More

    Submitted 19 October, 2013; v1 submitted 25 April, 2012; originally announced April 2012.

    Comments: In this update, there are a few small changes to Def. 5, Def. 6, and Remark 1. These changes avoid a pathological counterexample that is described in arXiv:1309.7910. The original version appears in the proceedings of ISTC 2012

  22. arXiv:1202.6095  [pdf, other

    cs.IT

    Approaching Capacity at High-Rates with Iterative Hard-Decision Decoding

    Authors: Yung-Yih Jian, Henry D. Pfister, Krishna R. Narayanan

    Abstract: A variety of low-density parity-check (LDPC) ensembles have now been observed to approach capacity with message-passing decoding. However, all of them use soft (i.e., non-binary) messages and a posteriori probability (APP) decoding of their component codes. In this paper, we show that one can approach capacity at high rates using iterative hard-decision decoding (HDD) of generalized product codes.… ▽ More

    Submitted 17 May, 2017; v1 submitted 27 February, 2012; originally announced February 2012.

    Comments: 22 pages, this version accepted to the IEEE Transactions on Information Theory

  23. arXiv:1107.3177  [pdf, ps, other

    cs.IT

    Convergence of Weighted Min-Sum Decoding Via Dynamic Programming on Trees

    Authors: Yung-Yih Jian, Henry D. Pfister

    Abstract: Applying the max-product (and belief-propagation) algorithms to loopy graphs is now quite popular for best assignment problems. This is largely due to their low computational complexity and impressive performance in practice. Still, there is no general understanding of the conditions required for convergence and/or the optimality of converged solutions. This paper presents an analysis of both atte… ▽ More

    Submitted 15 July, 2011; originally announced July 2011.

    Comments: 43 pages, 3 figures

  24. arXiv:cs/9902016  [pdf

    cs.DL

    Multimedia Description Framework (MDF) for Content Description of Audio/Video Documents

    Authors: Michael J. Hu, Ye Jian

    Abstract: MPEG is undertaking a new initiative to standardize content description of audio and video data/documents. When it is finalized in 2001, MPEG-7 is expected to provide standardized description schemes for concise and unambiguous content description of data/documents of complex media types. Meanwhile, other meta-data or description schemes, such as Dublin Core, XML, etc., are becoming popular in d… ▽ More

    Submitted 8 February, 1999; originally announced February 1999.

    Comments: 20 pages

    ACM Class: H3.3; H3.7