Skip to main content

Showing 1–8 of 8 results for author: Vydiswaran, V

.
  1. arXiv:2402.15000  [pdf, other

    cs.CL cs.LG

    Divide-or-Conquer? Which Part Should You Distill Your LLM?

    Authors: Zhuofeng Wu, He Bai, Aonan Zhang, Jiatao Gu, VG Vinod Vydiswaran, Navdeep Jaitly, Yizhe Zhang

    Abstract: Recent methods have demonstrated that Large Language Models (LLMs) can solve reasoning tasks better when they are encouraged to solve subtasks of the main task first. In this paper we devise a similar strategy that breaks down reasoning tasks into a problem decomposition phase and a problem solving phase and show that the strategy is able to outperform a single stage solution. Further, we hypothes… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  2. arXiv:2310.09720  [pdf, other

    cs.CL cs.LG

    HiCL: Hierarchical Contrastive Learning of Unsupervised Sentence Embeddings

    Authors: Zhuofeng Wu, Chaowei Xiao, VG Vinod Vydiswaran

    Abstract: In this paper, we propose a hierarchical contrastive learning framework, HiCL, which considers local segment-level and global sequence-level relationships to improve training efficiency and effectiveness. Traditional methods typically encode a sequence in its entirety for contrast with others, often neglecting local representation learning, leading to challenges in generalizing to shorter texts. C… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: In Proceedings of Findings EMNLP 2023

  3. arXiv:2305.02394  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Defending against Insertion-based Textual Backdoor Attacks via Attribution

    Authors: Jiazhao Li, Zhuofeng Wu, Wei **, Chaowei Xiao, V. G. Vinod Vydiswaran

    Abstract: Textual backdoor attack, as a novel attack model, has been shown to be effective in adding a backdoor to the model during training. Defending against such backdoor attacks has become urgent and important. In this paper, we propose AttDef, an efficient attribution-based pipeline to defend against two insertion-based poisoning attacks, BadNL and InSent. Specifically, we regard the tokens with larger… ▽ More

    Submitted 6 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023. Camera-ready version

    Report number: 15 pages

    Journal ref: Findings of ACL 2023, July 2023, Page 8818-8833, Toronto, Canada

  4. arXiv:2304.14475  [pdf, other

    cs.CR cs.LG

    ChatGPT as an Attack Tool: Stealthy Textual Backdoor Attack via Blackbox Generative Model Trigger

    Authors: Jiazhao Li, Yi** Yang, Zhuofeng Wu, V. G. Vinod Vydiswaran, Chaowei Xiao

    Abstract: Textual backdoor attacks pose a practical threat to existing systems, as they can compromise the model by inserting imperceptible triggers into inputs and manipulating labels in the training dataset. With cutting-edge generative models such as GPT-4 pushing rewriting to extraordinary levels, such attacks are becoming even harder to detect. We conduct a comprehensive investigation of the role of bl… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  5. arXiv:2204.04497  [pdf, other

    cs.CL cs.LG

    IDPG: An Instance-Dependent Prompt Generation Method

    Authors: Zhuofeng Wu, Sinong Wang, Jiatao Gu, Rui Hou, Yuxiao Dong, V. G. Vinod Vydiswaran, Hao Ma

    Abstract: Prompt tuning is a new, efficient NLP transfer learning paradigm that adds a task-specific prompt in each input instance during the model training stage. It freezes the pre-trained language model and only optimizes a few task-specific prompts. In this paper, we propose a conditional prompt generation method to generate prompts for each input instance, referred to as the Instance-Dependent Prompt G… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: To appear at the NAACL 2022 main conference

  6. PharmMT: A Neural Machine Translation Approach to Simplify Prescription Directions

    Authors: Jiazhao Li, Corey Lester, Xinyan Zhao, Yuting Ding, Yun Jiang, V. G. Vinod Vydiswaran

    Abstract: The language used by physicians and health professionals in prescription directions includes medical jargon and implicit directives and causes much confusion among patients. Human intervention to simplify the language at the pharmacies may introduce additional errors that can lead to potentially severe health outcomes. We propose a novel machine translation-based approach, PharmMT, to automaticall… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Findings of EMNLP '20 Camera Ready

    Journal ref: Findings of EMNLP (2020) 2785--2796

  7. arXiv:2012.09157  [pdf, other

    cs.CL cs.AI

    LIREx: Augmenting Language Inference with Relevant Explanation

    Authors: Xinyan Zhao, V. G. Vinod Vydiswaran

    Abstract: Natural language explanations (NLEs) are a special form of data annotation in which annotators identify rationales (most significant text tokens) when assigning labels to data instances, and write out explanations for the labels in natural language based on the rationales. NLEs have been shown to capture human reasoning better, but not as beneficial for natural language inference (NLI). In this pa… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: Accepted at AAAI 2021

  8. arXiv:1601.05140  [pdf

    cs.SI cs.AI cs.CY physics.data-an physics.soc-ph

    The DARPA Twitter Bot Challenge

    Authors: V. S. Subrahmanian, Amos Azaria, Skylar Durst, Vadim Kagan, Aram Galstyan, Kristina Lerman, Linhong Zhu, Emilio Ferrara, Alessandro Flammini, Filippo Menczer, Andrew Stevens, Alexander Dekhtyar, Shuyang Gao, Tad Hogg, Farshad Kooti, Yan Liu, Onur Varol, Prashant Shiralkar, Vinod Vydiswaran, Qiaozhu Mei, Tim Hwang

    Abstract: A number of organizations ranging from terrorist groups such as ISIS to politicians and nation states reportedly conduct explicit campaigns to influence opinion on social media, posing a risk to democratic processes. There is thus a growing need to identify and eliminate "influence bots" - realistic, automated identities that illicitly shape discussion on sites like Twitter and Facebook - before t… ▽ More

    Submitted 21 April, 2016; v1 submitted 19 January, 2016; originally announced January 2016.

    Comments: IEEE Computer Magazine, in press

    Journal ref: Computer 49 (6), 38-46. IEEE, 2016