Skip to main content

Showing 1–50 of 104 results for author: Liu, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00945  [pdf, other

    cs.LG

    Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

    Authors: Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: The rapid advancement of large language models (LLMs) has led to architectures with billions to trillions of parameters, posing significant deployment challenges due to their substantial demands on memory, processing power, and energy consumption. Sparse Mixture-of-Experts (SMoE) architectures have emerged as a solution, activating only a subset of parameters per token, thereby achieving faster in… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.20076  [pdf, other

    cs.CV

    EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model

    Authors: Yuxuan Zhang, Tianheng Cheng, Rui Hu, ei Liu, Heng Liu, Long** Ran, Xiaoxin Chen, Wenyu Liu, Xinggang Wang

    Abstract: Segment Anything Model (SAM) has attracted widespread attention for its superior interactive segmentation capabilities with visual prompts while lacking further exploration of text prompts. In this paper, we empirically investigate what text prompt encoders (e.g., CLIP or LLM) are good for adapting SAM for referring expression segmentation and introduce the Early Vision-language Fusion-based SAM (… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Preprint

  3. Blockchain Based Zero-Knowledge Proof of Location in IoT

    Authors: Wei Wu, Erwu Liu, Xinglin Gong, Rui Wang

    Abstract: With the development of precise positioning technology, a growing number of location-based services (LBSs) facilitate people's life. Most LBSs require proof of location (PoL) to prove that the user satisfies the service requirement, which exposes the user's privacy. In this paper, we propose a zero-knowledge proof of location (zk-PoL) protocol to better protect the user's privacy. With the zk-PoL… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Published on ICC 2020-2020 IEEE International Conference on Communications (ICC)

  4. arXiv:2406.11830  [pdf, other

    cs.CL cs.AI

    Language Modeling with Editable External Knowledge

    Authors: Belinda Z. Li, Emmy Liu, Alexis Ross, Abbas Zeitoun, Graham Neubig, Jacob Andreas

    Abstract: When the world changes, so does the text that humans write about it. How do we build language models that can be easily updated to reflect these changes? One popular approach is retrieval-augmented generation, in which new documents are inserted into a knowledge base and retrieved during prediction for downstream tasks. Most prior work on these systems have focused on improving behavior during pre… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.09181  [pdf, other

    cs.CV cs.AI

    A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

    Authors: Yijun Bei, Hengrui Lou, **song Geng, Erteng Liu, Lechao Cheng, Jie Song, Mingli Song, Zunlei Feng

    Abstract: With the rapid development of AI-generated content (AIGC) technology, the production of realistic fake facial images and videos that deceive human visual perception has become possible. Consequently, various face forgery detection techniques have been proposed to identify such fake facial content. However, evaluating the effectiveness and generalizability of these detection techniques remains a si… ▽ More

    Submitted 13 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: This is a paper about constructing a large-scale universal evaluation benchmark for face forgery detection.The full text is 30 pages

  6. arXiv:2406.02540  [pdf, other

    cs.CV

    ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

    Authors: Tianchen Zhao, Tongcheng Fang, Enshu Liu, Rui Wan, Widyadewi Soedarmadji, Shiyao Li, Zinan Lin, Guohao Dai, Shengen Yan, Huazhong Yang, Xuefei Ning, Yu Wang

    Abstract: Diffusion transformers (DiTs) have exhibited remarkable performance in visual generation tasks, such as generating realistic images or videos based on textual instructions. However, larger model sizes and multi-frame processing for video generation lead to increased computational and memory costs, posing challenges for practical deployment on edge devices. Post-Training Quantization (PTQ) is an ef… ▽ More

    Submitted 30 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Project Page: https://a-suozhang.xyz/viditq.github.io/

  7. arXiv:2406.01103  [pdf, other

    cs.AI cs.HC cs.LG

    Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment

    Authors: Chen Zhang, Qiang He, Zhou Yuan, Elvis S. Liu, Hong Wang, Jian Zhao, Yang Wang

    Abstract: Deep Reinforcement Learning (DRL) agents have demonstrated impressive success in a wide range of game genres. However, existing research primarily focuses on optimizing DRL competence rather than addressing the challenge of prolonged player interaction. In this paper, we propose a practical DRL agent system for fighting games named Shūkai, which has been successfully deployed to Naruto Mobile, a p… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accept at ICML 2024

  8. arXiv:2405.20220  [pdf, other

    cs.DC cs.CY

    BeerReview: A Blockchain-enabled Peer Review Platform

    Authors: Guodong **, Zihan Zhou, Wenzheng Tang, Kanglei Yu, Hao Xu, Erwu Liu

    Abstract: In an era of increasing concerns over intellectual property rights, traditional peer review systems face challenges including plagiarism, malicious attacks, and unauthorized data access. BeerReview, a blockchain-enabled peer review platform, offers a robust solution, enabling experts and scholars to participate actively in the review process without concerns about plagiarism or security threats. F… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  9. arXiv:2405.17873  [pdf, other

    cs.CV cs.AI

    MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

    Authors: Tianchen Zhao, Xuefei Ning, Tongcheng Fang, Enshu Liu, Guyue Huang, Zinan Lin, Shengen Yan, Guohao Dai, Yu Wang

    Abstract: Diffusion models have achieved significant visual generation quality. However, their significant computational and memory costs pose challenge for their application on resource-constrained mobile devices or even desktop GPUs. Recent few-step diffusion models reduces the inference time by reducing the denoising steps. However, their memory consumptions are still excessive. The Post Training Quantiz… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Project Page: https://a-suozhang.xyz/mixdq.github.io/

  10. arXiv:2405.09757  [pdf, other

    cs.CR

    Give and Take: An End-To-End Investigation of Giveaway Scam Conversion Rates

    Authors: Enze Liu, George Kappos, Eric Mugnier, Luca Invernizzi, Stefan Savage, David Tao, Kurt Thomas, Geoffrey M. Voelker, Sarah Meiklejohn

    Abstract: Scams -- fraudulent schemes designed to swindle money from victims -- have existed for as long as recorded history. However, the Internet's combination of low communication cost, global reach, and functional anonymity has allowed scam volumes to reach new heights. Designing effective interventions requires first understanding the context: how scammers reach potential victims, the earnings they mak… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Under review

  11. arXiv:2405.09276  [pdf, other

    cs.LG cs.AI cs.DC

    Dual-Segment Clustering Strategy for Federated Learning in Heterogeneous Environments

    Authors: Pengcheng Sun, Erwu Liu, Wei Ni, Kanglei Yu, Rui Wang, Abbas Jamalipour

    Abstract: Federated learning (FL) is a distributed machine learning paradigm with high efficiency and low communication load, only transmitting parameters or gradients of network. However, the non-independent and identically distributed (Non-IID) data characteristic has a negative impact on this paradigm. Furthermore, the heterogeneity of communication quality will significantly affect the accuracy of param… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  12. arXiv:2405.08651  [pdf, other

    cs.DC

    BeACONS: A Blockchain-enabled Authentication and Communications Network for Scalable IoV

    Authors: Qi Shi, **gyi Sun, Hanwei Fu, Peizhe Fu, Jiayuan Ma, Hao Xu, Erwu Liu

    Abstract: This paper introduces a novel blockchain-enabled authentication and communications network for scalable Internet of Vehicles, which aims to bolster security and confidentiality, diminish communications latency, and reduce dependence on centralised infrastructures like Certificate Authorities and Public Key Infrastructures by leveraging Blockchain-enabled Domain Name Services and Blockchain-enabled… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  13. arXiv:2405.02320  [pdf, other

    cs.IT cs.AI

    A SER-based Device Selection Mechanism in Multi-bits Quantization Federated Learning

    Authors: Pengcheng Sun, Erwu Liu, Rui Wang

    Abstract: The quality of wireless communication will directly affect the performance of federated learning (FL), so this paper analyze the influence of wireless communication on FL through symbol error rate (SER). In FL system, non-orthogonal multiple access (NOMA) can be used as the basic communication framework to reduce the communication congestion and interference caused by multiple users, which takes a… ▽ More

    Submitted 20 April, 2024; originally announced May 2024.

  14. arXiv:2404.03028  [pdf, other

    cs.CL

    An Incomplete Loop: Deductive, Inductive, and Abductive Learning in Large Language Models

    Authors: Emmy Liu, Graham Neubig, Jacob Andreas

    Abstract: Modern language models (LMs) can learn to perform new tasks in different ways: in instruction following, the target task is described explicitly in natural language; in few-shot prompting, the task is specified implicitly with a small number of examples; in instruction inference, LMs are presented with in-context examples and are then prompted to generate a natural language task description before… ▽ More

    Submitted 10 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  15. arXiv:2404.02241  [pdf, other

    cs.CV

    Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

    Authors: Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Sergey Yekhanin, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Diffusion Models (DM) and Consistency Models (CM) are two types of popular generative models with good generation quality on various tasks. When training DM and CM, intermediate weight checkpoints are not fully utilized and only the last converged checkpoint is used. In this work, we find that high-quality model weights often lie in a basin which cannot be reached by SGD but can be obtained by pro… ▽ More

    Submitted 7 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  16. arXiv:2404.01817  [pdf, other

    cs.NE

    Tensorized NeuroEvolution of Augmenting Topologies for GPU Acceleration

    Authors: Lishuang Wang, Mengfei Zhao, Enyu Liu, Kebin Sun, Ran Cheng

    Abstract: The NeuroEvolution of Augmenting Topologies (NEAT) algorithm has received considerable recognition in the field of neuroevolution. Its effectiveness is derived from initiating with simple networks and incrementally evolving both their topologies and weights. Although its capability across various challenges is evident, the algorithm's computational efficiency remains an impediment, limiting its sc… ▽ More

    Submitted 11 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Genetic and Evolutionary Computation Conference (GECCO '24)

  17. arXiv:2403.13574  [pdf, other

    cs.IR cs.AI

    A Large Language Model Enhanced Sequential Recommender for Joint Video and Comment Recommendation

    Authors: Bowen Zheng, Zihan Lin, Enze Liu, Chen Yang, Enyang Bai, Cheng Ling, Wayne Xin Zhao, Ji-Rong Wen

    Abstract: In online video platforms, reading or writing comments on interesting videos has become an essential part of the video watching experience. However, existing video recommender systems mainly model users' interaction behaviors with videos, lacking consideration of comments in user behavior modeling. In this paper, we propose a novel recommendation approach called LSVCR by leveraging user interactio… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  18. arXiv:2403.02768  [pdf, other

    cs.CY

    An Empirical Analysis on the Use and Reporting of National Security Letters

    Authors: Alex Bellon, Miro Haller, Andrey Labunets, Enze Liu, Stefan Savage

    Abstract: National Security Letters (NSLs) are similar to administrative subpoenas and can be issued directly by elements of the executive branch without requiring prior approval from a court or grand jury. Importantly, NSLs authorize the imposition of nondisclosure orders (aka "gag orders") on the receiving party. Controversy about potential abuses of this authority has driven a range of legal and policy d… ▽ More

    Submitted 10 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted as short presentation at CSLAW 2024 (not in proceeding): https://computersciencelaw.org/2024/accepted-papers/

  19. arXiv:2402.19348  [pdf, other

    cs.LG cs.AI

    Deep Learning for Cross-Domain Data Fusion in Urban Computing: Taxonomy, Advances, and Outlook

    Authors: Xingchen Zou, Yibo Yan, Xixuan Hao, Yuehong Hu, Haomin Wen, Erdong Liu, Junbo Zhang, Yong Li, Tianrui Li, Yu Zheng, Yuxuan Liang

    Abstract: As cities continue to burgeon, Urban Computing emerges as a pivotal discipline for sustainable development by harnessing the power of cross-domain data fusion from diverse sources (e.g., geographical, traffic, social media, and environmental data) and modalities (e.g., spatio-temporal, visual, and textual modalities). Recently, we are witnessing a rising trend that utilizes various deep-learning m… ▽ More

    Submitted 16 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  20. arXiv:2402.17219  [pdf, other

    cs.CR

    Blockchain for Finance: A Survey

    Authors: Hanjie Wu, Qian Yao, Zhenguang Liu, Butian Huang, Yuan Zhuang, Huayun Tang, Erwu Liu

    Abstract: As an innovative technology for enhancing authenticity, security, and risk management, blockchain is being widely adopted in trade and finance systems. The unique capabilities of blockchain, such as immutability and transparency, enable new business models of distributed data storage, point-to-point transactions, and decentralized autonomous organizations. In this paper, we focus on blockchain-bas… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  21. arXiv:2402.01115  [pdf, other

    cs.CL eess.SP

    Interpretation of Intracardiac Electrograms Through Textual Representations

    Authors: William Jongwon Han, Diana Gomez, Avi Alok, Chao**g Duan, Michael A. Rosenberg, Douglas Weber, Emerson Liu, Ding Zhao

    Abstract: Understanding the irregular electrical activity of atrial fibrillation (AFib) has been a key challenge in electrocardiography. For serious cases of AFib, catheter ablations are performed to collect intracardiac electrograms (EGMs). EGMs offer intricately detailed and localized electrical activity of the heart and are an ideal modality for interpretable cardiac studies. Recent advancements in artif… ▽ More

    Submitted 11 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 18 pages, 9 figures; Accepted to CHIL 2024

    ACM Class: I.2.7; J.3

  22. arXiv:2312.12634  [pdf, other

    cs.CV cs.AI cs.CL cs.RO

    MotionScript: Natural Language Descriptions for Expressive 3D Human Motions

    Authors: Payam Jome Yazdian, Eric Liu, Li Cheng, Angelica Lim

    Abstract: This paper proposes MotionScript, a motion-to-text conversion algorithm and natural language representation for human body motions. MotionScript aims to describe movements in greater detail and with more accuracy than previous natural language approaches. Many motion datasets describe relatively objective and simple actions with little variation on the way they are expressed (e.g. sitting, walking… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  23. arXiv:2312.11084  [pdf, other

    cs.RO cs.MA

    Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects

    Authors: Min Hua, Dong Chen, Xinda Qi, Kun Jiang, Zemin Eitan Liu, Quan Zhou, Hongming Xu

    Abstract: Connected and automated vehicles (CAVs) have emerged as a potential solution to the future challenges of develo** safe, efficient, and eco-friendly transportation systems. However, CAV control presents significant challenges, given the complexity of interconnectivity and coordination required among the vehicles. To address this, multi-agent reinforcement learning (MARL), with its notable advance… ▽ More

    Submitted 16 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  24. arXiv:2312.07243  [pdf, other

    cs.AI

    A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models

    Authors: Enshu Liu, Xuefei Ning, Huazhong Yang, Yu Wang

    Abstract: Recent years have witnessed the rapid progress and broad application of diffusion probabilistic models (DPMs). Sampling from DPMs can be viewed as solving an ordinary differential equation (ODE). Despite the promising performance, the generation of DPMs usually consumes much time due to the large number of function evaluations (NFE). Though recent works have accelerated the sampling to around 20 s… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  25. arXiv:2311.09553  [pdf, other

    cs.AI

    Program-Aided Reasoners (better) Know What They Know

    Authors: Anubha Kabra, Sanketh Rangreji, Yash Mathur, Aman Madaan, Emmy Liu, Graham Neubig

    Abstract: Prior work shows that program-aided reasoning, in which large language models (LLMs) are combined with programs written in programming languages such as Python, can significantly improve accuracy on various reasoning tasks. However, while accuracy is essential, it is also important for such reasoners to "know what they know", which can be quantified through the calibration of the model. In this pa… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  26. arXiv:2311.09308  [pdf, other

    cs.CL cs.AI cs.LG q-bio.NC

    Divergences between Language Models and Human Brains

    Authors: Yuchen Zhou, Emmy Liu, Graham Neubig, Michael J. Tarr, Leila Wehbe

    Abstract: Do machines and humans process language in similar ways? Recent research has hinted in the affirmative, finding that brain signals can be effectively predicted using the internal representations of language models (LMs). Although such results are thought to reflect shared computational principles between LMs and human brains, there are also clear differences in how LMs and humans represent and use… ▽ More

    Submitted 4 February, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  27. arXiv:2311.08706  [pdf, other

    cs.CY cs.AI

    Aligned: A Platform-based Process for Alignment

    Authors: Ethan Shaotran, Ido Pesok, Sam Jones, Emi Liu

    Abstract: We are introducing Aligned, a platform for global governance and alignment of frontier models, and eventually superintelligence. While previous efforts at the major AI labs have attempted to gather inputs for alignment, these are often conducted behind closed doors. We aim to set the foundation for a more trustworthy, public-facing approach to safety: a constitutional committee framework. Initial… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 11 pages, 7 figures. For associated public report, see https://energize.ai/openai

  28. arXiv:2311.03707  [pdf, other

    cs.AI cs.LG cs.MA

    The NeurIPS 2022 Neural MMO Challenge: A Massively Multiagent Competition with Specialization and Trade

    Authors: Enhong Liu, Joseph Suarez, Chenhui You, Bo Wu, Bingcheng Chen, Jun Hu, Jiaxin Chen, Xiaolong Zhu, Clare Zhu, Julian Togelius, Sharada Mohanty, Weijun Hong, Rui Du, Yibing Zhang, Qinwen Wang, Xinhang Li, Zheng Yuan, Xiang Li, Yuejia Huang, Kun Zhang, Hanhui Yang, Shiqi Tang, Phillip Isola

    Abstract: In this paper, we present the results of the NeurIPS-2022 Neural MMO Challenge, which attracted 500 participants and received over 1,600 submissions. Like the previous IJCAI-2022 Neural MMO Challenge, it involved agents from 16 populations surviving in procedurally generated worlds by collecting resources and defeating opponents. This year's competition runs on the latest v1.6 Neural MMO, which in… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  29. arXiv:2310.07081  [pdf, other

    cs.CL

    Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting

    Authors: Emmy Liu, Aditi Chaudhary, Graham Neubig

    Abstract: Idioms are common in everyday language, but often pose a challenge to translators because their meanings do not follow from the meanings of their parts. Despite significant advances, machine translation systems still struggle to translate idiomatic expressions. We provide a simple characterization of idiomatic translation and related issues. This allows us to conduct a synthetic experiment reveali… ▽ More

    Submitted 20 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  30. arXiv:2310.03303  [pdf, other

    cs.RO

    A Two-stage Based Social Preference Recognition in Multi-Agent Autonomous Driving System

    Authors: **tao Xue, Dongkun Zhang, Rong Xiong, Yue Wang, Eryun Liu

    Abstract: Multi-Agent Reinforcement Learning (MARL) has become a promising solution for constructing a multi-agent autonomous driving system (MADS) in complex and dense scenarios. But most methods consider agents acting selfishly, which leads to conflict behaviors. Some existing works incorporate the concept of social value orientation (SVO) to promote coordination, but they lack the knowledge of other agen… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  31. arXiv:2309.01415  [pdf, other

    physics.soc-ph cs.CY

    A generalized vector-field framework for mobility

    Authors: Erjian Liu, Mattia Mazzoli, Xiao-Yong Yan, Jose J. Ramasco

    Abstract: Trip flow between areas is a fundamental metric for human mobility research. Given its identification with travel demand and its relevance for transportation and urban planning, many models have been developed for its estimation. These models focus on flow intensity, disregarding the information provided by the local mobility orientation. A field-theoretic approach can overcome this issue and hand… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 13 pages, 8 figures, Appendices

  32. arXiv:2308.11849  [pdf, other

    eess.SY cs.AI cs.LG

    A Mobile Data-Driven Hierarchical Deep Reinforcement Learning Approach for Real-time Demand-Responsive Railway Rescheduling and Station Overcrowding Mitigation

    Authors: Enze Liu, Zhiyuan Lin, Judith Y. T. Wang, Hong Chen

    Abstract: Real-time railway rescheduling is an important technique to enable operational recovery in response to unexpected and dynamic conditions in a timely and flexible manner. Current research relies mostly on OD based data and model-based methods for estimating train passenger demands. These approaches primarily focus on averaged disruption patterns, often overlooking the immediate uneven distribution… ▽ More

    Submitted 6 November, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 42 pages,20 figures

  33. arXiv:2308.06829  [pdf, other

    cs.NI cs.AR cs.CR cs.DC

    When Web 3.0 Meets Reality: A Hyperdimensional Fractal Polytope P2P Ecosystems

    Authors: Hao Xu, Yunqing Sun, Xiaoshuai Zhang, Erwu Liu, Chih-Lin I

    Abstract: Web 3.0 opens the world of new existence of the crypto-network-entity, which is independently defined by the public key pairs for entities and the connection to the Web 3.0 cyberspace. In this paper, we first discover a spacetime coordinate system based on fractal polytope in any dimensions with discrete time offered by blockchain and consensus. Second, the novel network entities and functions are… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

  34. arXiv:2306.13023  [pdf, other

    cs.CV

    AugDMC: Data Augmentation Guided Deep Multiple Clustering

    Authors: Jiawei Yao, Enbei Liu, Maham Rashid, Juhua Hu

    Abstract: Clustering aims to group similar objects together while separating dissimilar ones apart. Thereafter, structures hidden in data can be identified to help understand data in an unsupervised manner. Traditional clustering methods such as k-means provide only a single clustering for one data set. Deep clustering methods such as auto-encoder based clustering methods have shown a better performance, bu… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

  35. arXiv:2306.08860  [pdf, other

    cs.LG

    OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models

    Authors: Enshu Liu, Xuefei Ning, Zinan Lin, Huazhong Yang, Yu Wang

    Abstract: Diffusion probabilistic models (DPMs) are a new class of generative models that have achieved state-of-the-art generation quality in various domains. Despite the promise, one major drawback of DPMs is the slow generation speed due to the large number of neural network evaluations required in the generation process. In this paper, we reveal an overlooked dimension -- model schedule -- for optimizin… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML2023

  36. arXiv:2306.08400  [pdf, other

    cs.CL cs.AI cs.LG

    Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning

    Authors: Evan Zheran Liu, Sahaana Suri, Tong Mu, Allan Zhou, Chelsea Finn

    Abstract: Whereas machine learning models typically learn language by directly training on language tasks (e.g., next-word prediction), language emerges in human children as a byproduct of solving non-language tasks (e.g., acquiring food). Motivated by this observation, we ask: can embodied reinforcement learning (RL) agents also indirectly learn language from non-language tasks? Learning to associate langu… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML), 2023

  37. arXiv:2305.18185  [pdf, other

    cs.CL

    Syntax and Semantics Meet in the "Middle": Probing the Syntax-Semantics Interface of LMs Through Agentivity

    Authors: Lindia Tjuatja, Emmy Liu, Lori Levin, Graham Neubig

    Abstract: Recent advances in large language models have prompted researchers to examine their abilities across a variety of linguistic tasks, but little has been done to investigate how models handle the interactions in meaning across words and larger syntactic forms -- i.e. phenomena at the intersection of syntax and semantics. We present the semantic notion of agentivity as a case study for probing such i… ▽ More

    Submitted 10 July, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  38. arXiv:2305.16567  [pdf, other

    cs.LG cs.CV

    Structured Latent Variable Models for Articulated Object Interaction

    Authors: Emily Liu, Michael Noseworthy, Nicholas Roy

    Abstract: In this paper, we investigate a scenario in which a robot learns a low-dimensional representation of a door given a video of the door opening or closing. This representation can be used to infer door-related parameters and predict the outcomes of interacting with the door. Current machine learning based approaches in the doors domain are based primarily on labelled datasets. However, the large qua… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  39. arXiv:2305.16171  [pdf

    cs.CL

    Multi-lingual and Multi-cultural Figurative Language Understanding

    Authors: Anubha Kabra, Emmy Liu, Simran Khanuja, Alham Fikri Aji, Genta Indra Winata, Samuel Cahyawijaya, Anuoluwapo Aremu, Perez Ogayo, Graham Neubig

    Abstract: Figurative language permeates human communication, but at the same time is relatively understudied in NLP. Datasets have been created in English to accelerate progress towards measuring and improving figurative language processing in language models (LMs). However, the use of figurative language is an expression of our cultural and societal experiences, making it difficult for these phrases to be… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Findings

  40. arXiv:2305.10830  [pdf

    cs.AI

    Constructing a personalized AI assistant for shear wall layout using Stable Diffusion

    Authors: Lufeng Wang, Jiepeng Liu, Guozhong Cheng, En Liu, Wei Chen

    Abstract: Shear wall structures are widely used in high-rise residential buildings, and the layout of shear walls requires many years of design experience and iterative trial and error. Currently, there are methods based on heuristic algorithms, but they generate results too slowly. Those based on Generative Adversarial Networks (GANs) or Graph Neural Networks (GNNs) can only generate single arrangements an… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  41. arXiv:2305.00955  [pdf, other

    cs.CL cs.AI cs.LG

    Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

    Authors: Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins

    Abstract: Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod… ▽ More

    Submitted 31 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Work in Progress

  42. arXiv:2304.12821  [pdf, other

    cs.RO

    Zero-shot Transfer Learning of Driving Policy via Socially Adversarial Traffic Flow

    Authors: Dongkun Zhang, **tao Xue, Yuxiang Cui, Yunkai Wang, Eryun Liu, Wei **g, Junbo Chen, Rong Xiong, Yue Wang

    Abstract: Acquiring driving policies that can transfer to unseen environments is challenging when driving in dense traffic flows. The design of traffic flow is essential and previous studies are unable to balance interaction and safety-criticism. To tackle this problem, we propose a socially adversarial traffic flow. We propose a Contextual Partially-Observable Stochastic Game to model traffic flow and assi… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  43. arXiv:2304.06286  [pdf, other

    eess.SP cs.CV

    Automated Cardiovascular Record Retrieval by Multimodal Learning between Electrocardiogram and Clinical Report

    Authors: Jielin Qiu, Jiacheng Zhu, Shiqi Liu, William Han, **gqi Zhang, Chao**g Duan, Michael Rosenberg, Emerson Liu, Douglas Weber, Ding Zhao

    Abstract: Automated interpretation of electrocardiograms (ECG) has garnered significant attention with the advancements in machine learning methodologies. Despite the growing interest, most current studies focus solely on classification or regression tasks, which overlook a crucial aspect of clinical cardio-disease diagnosis: the diagnostic report generated by experienced human clinicians. In this paper, we… ▽ More

    Submitted 6 November, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted to the ML4H 2023 Proceedings track

  44. arXiv:2304.06177  [pdf, other

    cs.CV cs.AI

    Visual based Tomato Size Measurement System for an Indoor Farming Environment

    Authors: Andy Kweon, Vishnu Hu, Jong Yoon Lim, Trevor Gee, Edmond Liu, Henry Williams, Bruce A. MacDonald, Mahla Nejati, Inkyu Sa, Ho Seok Ahn

    Abstract: As technology progresses, smart automated systems will serve an increasingly important role in the agricultural industry. Current existing vision systems for yield estimation face difficulties in occlusion and scalability as they utilize a camera system that is large and expensive, which are unsuitable for orchard environments. To overcome these problems, this paper presents a size measurement met… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: 10 Pages, 12 Figures

  45. arXiv:2304.03708  [pdf, other

    eess.IV cs.CV

    Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

    Authors: Gongning Luo, Kuanquan Wang, Jun Liu, Shuo Li, Xinjie Liang, Xiangyu Li, Shaowei Gan, Wei Wang, Suyu Dong, Wenyi Wang, Pengxin Yu, Enyou Liu, Hongrong Wei, Na Wang, Jia Guo, Huiqi Li, Zhao Zhang, Ziwei Zhao, Na Gao, Nan An, Ashkan Pakzad, Bojidar Rangelov, Jiaqi Dou, Song Tian, Zeyu Liu , et al. (5 additional authors not shown)

    Abstract: Efficient automatic segmentation of multi-level (i.e. main and branch) pulmonary arteries (PA) in CTPA images plays a significant role in clinical applications. However, most existing methods concentrate only on main PA or branch PA segmentation separately and ignore segmentation efficiency. Besides, there is no public large-scale dataset focused on PA segmentation, which makes it highly challengi… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  46. arXiv:2303.14347  [pdf, other

    cs.RO cs.CV

    Vision-based Vineyard Navigation Solution with Automatic Annotation

    Authors: Ertai Liu, Josephine Monica, Kaitlin Gold, Lance Cadle-Davidson, David Combs, Yu Jiang

    Abstract: Autonomous navigation is the key to achieving the full automation of agricultural research and production management (e.g., disease management and yield prediction) using agricultural robots. In this paper, we introduced a vision-based autonomous navigation framework for agriculture robots in trellised crop** systems such as vineyards. To achieve this, we proposed a novel learning-based method t… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: Submission to IROS 2023

  47. arXiv:2303.14240  [pdf, other

    cs.CV

    Adaptive Base-class Suppression and Prior Guidance Network for One-Shot Object Detection

    Authors: Wenwen Zhang, Xinyu Xiao, Hangguan Shan, Eryun Liu

    Abstract: One-shot object detection (OSOD) aims to detect all object instances towards the given category specified by a query image. Most existing studies in OSOD endeavor to explore effective cross-image correlation and alleviate the semantic feature misalignment, however, ignoring the phenomenon of the model bias towards the base classes and the generalization degradation on the novel classes. Observing… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  48. arXiv:2303.01502  [pdf, other

    cs.CL cs.AI

    Computational Language Acquisition with Theory of Mind

    Authors: Andy Liu, Hao Zhu, Emmy Liu, Yonatan Bisk, Graham Neubig

    Abstract: Unlike current state-of-the-art language models, young children actively acquire language through interactions with their surrounding environment and caretakers. One mechanism that has been argued to be critical to language learning is the ability to infer the mental states of other agents in social environments, coined Theory of Mind (ToM) by Premack & Woodruff (1978). Drawing inspiration from th… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 9 pages, 3 figures. To be published in the 11th International Conference on Learning Representations, ICLR 2023, Conference Track Proceedings

  49. arXiv:2302.07287  [pdf, other

    cs.CR

    Forward Pass: On the Security Implications of Email Forwarding Mechanism and Policy

    Authors: Enze Liu, Gautam Akiwate, Mattijs Jonker, Ariana Mirian, Grant Ho, Geoffrey M. Voelker, Stefan Savage

    Abstract: The critical role played by email has led to a range of extension protocols (e.g., SPF, DKIM, DMARC) designed to protect against the spoofing of email sender domains. These protocols are complex as is, but are further complicated by automated email forwarding -- used by individual users to manage multiple accounts and by mailing lists to redistribute messages. In this paper, we explore how such em… ▽ More

    Submitted 19 April, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: The paper appeared at the 8th IEEE European Symposium on Security and Privacy

    Journal ref: The 8th IEEE European Symposium on Security and Privacy, 2023

  50. arXiv:2302.00932  [pdf, other

    cs.LG cs.AI

    Dynamic Ensemble of Low-fidelity Experts: Mitigating NAS "Cold-Start"

    Authors: Junbo Zhao, Xuefei Ning, Enshu Liu, Binxin Ru, Zixuan Zhou, Tianchen Zhao, Chen Chen, Jia** Zhang, Qingmin Liao, Yu Wang

    Abstract: Predictor-based Neural Architecture Search (NAS) employs an architecture performance predictor to improve the sample efficiency. However, predictor-based NAS suffers from the severe ``cold-start'' problem, since a large amount of architecture-performance data is required to get a working predictor. In this paper, we focus on exploiting information in cheaper-to-obtain performance estimations (i.e.… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.