
Showing 101–150 of 551 results for author: Ji, H

  1. arXiv:2308.16040  [pdf, other]

    quant-ph

    Native approach to controlled-Z gates in inductively coupled fluxonium qubits

    Authors: Xizheng Ma, Gengyan Zhang, Feng Wu, Feng Bao, Xu Chang, Jianjun Chen, Hao Deng, Ran Gao, Xun Gao, Lijuan Hu, Honghong Ji, Hsiang-Sheng Ku, Kannan Lu, Lu Ma, Liyong Mao, Zhijun Song, Hantao Sun, Chengchun Tang, Fei Wang, Hongcheng Wang, Tenghui Wang, Tian Xia, Make Ying, Huijuan Zhan, Tao Zhou, et al. (5 additional authors not shown)

    Abstract: The fluxonium qubits have emerged as a promising platform for gate-based quantum information processing. However, their extraordinary protection against charge fluctuations comes at a cost: when coupled capacitively, the qubit-qubit interactions are restricted to XX-interactions. Consequently, effective XX- or XZ-interactions are only constructed either by temporarily populating higher-energy stat…

    Submitted 30 August, 2023; originally announced August 2023.

  2. arXiv:2308.14507  [pdf, other]

    math.ST cs.IT cs.LG math.PR stat.ML

    Spectral Estimators for Structured Generalized Linear Models via Approximate Message Passing

    Authors: Yihan Zhang, Hong Chang Ji, Ramji Venkataramanan, Marco Mondelli

    Abstract: We consider the problem of parameter estimation in a high-dimensional generalized linear model. Spectral methods obtained via the principal eigenvector of a suitable data-dependent matrix provide a simple yet surprisingly effective solution. However, despite their wide use, a rigorous performance characterization, as well as a principled way to preprocess the data, are available only for unstructu…

    Submitted 3 July, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

  3. arXiv:2308.12537  [pdf, other]

    cs.RO cs.CV

    HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks

    Authors: Zichao Dong, Weikun Zhang, Xufeng Huang, Hang Ji, Xin Zhan, Junbo Chen

    Abstract: Human robot interaction is an exciting task, which aimed to guide robots following instructions from human. Since huge gap lies between human natural language and machine codes, end to end human robot interaction models is fair challenging. Further, visual information receiving from sensors of robot is also a hard language for robot to perceive. In this work, HuBo-VLM is proposed to tackle percept…

    Submitted 23 August, 2023; originally announced August 2023.

  4. arXiv:2308.11768  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Ferromagnetic and insulating behavior in both half magnetic levitation and non-levitation LK-99 like samples

    Authors: Pinyuan Wang, Xiaoqi Liu, Jun Ge, Chengcheng Ji, Haoran Ji, Yanzhao Liu, Yiwen Ai, Gaoxing Ma, Shichao Qi, Jian Wang

    Abstract: Finding materials exhibiting superconductivity at room temperature has long been one of the ultimate goals in physics and material science. Recently, room-temperature superconducting properties have been claimed in a copper substituted lead phosphate apatite (Pb$_{10-x}$Cu$_x$(PO$_4$)$_6$O, or called LK-99) [1-3]. Using a similar approach, we have prepared LK-99 like samples and confirmed the half…

    Submitted 28 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Journal ref: Quantum Front 2, 10 (2023)

  5. arXiv:2308.10705  [pdf, other]

    cs.CV cs.AI

    Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling

    Authors: Haorui Ji, Hui Deng, Yuchao Dai, Hongdong Li

    Abstract: Most of the previous 3D human pose estimation work relied on the powerful memory capability of the network to obtain suitable 2D-3D mappings from the training data. Few works have studied the modeling of human posture deformation in motion. In this paper, we propose a new modeling method for human pose deformations and design an accompanying diffusion-based motion prior. Inspired by the field of n…

    Submitted 18 August, 2023; originally announced August 2023.

  6. arXiv:2308.01227  [pdf, other]

    cs.IT

    Towards Integrated Sensing and Communications for 6G: A Standardization Perspective

    Authors: Aryan Kaushik, Rohit Singh, Shalanika Dayarathna, Rajitha Senanayake, Marco Di Renzo, Miguel Dajer, Hyoungju Ji, Younsun Kim, Vincenzo Sciancalepore, Alessio Zappone, Wonjae Shin

    Abstract: The radio communication division of the International Telecommunication Union (ITU-R) has recently adopted Integrated Sensing and Communication (ISAC) among the key usage scenarios for IMT-2030/6G. ISAC is envisioned to play a vital role in the upcoming wireless generation standards. In this work, we bring together several paramount and innovative aspects of ISAC technology from a global 6G standa…

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 7 pages, 5 figures

  7. arXiv:2308.00910  [pdf, ps, other]

    math.NA

    A Mini Immersed Finite Element Method for Two-Phase Stokes Problems on Cartesian Meshes

    Authors: Haifeng Ji, Dong Liang, Qian Zhang

    Abstract: This paper presents a mini immersed finite element (IFE) method for solving two- and three-dimensional two-phase Stokes problems on Cartesian meshes. The IFE space is constructed from the conventional mini element with shape functions modified on interface elements according to interface jump conditions, while keeping the degrees of freedom unchanged. Both discontinuous viscosity coefficients and…

    Submitted 1 August, 2023; originally announced August 2023.

  8. arXiv:2307.11694  [pdf, other]

    cs.AI cs.LG q-bio.BM q-bio.MN

    SynerGPT: In-Context Learning for Personalized Drug Synergy Prediction and Drug Design

    Authors: Carl Edwards, Aakanksha Naik, Tushar Khot, Martin Burke, Heng Ji, Tom Hope

    Abstract: Predicting synergistic drug combinations can help accelerate discovery of cancer treatments, particularly therapies personalized to a patient's specific tumor via biopsied cells. In this paper, we propose a novel setting and models for in-context drug synergy learning. We are given a small "personalized dataset" of 10-20 drug synergy relationships in the context of specific cancer cell targets. Ou…

    Submitted 24 October, 2023; v1 submitted 19 June, 2023; originally announced July 2023.

  9. arXiv:2307.11316  [pdf, other]

    cs.CL cs.LG

    Making Pre-trained Language Models both Task-solvers and Self-calibrators

    Authors: Yangyi Chen, Xingyao Wang, Heng Ji

    Abstract: Pre-trained language models (PLMs) serve as backbones for various real-world systems. For high-stake applications, it's equally essential to have reasonable confidence estimations in predictions. While the vanilla confidence scores of PLMs can already be effectively utilized, PLMs consistently become overconfident in their wrong predictions, which is not desirable in practice. Previous work shows…

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted to Findings of ACL 2023

  10. arXiv:2307.08626  [pdf, other]

    math.PR math.FA

    Density of Brown measure of free circular Brownian motion

    Authors: László Erdős, Hong Chang Ji

    Abstract: We consider the Brown measure of free circular Brownian motion $\boldsymbol{a}+\sqrt{t}\boldsymbol{x}$, where $\boldsymbol{a}$ is a general non-normal operator and $\boldsymbol{x}$ is a circular element $*$-free from $\boldsymbol{a}$. We prove that, under a mild assumption on $\boldsymbol{a}$, the density of the Brown measure has one of the following two types of behavior around each point on the…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 26 pages, 4 figures

    MSC Class: 46L54; 60B20

  11. arXiv:2307.08423  [pdf, other]

    cs.LG physics.comp-ph

    Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

    Authors: Xuan Zhang, Limei Wang, Jacob Helwig, Youzhi Luo, Cong Fu, Yaochen Xie, Meng Liu, Yuchao Lin, Zhao Xu, Keqiang Yan, Keir Adams, Maurice Weiler, Xiner Li, Tianfan Fu, Yucheng Wang, Haiyang Yu, YuQing Xie, Xiang Fu, Alex Strasser, Shenglong Xu, Yi Liu, Yuanqi Du, Alexandra Saxton, Hongyi Ling, Hannah Lawrence, et al. (38 additional authors not shown)

    Abstract: Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural sciences. Today, AI has started to advance natural sciences by improving, accelerating, and enabling our understanding of natural phenomena at a wide range of spatial and temporal scales, giving rise to a new area of research known as AI for science (AI4Science). Being an emerging research paradigm, AI4Sc…

    Submitted 15 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  12. arXiv:2307.07109  [pdf, other]

    physics.plasm-ph astro-ph.HE astro-ph.SR physics.space-ph

    Laboratory Study of Collisionless Magnetic Reconnection

    Authors: H. Ji, J. Yoo, W. Fox, M. Yamada, M. Argall, J. Egedal, Y. -H. Liu, R. Wilder, S. Eriksson, W. Daughton, K. Bergstedt, S. Bose, J. Burch, R. Torbert, J. Ng, L. -J. Chen

    Abstract: A concise review is given on the past two decades' results from laboratory experiments on collisionless magnetic reconnection in direct relation with space measurements, especially by Magnetospheric Multiscale (MMS) mission. Highlights include spatial structures of electromagnetic fields in ion and electron diffusion regions as a function of upstream symmetry and guide field strength; energy conve…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 40 pages, 15 figures

    Journal ref: ISSI book titled "Magnetic Reconnection: Explosive Energy Conversion in Space Plasmas" (2023)

  13. arXiv:2307.05873  [pdf, other]

    cs.CV

    OG: Equip vision occupancy with instance segmentation and visual grounding

    Authors: Zichao Dong, Hang Ji, Weikun Zhang, Xufeng Huang, Junbo Chen

    Abstract: Occupancy prediction tasks focus on the inference of both geometry and semantic labels for each voxel, which is an important perception mission. However, it is still a semantic segmentation task without distinguishing various instances. Further, although some existing works, such as Open-Vocabulary Occupancy (OVO), have already solved the problem of open vocabulary detection, visual grounding in o…

    Submitted 11 July, 2023; originally announced July 2023.

  14. arXiv:2307.05300  [pdf, other]

    cs.AI cs.CL

    Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration

    Authors: Zhenhailong Wang, Shaoguang Mao, Wenshan Wu, Tao Ge, Furu Wei, Heng Ji

    Abstract: Human intelligence thrives on cognitive synergy, where collaboration among different minds yield superior outcomes compared to isolated individuals. In this work, we propose Solo Performance Prompting (SPP), which transforms a single LLM into a cognitive synergist by engaging in multi-turn self-collaboration with multiple personas. A cognitive synergist is an intelligent agent that collaboratively…

    Submitted 26 March, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted as a main conference paper at NAACL 2024

  15. arXiv:2307.01972  [pdf, other]

    cs.CL

    Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification

    Authors: Sha Li, Ruining Zhao, Manling Li, Heng Ji, Chris Callison-Burch, Jiawei Han

    Abstract: Event schemas are a form of world knowledge about the typical progression of events. Recent methods for event schema induction use information extraction systems to construct a large number of event graph instances from documents, and then learn to generalize the schema from such instances. In contrast, we propose to treat event schemas as a form of commonsense knowledge that can be derived from l…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted to ACL 2023. 19 pages with appendix

  16. arXiv:2306.16602  [pdf, other]

    physics.flu-dyn math.AP

    An electro-hydrodynamics modeling of droplet actuation on solid surface by surfactant-mediated electro-dewetting

    Authors: Weiqi Chu, Hangjie Ji, Qining Wang, Chang-Jin "CJ" Kim, Andrea L. Bertozzi

    Abstract: We propose an electro-hydrodynamics model to describe the dynamic evolution of a slender drop containing a dilute ionic surfactant on a naturally wettable surface, with a varying external electric field. This unified model reproduces fundamental microfluidic operations controlled by electrical signals, including dewetting, rewetting, and droplet shifting. In this paper, lubrication theory analysis…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 16 pages, 13 figures

  17. arXiv:2306.15245  [pdf, other]

    cs.CL

    C-PMI: Conditional Pointwise Mutual Information for Turn-level Dialogue Evaluation

    Authors: Liliang Ren, Mankeerat Sidhu, Qi Zeng, Revanth Gangi Reddy, Heng Ji, ChengXiang Zhai

    Abstract: Existing reference-free turn-level evaluation metrics for chatbots inadequately capture the interaction between the user and the system. Consequently, they often correlate poorly with human evaluations. To address this issue, we propose a novel model-agnostic approach that leverages Conditional Pointwise Mutual Information (C-PMI) to measure the turn-level interaction between the system and the us…

    Submitted 1 September, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Published at ACL2023 DialDoc Workshop; Updated Results

  18. arXiv:2306.10599  [pdf, ps, other]

    cs.SE

    An Empirical Study of Untangling Patterns of Two-Class Dependency Cycles

    Authors: Qiong Feng, Shuwen Liu, Huan Ji, Xiaotian Ma, Peng Liang

    Abstract: Dependency cycles pose a significant challenge to software quality and maintainability. However, there is limited understanding of how practitioners resolve dependency cycles in real-world scenarios. This paper presents an empirical study investigating the recurring patterns employed by software developers to resolve dependency cycles between two classes in practice. We analyzed the data from 38 o…

    Submitted 17 December, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: Preprint accepted for publication in Empirical Software Engineering, 2023

  19. arXiv:2306.04909  [pdf]

    astro-ph.SR astro-ph.HE physics.plasm-ph physics.space-ph

    Particle acceleration in solar flares with imaging-spectroscopy in soft X-rays

    Authors: Mitsuo Oka, Amir Caspi, Bin Chen, Mark Cheung, James Drake, Dale Gary, Lindsay Glesener, Fan Guo, Hantao Ji, Xiaocan Li, Takuma Nakamura, Noriyuki Narukage, Katharine Reeves, Pascal Saint-Hilaire, Taro Sakao, Chengcai Shen, Amy Winebarger, Tom Woods

    Abstract: Particles are accelerated to very high, non-thermal energies during explosive energy-release phenomena in space, solar, and astrophysical plasma environments. In the case of solar flares, it has been established that magnetic reconnection plays an important role for releasing the magnetic energy, but it remains unclear if or how magnetic reconnection can further explain particle acceleration durin…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: White paper submitted to the Decadal Survey for Solar and Space Physics (Heliophysics) 2024-2033; 10 pages, 2 figures

    Journal ref: Bulletin of the AAS, Vol. 55, Issue 3, Whitepaper #302 (10pp); 2023 July 31

  20. arXiv:2306.04618  [pdf, other]

    cs.CL cs.CR cs.LG

    Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations

    Authors: Lifan Yuan, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Fangyuan Zou, Xingyi Cheng, Heng Ji, Zhiyuan Liu, Maosong Sun

    Abstract: This paper reexamines the research on out-of-distribution (OOD) robustness in the field of NLP. We find that the distribution shift settings in previous studies commonly lack adequate challenges, hindering the accurate evaluation of OOD robustness. To address these issues, we propose a benchmark construction protocol that ensures clear differentiation and challenging distribution shifts. Then we i…

    Submitted 26 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023 Dataset and Benchmark Track. Code is available at https://github.com/lifan-yuan/OOD_NLP

  21. arXiv:2306.00887  [pdf, other]

    cs.CL

    OpenPI-C: A Better Benchmark and Stronger Baseline for Open-Vocabulary State Tracking

    Authors: Xueqing Wu, Sha Li, Heng Ji

    Abstract: Open-vocabulary state tracking is a more practical version of state tracking that aims to track state changes of entities throughout a process without restricting the state space and entity space. OpenPI is to date the only dataset annotated for open-vocabulary state tracking. However, we identify issues with the dataset quality and evaluation metric. For the dataset, we categorize 3 types of prob…

    Submitted 20 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: ACL 2023 findings (fix typo)

  22. arXiv:2305.18641  [pdf, other]

    cs.CL cs.CV

    Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs

    Authors: Mingyang Zhou, Yi R. Fung, Long Chen, Christopher Thomas, Heng Ji, Shih-Fu Chang

    Abstract: Building cross-model intelligence that can understand charts and communicate the salient information hidden behind them is an appealing challenge in the vision and language(V+L) community. The capability to uncover the underlined table data of chart figures is a critical key to automatic chart understanding. We introduce ChartT5, a V+L model that learns how to interpret table information from char…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted by Findings of ACL 2023

  23. arXiv:2305.18582  [pdf, other]

    cs.CL

    Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy

    Authors: Pengfei Yu, Heng Ji

    Abstract: Large Language Models (LLMs) struggle with providing current information due to the outdated pre-training data. Existing methods for updating LLMs, such as knowledge editing and continual fine-tuning, have significant drawbacks in generalizability of new information and the requirements on structured updating corpus. We identify the core challenge behind these drawbacks: the LM-logical discrepancy…

    Submitted 9 February, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  24. arXiv:2305.18503  [pdf, other]

    cs.CL cs.CR cs.LG

    From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework

    Authors: Yangyi Chen, Hongcheng Gao, Ganqu Cui, Lifan Yuan, Dehan Kong, Hanlu Wu, Ning Shi, Bo Yuan, Longtao Huang, Hui Xue, Zhiyuan Liu, Maosong Sun, Heng Ji

    Abstract: Textual adversarial attacks can discover models' weaknesses by adding semantic-preserved but misleading perturbations to the inputs. The long-lasting adversarial attack-and-defense arms race in Natural Language Processing (NLP) is algorithm-centric, providing valuable techniques for automatic robustness evaluation. However, the existing practice of robustness evaluation may exhibit issues of incom…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL 2023

  25. arXiv:2305.17542  [pdf, other]

    cs.CL cs.MM

    Non-Sequential Graph Script Induction via Multimedia Grounding

    Authors: Yu Zhou, Sha Li, Manling Li, Xudong Lin, Shih-Fu Chang, Mohit Bansal, Heng Ji

    Abstract: Online resources such as WikiHow compile a wide range of scripts for performing everyday tasks, which can assist models in learning to reason about procedures. However, the scripts are always presented in a linear manner, which does not reflect the flexibility displayed by people executing tasks in real life. For example, in the CrossTask Dataset, 64.5% of consecutive step pairs are also observed…

    Submitted 27 May, 2023; originally announced May 2023.

  26. arXiv:2305.17373  [pdf, other]

    cs.CL cs.AI

    Zero- and Few-Shot Event Detection via Prompt-Based Meta Learning

    Authors: Zhenrui Yue, Huimin Zeng, Mengfei Lan, Heng Ji, Dong Wang

    Abstract: With emerging online topics as a source for numerous new events, detecting unseen / rare event types presents an elusive challenge for existing event detection methods, where only limited data access is provided for training. To address the data scarcity problem in event detection, we propose MetaEvent, a meta learning-based framework for zero- and few-shot event detection. Specifically, we sample…

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  27. arXiv:2305.16470  [pdf, other]

    cs.CL cs.LG

    Measuring the Effect of Influential Messages on Varying Personas

    Authors: Chenkai Sun, Jinning Li, Hou Pong Chan, ChengXiang Zhai, Heng Ji

    Abstract: Predicting how a user responds to news events enables important applications such as allowing intelligent agents or content producers to estimate the effect on different communities and revise unreleased messages to prevent unexpected bad outcomes such as social conflict and moral injury. We present a new task, Response Forecasting on Personas for News Media, to estimate the response a persona (ch…

    Submitted 25 May, 2023; originally announced May 2023.

  28. arXiv:2305.16133  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    OVO: Open-Vocabulary Occupancy

    Authors: Zhiyu Tan, Zichao Dong, Cheng Zhang, Weikun Zhang, Hang Ji, Hao Li

    Abstract: Semantic occupancy prediction aims to infer dense geometry and semantics of surroundings for an autonomous agent to operate safely in the 3D environment. Existing occupancy prediction methods are almost entirely trained on human-annotated volumetric data. Although of high quality, the generation of such 3D annotations is laborious and costly, restricting them to a few specific object categories in…

    Submitted 14 June, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  29. arXiv:2305.14647  [pdf, other]

    cs.CL

    Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and Evaluation

    Authors: Qi Zeng, Mankeerat Sidhu, Ansel Blume, Hou Pong Chan, Lu Wang, Heng Ji

    Abstract: Opinions in scientific research papers can be divergent, leading to controversies among reviewers. However, most existing datasets for opinion summarization are centered around product reviews and assume that the analyzed opinions are non-controversial, failing to account for the variability seen in other contexts such as academic papers, political debates, or social media discussions. To address…

    Submitted 15 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: IJCAI 2024 AI4Research Workshop

  30. arXiv:2305.14548  [pdf, other]

    cs.CL

    Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarization

    Authors: Hou Pong Chan, Qi Zeng, Heng Ji

    Abstract: Existing factual consistency evaluation approaches for text summarization provide binary predictions and limited insights into the weakness of summarization systems. Therefore, we propose the task of fine-grained inconsistency detection, the goal of which is to predict the fine-grained types of factual errors in a summary. Motivated by how humans inspect factual inconsistency in summaries, we prop…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL Findings 2023. Code and data are available at https://github.com/kenchan0226/fineGrainedFact

  31. arXiv:2305.14318  [pdf, other]

    cs.CL

    CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models

    Authors: Cheng Qian, Chi Han, Yi R. Fung, Yujia Qin, Zhiyuan Liu, Heng Ji

    Abstract: Large Language Models (LLMs) have made significant progress in utilizing tools, but their ability is limited by API availability and the instability of implicit reasoning, particularly when both planning and execution are involved. To overcome these limitations, we propose CREATOR, a novel framework that enables LLMs to create their own tools using documentation and code realization. CREATOR disen…

    Submitted 21 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  32. arXiv:2305.14259  [pdf, other]

    cs.CL cs.AI cs.LG

    SciMON: Scientific Inspiration Machines Optimized for Novelty

    Authors: Qingyun Wang, Doug Downey, Heng Ji, Tom Hope

    Abstract: We explore and enhance the ability of neural language models to generate novel scientific directions grounded in literature. Work on literature-based hypothesis generation has traditionally focused on binary link prediction--severely limiting the expressivity of hypotheses. This line of work also does not focus on optimizing novelty. We take a dramatic departure with a novel setting in which model…

    Submitted 3 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 21 pages. Code and resource are available at https://github.com/EagleW/CLBD. Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

  33. arXiv:2305.14225  [pdf, other]

    cs.CL

    ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Kathleen McKeown, Heng Ji

    Abstract: Considerable advancements have been made to tackle the misrepresentation of information derived from reference articles in the domains of fact-checking and faithful summarization. However, an unaddressed aspect remains - the identification of social media posts that manipulate information within associated news articles. This task presents a significant challenge, primarily due to the prevalence o…

    Submitted 12 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  34. arXiv:2305.12798  [pdf, other]

    cs.CL cs.AI cs.LG

    Word Embeddings Are Steers for Language Models

    Authors: Chi Han, Jialiang Xu, Manling Li, Yi Fung, Chenkai Sun, Nan Jiang, Tarek Abdelzaher, Heng Ji

    Abstract: Language models (LMs) automatically learn word embeddings during pre-training on language corpora. Although word embeddings are usually interpreted as feature vectors for individual words, their roles in language model generation remain underexplored. In this work, we theoretically and empirically revisit output word embeddings and find that their linear transformations are equivalent to steering…

    Submitted 6 June, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2024 Long Paper, 9 pages, 3 figures

  35. arXiv:2305.12766  [pdf, other]

    cs.CL cs.AI cs.LG

    Explaining Emergent In-Context Learning as Kernel Regression

    Authors: Chi Han, Ziqi Wang, Han Zhao, Heng Ji

    Abstract: Large language models (LLMs) have initiated a paradigm shift in transfer learning. In contrast to the classic pretraining-then-finetuning procedure, in order to use LLMs for downstream prediction tasks, one only needs to provide a few demonstrations, known as in-context examples, without adding more or updating existing model parameters. This in-context learning (ICL) capability of LLMs is intrigu…

    Submitted 5 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures

  36. arXiv:2305.12738  [pdf, other]

    cs.AI cs.LG cs.LO

    Logical Entity Representation in Knowledge-Graphs for Differentiable Rule Learning

    Authors: Chi Han, Qizheng He, Charles Yu, Xinya Du, Hanghang Tong, Heng Ji

    Abstract: Probabilistic logical rule learning has shown great strength in logical rule mining and knowledge graph completion. It learns logical rules to predict missing edges by reasoning on existing edges in the knowledge graph. However, previous efforts have largely been limited to only modeling chain-like Horn clauses such as $R_1(x,z)\land R_2(z,y)\Rightarrow H(x,y)$. This formulation overlooks addition…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 9 pages, 5 figures; accepted by 11th International Conference on Learning Representations (ICLR 2023)

  37. arXiv:2305.12565  [pdf, other]

    cs.CL

    Understanding the Effect of Data Augmentation on Knowledge Distillation

    Authors: Ziqi Wang, Chi Han, Wenxuan Bao, Heng Ji

    Abstract: Knowledge distillation (KD) requires sufficient data to transfer knowledge from large-scale teacher models to small-scale student models. Therefore, data augmentation has been widely used to mitigate the shortage of data under specific scenarios. Classic data augmentation techniques, such as synonym replacement and k-nearest-neighbors, are initially designed for fine-tuning. To avoid severe semant…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 14 pages, 8 tables, 5 figures

  38. arXiv:2305.11744  [pdf, other]

    cs.IR cs.CL

    ReFIT: Relevance Feedback from a Reranker during Inference

    Authors: Revanth Gangi Reddy, Pradeep Dasigi, Md Arafat Sultan, Arman Cohan, Avirup Sil, Heng Ji, Hannaneh Hajishirzi

    Abstract: Retrieve-and-rerank is a prevalent framework in neural information retrieval, wherein a bi-encoder network initially retrieves a pre-defined number of candidates (e.g., K=100), which are then reranked by a more powerful cross-encoder model. While the reranker often yields improved candidate scores compared to the retriever, its scope is confined to only the top K retrieved candidates. As a result,…

    Submitted 28 May, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Preprint

  39. arXiv:2305.11499  [pdf, other]

    cs.CL

    RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought

    Authors: Tianci Xue, Ziqi Wang, Zhenhailong Wang, Chi Han, Pengfei Yu, Heng Ji

    Abstract: Large language Models (LLMs) have achieved promising performance on arithmetic reasoning tasks by incorporating step-by-step chain-of-thought (CoT) prompting. However, LLMs face challenges in maintaining factual consistency during reasoning, exhibiting tendencies to condition overlooking, question misinterpretation, and condition hallucination over given problems. Existing methods use coarse-grain…

    Submitted 1 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: 24 pages, 21 figures

  40. arXiv:2305.10683  [pdf, other]

    cs.CV cs.CL

    Paxion: Patching Action Knowledge in Video-Language Foundation Models

    Authors: Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji

    Abstract: Action knowledge involves the understanding of textual, visual, and temporal aspects of actions. We introduce the Action Dynamics Benchmark (ActionBench) containing two carefully designed probing tasks: Action Antonym and Video Reversal, which targets multimodal alignment capabilities and temporal understanding skills of the model, respectively. Despite recent video-language models' (VidLM) impres…

    Submitted 21 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 spotlight

  41. arXiv:2305.10314  [pdf, other]

    cs.CL cs.AI cs.SE

    LeTI: Learning to Generate from Textual Interactions

    Authors: Xingyao Wang, Hao Peng, Reyhaneh Jabbarvand, Heng Ji

    Abstract: Fine-tuning pre-trained language models (LMs) is essential for enhancing their capabilities. Existing techniques commonly fine-tune on input-output pairs (e.g., instruction tuning) or with numerical rewards that gauge the output quality (e.g., RLHF). We explore LMs' potential to learn from textual interactions (LETI) that not only check their correctness with binary labels but also pinpoint and ex…

    Submitted 19 March, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NAACL 2024 Findings

  42. arXiv:2305.07982  [pdf, other]

    cs.CL

    Zero-shot Faithful Factual Error Correction

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Heng Ji

    Abstract: Faithfully correcting factual errors is critical for maintaining the integrity of textual knowledge bases and preventing hallucinations in sequence-to-sequence models. Drawing on humans' ability to identify and correct factual errors, we present a zero-shot framework that formulates questions about input claims, looks for correct answers in the given evidence, and assesses the faithfulness of each…

    Submitted 27 May, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023

  43. arXiv:2305.06407  [pdf, other]

    cs.CV cs.AI

    Combo of Thinking and Observing for Outside-Knowledge VQA

    Authors: Qingyi Si, Yuchen Mo, Zheng Lin, Huishan Ji, Weiping Wang

    Abstract: Outside-knowledge visual question answering is a challenging task that requires both the acquisition and the use of open-ended real-world knowledge. Some existing solutions draw external knowledge into the cross-modality space which overlooks the much vaster textual knowledge in natural-language space, while others transform the image into a text that further fuses with the textual knowledge into…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: ACL-23, Main Conference

  44. arXiv:2304.10689  [pdf, ps, other]

    math.DS

    Decay of geometry for a class of cubic polynomials

    Authors: Haoyang Ji, Wenxiu Ma

    Abstract: In this paper we study a class of bimodal cubic polynomials for which its critical points have the same $ω$-limit set which is an invariant Cantor set. These maps have generalized Fibonacci combinatorics in terms of generalized renormalization on the twin principal nest. It is proved that such maps possess `decay of geometry' in the sense that the scaling factor of the twin principal nest decrease…

    Submitted 5 July, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  45. arXiv:2304.08354  [pdf, other]

    cs.CL cs.AI cs.LG

    Tool Learning with Foundation Models

    Authors: Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu , et al. (16 additional authors not shown)

    Abstract: Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced a…

    Submitted 15 June, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  46. Demystifying CXL Memory with Genuine CXL-Ready Systems and Devices

    Authors: Yan Sun, Yifan Yuan, Zeduo Yu, Reese Kuper, Chihun Song, Jinghan Huang, Houxiang Ji, Siddharth Agarwal, Jiaqi Lou, Ipoom Jeong, Ren Wang, Jung Ho Ahn, Tianyin Xu, Nam Sung Kim

    Abstract: The ever-growing demands for memory with larger capacity and higher bandwidth have driven recent innovations on memory expansion and disaggregation technologies based on Compute eXpress Link (CXL). Especially, CXL-based memory expansion technology has recently gained notable attention for its ability not only to economically expand memory capacity and bandwidth but also to decouple memory technolo…

    Submitted 4 October, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: This paper has been accepted by MICRO'23. Please refer to https://doi.org/10.1145/3613424.3614256 for the official version of this paper.

    ACM Class: C.4; D.4; C.0

  47. arXiv:2303.14794  [pdf, other]

    cond-mat.mes-hall

    Anomalous open orbits in Hofstadter spectrum of Chern insulator

    Authors: Haijiao Ji, Noah F. Q. Yuan, Hua Jiang, Haiwen Liu, X. C. Xie

    Abstract: The nontrivial band topology can influence the Hofstadter spectrum. We investigate the Hofstadter spectrum for various models of Chern insulators under a rational flux $\frac{φ_{0}}{q}$, where $φ_{0}=\frac{h}{e}$ and $q$ is an integer. We find two major features. First, the number of splitting subbands is $|q-C|$ with Chern number $C$. Second, the anomalous open-orbit subbands with Chern numbers…

    Submitted 26 March, 2023; originally announced March 2023.

  48. arXiv:2303.14728  [pdf, other]

    physics.flu-dyn math.AP math.DS

    Coarsening of thin films with weak condensation

    Authors: Hangjie Ji, Thomas P. Witelski

    Abstract: A lubrication model can be used to describe the dynamics of a weakly volatile viscous fluid layer on a hydrophobic substrate. Thin layers of the fluid are unstable to perturbations and break up into slowly evolving interacting droplets. A reduced-order dynamical system is derived from the lubrication model based on the nearest-neighbor droplet interactions in the weak condensation limit. Dynamics…

    Submitted 16 January, 2024; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: 24 pages, 10 figures

  49. arXiv:2303.14337  [pdf, other]

    cs.CL

    SmartBook: AI-Assisted Situation Report Generation for Intelligence Analysts

    Authors: Revanth Gangi Reddy, Daniel Lee, Yi R. Fung, Khanh Duy Nguyen, Qi Zeng, Manling Li, Ziqi Wang, Clare Voss, Heng Ji

    Abstract: Timely and comprehensive understanding of emerging events is crucial for effective decision-making; automating situation report generation can significantly reduce the time, effort, and cost for intelligence analysts. In this work, we identify intelligence analysts' practices and preferences for AI assistance in situation report generation to guide the design strategies for an effective, trust-bui…

    Submitted 27 May, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Preprint

  50. arXiv:2303.09093  [pdf, other]

    cs.CL

    GLEN: General-Purpose Event Detection for Thousands of Types

    Authors: Qiusi Zhan, Sha Li, Kathryn Conger, Martha Palmer, Heng Ji, Jiawei Han

    Abstract: The progress of event extraction research has been hindered by the absence of wide-coverage, large-scale datasets. To make event extraction systems more accessible, we build a general-purpose event detection dataset GLEN, which covers 205K event mentions with 3,465 different types, making it more than 20x larger in ontology than today's largest event dataset. GLEN is created by utilizing the DWD O…

    Submitted 31 October, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted to EMNLP 2023. The first two authors contributed equally. (16 pages)