
Showing 1–50 of 551 results for author: Ji, H

  1. arXiv:2407.04929  [pdf, other]

    cs.RO

    Toward Precise Robotic Weed Flaming Using a Mobile Manipulator with a Flamethrower

    Authors: Di Wang, Chengsong Hu, Shuangyu Xie, Joe Johnson, Hojun Ji, Yingtao Jiang, Muthukumar Bagavathiannan, Dezhen Song

    Abstract: Robotic weed flaming is a new and environmentally friendly approach to weed removal in the agricultural field. Using a mobile manipulator equipped with a flamethrower, we design a new system and algorithm to enable effective weed flaming, which requires robotic manipulation with a soft and deformable end effector, as the thermal coverage of the flame is affected by dynamic or unknown environmental… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: IROS 2024

  2. arXiv:2407.03688  [pdf, other]

    physics.optics

    Adaptive sampling strategy for tolerance analysis of freeform optical surfaces based on critical ray aiming

    Authors: Rundong Fan, Shili Wei, Zhuang Qian, Huiru Ji, Hao Tan, Yan Mo, Donglin Ma

    Abstract: The tolerance analysis of freeform surfaces plays a crucial role in the development of advanced imaging systems. However, the intricate relationship between surface error and imaging quality poses significant challenges, necessitating dense sampling of featured rays during the computation process to ensure an accurate tolerance for different fields of view (FOVs). Here, we propose an adaptive samp… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  3. arXiv:2407.03040  [pdf, other]

    cs.CL cs.AI

    Raw Text is All you Need: Knowledge-intensive Multi-turn Instruction Tuning for Large Language Model

    Authors: Xia Hou, Qifeng Li, Jian Yang, Tongliang Li, Linzheng Chai, Xianjie Wu, Hangyuan Ji, Zhoujun Li, Jixuan Nie, Jingbo Dun, Wenfeng Song

    Abstract: Instruction tuning as an effective technique aligns the outputs of large language models (LLMs) with human preference. But how to generate the seasonal multi-turn dialogues from raw documents for instruction tuning still requires further exploration. In this paper, we present a novel framework named R2S that leverages the CoD-Chain of Dialogue logic to guide large language models (LLMs) in generat… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 11 pages, 3 figures

    MSC Class: 68T50; ACM Class: I.2.7

  4. arXiv:2407.01100  [pdf, other]

    cs.CL cs.LG

    Eliminating Position Bias of Language Models: A Mechanistic Approach

    Authors: Ziqi Wang, Hanlin Zhang, Xiner Li, Kuan-Hao Huang, Chi Han, Shuiwang Ji, Sham M. Kakade, Hao Peng, Heng Ji

    Abstract: Position bias has proven to be a prevalent issue of modern language models (LMs), where the models prioritize content based on its position within the given context. This bias often leads to unexpected model failures and hurts performance, robustness, and reliability across various applications. Our mechanistic analysis attributes the position bias to two components employed in nearly all state-of… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 18 pages, 5 figures

  5. arXiv:2407.00589  [pdf, other]

    physics.flu-dyn

    Modeling film flows down a rotating slippery cylinder

    Authors: Souradip Chattopadhyay, Amar K. Gaonkar, Hangjie Ji

    Abstract: This study investigates the nonlinear stability and dynamics of gravity-driven viscous films on a vertical rotating cylinder, considering both outer and inner surface flows with slip conditions at the cylinder wall. We develop an asymptotic model for the combined effects of rotation and wall slippage. Linear stability analysis indicates that wall slippage enhances instability on both surfaces, whi… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 23 pages, 15 figures

  6. arXiv:2406.15657  [pdf, other]

    cs.IR

    FIRST: Faster Improved Listwise Reranking with Single Token Decoding

    Authors: Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji

    Abstract: Large Language Models (LLMs) have significantly advanced the field of information retrieval, particularly for reranking. Listwise LLM rerankers have showcased superior performance and generalizability compared to existing supervised approaches. However, conventional listwise LLM reranking methods lack efficiency as they provide ranking output in the form of a generated ordered sequence of candidat… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Preprint

  7. arXiv:2406.14137  [pdf, other]

    cs.CL

    MACAROON: Training Vision-Language Models To Be Your Engaged Partners

    Authors: Shujin Wu, Yi R. Fung, Sha Li, Yixin Wan, Kai-Wei Chang, Heng Ji

    Abstract: Large vision-language models (LVLMs), while proficient in following instructions and responding to diverse questions, invariably generate detailed responses even when questions are ambiguous or unanswerable, leading to hallucinations and bias issues. Thus, it is essential for LVLMs to proactively engage with humans to ask for clarifications or additional information for better responses. In this s… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: The code will be made public at https://github.com/ShujinWu-0814/MACAROON

  8. arXiv:2406.07067  [pdf, other]

    cs.IR cs.AI

    TIM: Temporal Interaction Model in Notification System

    Authors: Huxiao Ji, Haitao Yang, Linchuan Li, Shunyu Zhang, Cunyi Zhang, Xuanping Li, Wenwu Ou

    Abstract: Modern mobile applications heavily rely on the notification system to acquire daily active users and enhance user engagement. Being able to proactively reach users, the system has to decide when to send notifications to users. Although many researchers have studied optimizing the timing of sending notifications, they only utilized users' contextual features, without modeling users' behavior patter… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  9. arXiv:2406.03871  [pdf]

    physics.acc-ph

    Development of high-level applications for High Energy Photon Source booster

    Authors: Yuemei Peng, Daheng Ji, Hongfei Ji, Nan Li, Xiaohan Lu, Saike Tian, Yuanyuan Wei, Haisheng Xu, Yaliang Zhao, Yi Jiao, Jingyi Li

    Abstract: The High Energy Photon Source (HEPS) is the first fourth-generation storage ring light source being built in the suburb of Beijing, China. The storage ring was designed with the emittance lower than 60 pm.rad with a circumference of 1.36 km and beam energy of 6 GeV. Its injector contains a 500 MeV S-band Linac and a 454 m booster which was designed as an accumulator at the extraction energy. In t… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  10. arXiv:2406.02056  [pdf, other]

    cs.LG cs.NE

    CAP: A Context-Aware Neural Predictor for NAS

    Authors: Han Ji, Yuqi Feng, Yanan Sun

    Abstract: Neural predictors are effective in boosting the time-consuming performance evaluation stage in neural architecture search (NAS), owing to their direct estimation of unseen architectures. Despite the effectiveness, training a powerful neural predictor with fewer annotated architectures remains a huge challenge. In this paper, we propose a context-aware neural predictor (CAP) which only needs a few… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by IJCAI24

  11. arXiv:2406.00875  [pdf, other]

    physics.plasm-ph astro-ph.EP astro-ph.HE astro-ph.SR physics.space-ph

    Ohm's Law, the Reconnection Rate, and Energy Conversion in Collisionless Magnetic Reconnection

    Authors: Yi-Hsin Liu, Michael Hesse, Kevin Genestreti, Rumi Nakamura, Jim Burch, Paul Cassak, Naoki Bessho, Jonathan Eastwood, Tai Phan, Marc Swisdak, Sergio Toledo-Redondo, Masahiro Hoshino, Cecilia Norgren, Hantao Ji, TKM Nakamura

    Abstract: Magnetic reconnection is a ubiquitous plasma process that transforms magnetic energy into particle energy during eruptive events throughout the universe. Reconnection not only converts energy during solar flares and geomagnetic substorms that drive space weather near Earth, but it may also play critical roles in the high energy emissions from the magnetospheres of neutron stars and black holes. In… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Submitted to Space Science Reviews. This is a review paper as an outcome of the 2022 Magnetic Reconnection Workshop in the International Space Science Institute

  12. arXiv:2405.20015  [pdf, other]

    cs.AI cs.CL

    Efficient LLM-Jailbreaking by Introducing Visual Modality

    Authors: Zhenxing Niu, Yuyao Sun, Haodong Ren, Haoxuan Ji, Quan Wang, Xiaoke Ma, Gang Hua, Rong Jin

    Abstract: This paper focuses on jailbreaking attacks against large language models (LLMs), eliciting them to generate objectionable content in response to harmful user queries. Unlike previous LLM-jailbreaks that directly orient to LLMs, our approach begins by constructing a multimodal large language model (MLLM) through the incorporation of a visual module into the target LLM. Subsequently, we conduct an e… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  13. arXiv:2405.17303  [pdf, other]

    astro-ph.SR

    High-Resolution Observation and Magnetic Modeling of a Solar Minifilament: the Formation, Eruption and Failing Mechanisms

    Authors: Weilin Teng, Yingna Su, Rui Liu, Jialin Chen, Yanjie Liu, Jun Dai, Wenda Cao, Jinhua Shen, Haisheng Ji

    Abstract: Minifilaments are widespread small-scale structures in the solar atmosphere. To better understand their formation and eruption mechanisms, we investigate the entire life of a sigmoidal minifilament located below a large quiescent filament observed by BBSO/GST on 2015 August 3. The Hα structure initially appears as a group of arched threads, then transforms into two J-shaped arcades, and finally fo… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  14. arXiv:2405.15028  [pdf, other]

    cs.CL cs.IR

    AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

    Authors: Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar

    Abstract: Ranking is a fundamental and popular problem in search. However, existing ranking algorithms usually restrict the granularity of ranking to full passages or require a specific dense index for each desired level of granularity. Such lack of flexibility in granularity negatively affects many applications that can benefit from more granular ranking, such as sentence-level ranking for open-domain ques… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  15. arXiv:2405.14203  [pdf, other]

    cs.LG cs.AI physics.chem-ph

    GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices

    Authors: Thao Nguyen, Tiara Torres-Flores, Changhyun Hwang, Carl Edwards, Ying Diao, Heng Ji

    Abstract: This paper presents a novel approach for predicting Power Conversion Efficiency (PCE) of Organic Photovoltaic (OPV) devices, called GLaD: synergizing molecular Graphs and Language Descriptors for enhanced PCE prediction. Due to the lack of high-quality experimental data, we collect a dataset consisting of 500 pairs of OPV donor and acceptor molecules along with their corresponding PCE values, whic… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: In progress

  16. arXiv:2405.13179  [pdf, other]

    cs.CL

    RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts

    Authors: Yuelyu Ji, Zhuochun Li, Rui Meng, Sonish Sivarajkumar, Yanshan Wang, Zeshui Yu, Hui Ji, Yushui Han, Hanyu Zeng, Daqing He

    Abstract: This paper introduces the RAG-RLRC-LaySum framework, designed to make complex biomedical research understandable to laymen through advanced Natural Language Processing (NLP) techniques. Our Retrieval Augmented Generation (RAG) solution, enhanced by a reranking method, utilizes multiple knowledge sources to ensure the precision and pertinence of lay summaries. Additionally, our Reinforcement Learni… ▽ More

    Submitted 24 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  17. arXiv:2405.13005  [pdf]

    cs.CL cs.AI cs.SI

    Understanding the Rare Inflammatory Disease Using Large Language Models and Social Media Data

    Authors: Nan Miles Xi, Hong-Long Ji, Lin Wang

    Abstract: Sarcoidosis is a rare inflammatory disease characterized by the formation of granulomas in various organs. The disease presents diagnostic and treatment challenges due to its diverse manifestations and unpredictable nature. In this study, we employed a Large Language Model (LLM) to analyze sarcoidosis-related discussions on the social media platform Reddit. Our findings underscore the efficacy of… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  18. arXiv:2405.05481  [pdf, other]

    quant-ph

    Achieving millisecond coherence fluxonium through overlap Josephson junctions

    Authors: Fei Wang, Kannan Lu, Huijuan Zhan, Lu Ma, Feng Wu, Hantao Sun, Hao Deng, Yang Bai, Feng Bao, Xu Chang, Ran Gao, Xun Gao, Guicheng Gong, Lijuan Hu, Ruizi Hu, Honghong Ji, Xizheng Ma, Liyong Mao, Zhijun Song, Chengchun Tang, Hongcheng Wang, Tenghui Wang, Ziang Wang, Tian Xia, Hongxin Xu, et al. (10 additional authors not shown)

    Abstract: Fluxonium qubits are recognized for their high coherence times and high operation fidelities, attributed to their unique design incorporating over 100 Josephson junctions per superconducting loop. However, this complexity poses significant fabrication challenges, particularly in achieving high yield and junction uniformity with traditional methods. Here, we introduce an overlap process for Josephs… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  19. arXiv:2405.04602  [pdf, other]

    cs.SE

    An Empirical Study of Kotlin-Java Interactions

    Authors: Qiong Feng, Huan Ji, Xiaotian Ma, Peng Liang

    Abstract: Background: Since Google introduced Kotlin as an official programming language for developing Android apps in 2017, Kotlin has gained widespread adoption in Android development. The interoperability of Java and Kotlin's design nature allows them to coexist and interact with each other smoothly within a project. Aims: However, there is limited research on how Java and Kotlin interact with each othe… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  20. arXiv:2405.03446  [pdf, other]

    cs.CR

    SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large Language Models in Cyber Threat Intelligence

    Authors: Hangyuan Ji, Jian Yang, Linzheng Chai, Chaoren Wei, Liqun Yang, Yunlong Duan, Yunli Wang, Tianzhen Sun, Hongcheng Guo, Tongliang Li, Changyu Ren, Zhoujun Li

    Abstract: To address the increasing complexity and frequency of cybersecurity incidents emphasized by the recent cybersecurity threat reports with over 10 billion instances, cyber threat intelligence (CTI) plays a critical role in the modern cybersecurity landscape by offering the insights required to understand and combat the constantly evolving nature of cyber threats. Inspired by the powerful capability… ▽ More

    Submitted 3 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  21. arXiv:2404.17512  [pdf, other]

    math.PR

    On the spectral edge of non-Hermitian random matrices

    Authors: Andrew Campbell, Giorgio Cipolloni, László Erdős, Hong Chang Ji

    Abstract: For general non-Hermitian random matrices $X$ and deterministic deformation matrices $A$, we prove that the local eigenvalue statistics of $A+X$ close to the typical edge points of its spectrum are universal. Furthermore, we show that under natural assumptions on $A$ the spectrum of $A+X$ does not have outliers at a distance larger than the natural fluctuation scale of the eigenvalues. As a conseq… ▽ More

    Submitted 6 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 51 pages

    MSC Class: 15B52; 60B20

  22. arXiv:2404.16792  [pdf, other]

    cs.LG cs.AI cs.CL

    Weak-to-Strong Extrapolation Expedites Alignment

    Authors: Chujie Zheng, Ziqi Wang, Heng Ji, Minlie Huang, Nanyun Peng

    Abstract: The open-source community is experiencing a surge in the release of large language models (LLMs) that are trained to follow instructions and align with human preference. However, further training to improve them still requires expensive computational resources and data annotations. Is it possible to bypass additional training and cost-effectively acquire better-aligned models? Inspired by the lite… ▽ More

    Submitted 22 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Add theoretical explanation and more evaluation results

  23. arXiv:2404.12666  [pdf, other]

    cs.DC cs.CR cs.ET

    A Survey on Federated Analytics: Taxonomy, Enabling Techniques, Applications and Open Issues

    Authors: Zibo Wang, Haichao Ji, Yifei Zhu, Dan Wang, Zhu Han

    Abstract: The escalating influx of data generated by networked edge devices, coupled with the growing awareness of data privacy, has promoted a transformative shift in computing paradigms from centralized data processing to privacy-preserved distributed data processing. Federated analytics (FA) is an emerging technique to support collaborative data analytics among diverse data owners without centralizing th… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: This survey has been submitted to IEEE Communications Surveys & Tutorials

  24. arXiv:2404.12135  [pdf, other]

    cs.MA cs.CR cs.DC

    mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture

    Authors: Wei Zhang, Hongcheng Guo, Jian Yang, Yi Zhang, Chaoran Yan, Zhoujin Tian, Hangyuan Ji, Zhoujun Li, Tongliang Li, Tieqiao Zheng, Chao Chen, Yi Liang, Xu Shi, Liangfan Zheng, Bo Zhang

    Abstract: The escalating complexity of micro-services architecture in cloud-native technologies poses significant challenges for maintaining system stability and efficiency. To conduct root cause analysis (RCA) and resolution of alert events, we propose a pioneering framework, multi-Agent Blockchain-inspired Collaboration for root cause analysis in micro-services architecture (mABC), to revolutionize the AI… ▽ More

    Submitted 3 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  25. arXiv:2404.06479  [pdf, other]

    cs.CL cs.AI cs.CV

    Text-Based Reasoning About Vector Graphics

    Authors: Zhenhailong Wang, Joy Hsu, Xingyao Wang, Kuan-Hao Huang, Manling Li, Jiajun Wu, Heng Ji

    Abstract: While large multimodal models excel in broad vision-language benchmarks, they often struggle with tasks requiring precise perception of low-level visual details, such as comparing line lengths or solving simple mazes. In particular, this failure mode persists in question-answering tasks about vector graphics -- images composed purely of 2D objects and shapes. To address this challenge, we propose… ▽ More

    Submitted 24 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Project page: https://mikewangwzhl.github.io/VDLM/

  26. arXiv:2404.01652  [pdf, other]

    cs.CL cs.AI

    Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization

    Authors: Zixuan Zhang, Revanth Gangi Reddy, Kevin Small, Tong Zhang, Heng Ji

    Abstract: Open-domain Question Answering (OpenQA) aims at answering factual questions with an external large-scale knowledge corpus. However, real-world knowledge is not static; it updates and evolves continually. Such a dynamic characteristic of knowledge poses a vital challenge for these models, as the trained models need to constantly adapt to the latest information to make sure that the answers remain a… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024 Findings

  27. arXiv:2403.19081  [pdf]

    physics.optics

    Surface variation analysis of freeform optical systems over surface frequency bands for prescribed wavefront errors

    Authors: Rundong Fan, Shili Wei, Huiru JI, Zhuang Qian, Hao Tan, Yan Mo, Donglin MA

    Abstract: The surface errors of freeform surfaces reflect the manufacturing complexities and significantly impact the feasibility of processing designed optical systems. With multiple degrees of freedom, freeform surfaces pose challenges in surface tolerance analysis in the field. Nevertheless, current research has neglected the influence of surface slopes on the directions of ray propagation. A sudden alte… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  28. arXiv:2403.18671  [pdf, other]

    cs.CL cs.LG

    Fact Checking Beyond Training Set

    Authors: Payam Karisani, Heng Ji

    Abstract: Evaluating the veracity of everyday claims is time consuming and in some cases requires domain expertise. We empirically demonstrate that the commonly used fact checking pipeline, known as the retriever-reader, suffers from performance deterioration when it is trained on the labeled data from one domain and used in another domain. Afterwards, we delve into each component of the pipeline and propos… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: NAACL 2024

  29. arXiv:2403.16823  [pdf, ps, other]

    eess.SY cs.LG

    Resource and Mobility Management in Hybrid LiFi and WiFi Networks: A User-Centric Learning Approach

    Authors: Han Ji, Xiping Wu

    Abstract: Hybrid light fidelity (LiFi) and wireless fidelity (WiFi) networks (HLWNets) are an emerging indoor wireless communication paradigm, which combines the advantages of the capacious optical spectra of LiFi and ubiquitous coverage of WiFi. Meanwhile, load balancing (LB) becomes a key challenge in resource management for such hybrid networks. The existing LB methods are mostly network-centric, relying… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 12 pages, 12 figures, 3 tables, submitted to IEEE TWC

  30. arXiv:2403.12027  [pdf, other]

    cs.CL cs.AI cs.CV

    From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Yi R. Fung, Haoyi Qiu, Mingyang Zhou, Shafiq Joty, Shih-Fu Chang, Heng Ji

    Abstract: Data visualization in the form of charts plays a pivotal role in data analysis, offering critical insights and aiding in informed decision-making. Automatic chart understanding has witnessed significant advancements with the rise of large foundation models in recent years. Foundation models, such as large language models, have revolutionized various natural language processing tasks and are increa… ▽ More

    Submitted 25 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  31. arXiv:2403.08069  [pdf]

    cond-mat.str-el cond-mat.mtrl-sci

    Noncentrosymmetric Triangular Magnet CaMnTeO$_6$: Strong Quantum Fluctuations and Role of s0 vs. s2 Electronic States in Competing Exchange Interactions

    Authors: Xudong Huai, Emmanuel Acheampong, Erich Delles, Michał J. Winiarski, Maurice Sorolla II, Lila Nassar, Mingli Liang, Caleb Ramette, Huiwen Ji, Allen Scheie, Stuart Calder, Martin Mourigal, Thao T. Tran

    Abstract: Noncentrosymmetric triangular magnets offer a unique platform for realizing strong quantum fluctuations. However, designing these quantum materials remains an open challenge attributable to a knowledge gap in the tunability of competing exchange interactions at the atomic level. Here, we create a new noncentrosymmetric triangular S = 3/2 magnet CaMnTeO$_6$ based on careful chemical and physical co… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  32. arXiv:2403.06093  [pdf, other]

    cs.CV

    Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors

    Authors: Haoxuanye Ji, Pengpeng Liang, Erkang Cheng

    Abstract: Multi-camera-based 3D object detection has made notable progress in the past several years. However, we observe that there are cases (e.g. faraway regions) in which popular 2D object detectors are more reliable than state-of-the-art 3D detectors. In this paper, to improve the performance of query-based 3D object detectors, we present a novel query generating approach termed QAF2D, which infers 3D… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  33. arXiv:2403.05159  [pdf, other]

    cs.CV

    LVIC: Multi-modality segmentation by Lifting Visual Info as Cue

    Authors: Zichao Dong, Bowen Pang, Xufeng Huang, Hang Ji, Xin Zhan, Junbo Chen

    Abstract: Multi-modality fusion has proven to be an effective method for 3D perception in autonomous driving. However, most current multi-modality fusion pipelines for LiDAR semantic segmentation have complicated fusion mechanisms. Point painting is a quite straightforward method that directly binds LiDAR points with visual information. Unfortunately, previous point-painting-like methods suffer from projection e… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  34. arXiv:2403.00791  [pdf, other]

    cs.CL cs.AI q-bio.BM q-bio.QM

    L+M-24: Building a Dataset for Language + Molecules @ ACL 2024

    Authors: Carl Edwards, Qingyun Wang, Lawrence Zhao, Heng Ji

    Abstract: Language-molecule models have emerged as an exciting direction for molecular discovery and understanding. However, training these models is challenging due to the scarcity of molecule-language pair datasets. At this point, datasets have been released which are 1) small and scraped from existing databases, 2) large but noisy and constructed by performing entity linking on the scientific literature,… ▽ More

    Submitted 4 July, 2024; v1 submitted 22 February, 2024; originally announced March 2024.

    Comments: The dataset, finetuned baselines, and evaluation code are released publicly at https://github.com/language-plus-molecules/LPM-24-Dataset through https://huggingface.co/language-plus-molecules

  35. arXiv:2402.19275  [pdf, other]

    eess.SY cs.LG

    Adaptive Testing Environment Generation for Connected and Automated Vehicles with Dense Reinforcement Learning

    Authors: Jingxuan Yang, Ruoxuan Bai, Haoyuan Ji, Yi Zhang, Jianming Hu, Shuo Feng

    Abstract: The assessment of safety performance plays a pivotal role in the development and deployment of connected and automated vehicles (CAVs). A common approach involves designing testing scenarios based on prior knowledge of CAVs (e.g., surrogate models), conducting tests in these scenarios, and subsequently evaluating CAVs' safety performances. However, substantial differences between CAVs and the prio… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  36. arXiv:2402.18077  [pdf, ps, other]

    astro-ph.SR

    Locating heating channels of the solar corona in a plage region with the aid of high-resolution 10830 Å filtergrams

    Authors: Parida Hashim, Fangyu Xu, Ya Wang, Weijie Men, Jinhua Shen, Yingna Su, Jianping Li, Zhenyu Jin, Haisheng Ji

    Abstract: In this paper, with a set of high-resolution He I 10830 Å filtergrams, we select an area in a plage, very likely an EUV moss area, as an interface layer to follow the clues of coronal heating channels down to the photosphere. The filtergrams are obtained from the 1-meter aperture New Vacuum Solar Telescope (NVST). We make a distinction between the darker and the brighter regions in the selected ar… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: ApJ accepted for publication. 11 pages, 7 figures

  37. arXiv:2402.16315  [pdf, other]

    cs.CV cs.CL

    Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models

    Authors: Jeonghwan Kim, Heng Ji

    Abstract: Recent advances in instruction-tuned Large Vision-Language Models (LVLMs) have imbued the models with the ability to generate high-level, image-grounded explanations with ease. While such capability is largely attributed to the rich world knowledge contained within the Large Language Models (LLMs), our work reveals their shortcomings in fine-grained visual categorization (FGVC) across six differen… ▽ More

    Submitted 11 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  38. arXiv:2402.15796  [pdf]

    cs.AI cs.HC

    Construction and application of artificial intelligence crowdsourcing map based on multi-track GPS data

    Authors: Yong Wang, Yanlin Zhou, Huan Ji, Zheng He, Xinyu Shen

    Abstract: In recent years, the rapid development of high-precision map technology combined with artificial intelligence has ushered in a new development opportunity in the field of intelligent vehicles. High-precision map technology is an important guarantee for intelligent vehicles to achieve autonomous driving. However, due to the lack of research on high-precision map technology, it is difficult to ratio… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  39. arXiv:2402.14312  [pdf, other]

    astro-ph.IM astro-ph.CO astro-ph.EP astro-ph.GA

    The Jiao Tong University Spectroscopic Telescope Project

    Authors: JUST Team, Chengze Liu, Ying Zu, Fabo Feng, Zhaoyu Li, Yu Yu, Hua Bai, Xiangqun Cui, Bozhong Gu, Yizhou Gu, Jiaxin Han, Yonghui Hou, Zhongwen Hu, Hangxin Ji, Yipeng Jing, Wei Li, Zhaoxiang Qi, Xianyu Tan, Cairang Tian, Dehua Yang, Xiangyan Yuan, Chao Zhai, Congcong Zhang, Jun Zhang, Haotong Zhang, et al. (6 additional authors not shown)

    Abstract: The Jiao Tong University Spectroscopic Telescope (JUST) is a 4.4-meter f/6.0 segmented-mirror telescope dedicated to spectroscopic observations. The JUST primary mirror is composed of 18 hexagonal segments, each with a diameter of 1.1 m. JUST provides two Nasmyth platforms for placing science instruments. One Nasmyth focus fits a field of view of 10 arcmin and the other has an extended field of vie… ▽ More

    Submitted 29 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 28 pages, 6 figures

  40. arXiv:2402.14221  [pdf, other]

    cs.DC

    Towards singular optimality in the presence of local initial knowledge

    Authors: Hongyan Ji, Sriram V. Pemmaraju

    Abstract: The Knowledge Till rho CONGEST model is a variant of the classical CONGEST model of distributed computing in which each vertex v has initial knowledge of the radius-rho ball centered at v. The most commonly studied variants of the CONGEST model are KT0 CONGEST in which nodes initially know nothing about their neighbors and KT1 CONGEST in which nodes initially know the IDs of all their neighbors. I… ▽ More

    Submitted 22 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  41. arXiv:2402.11943  [pdf, other]

    cs.CL

    LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation

    Authors: Keyang Xuan, Li Yi, Fan Yang, Ruochen Wu, Yi R. Fung, Heng Ji

    Abstract: The rise of multimodal misinformation on social platforms poses significant challenges for individuals and societies. Its increased credibility and broader impact compared to textual misinformation make detection complex, requiring robust reasoning across diverse media types and profound knowledge for accurate verification. The emergence of Large Vision Language Model (LVLM) offers a potential sol… ▽ More

    Submitted 20 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  42. arXiv:2402.11324  [pdf, other

    cs.CL

    EVEDIT: Event-based Knowledge Editing with Deductive Editing Boundaries

    Authors: Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Heng Ji

    Abstract: The dynamic nature of real-world information necessitates efficient knowledge editing (KE) in large language models (LLMs) for knowledge updating. However, current KE approaches, which typically operate on (subject, relation, object) triples, ignore the contextual information and the relation among different knowledge. Such editing methods could thus encounter an uncertain editing boundary, leavin…

    Submitted 17 February, 2024; originally announced February 2024.

  43. arXiv:2402.11060  [pdf, other

    cs.CL cs.AI cs.IR

    Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement

    Authors: Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi R. Fung, Hou Pong Chan, ChengXiang Zhai, Heng Ji

    Abstract: The increasing demand for personalized interactions with large language models (LLMs) calls for the development of methodologies capable of accurately and efficiently identifying user opinions and preferences. Retrieval augmentation emerges as an effective strategy, as it can accommodate a vast number of users without the costs from fine-tuning. Existing research, however, has largely focused on e…

    Submitted 16 February, 2024; originally announced February 2024.

  44. arXiv:2402.10980  [pdf, other

    physics.chem-ph cs.AI cs.CE cs.LG

    ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical Feedback

    Authors: Henry W. Sprueill, Carl Edwards, Khushbu Agarwal, Mariefel V. Olarte, Udishnu Sanyal, Conrad Johnston, Hongbin Liu, Heng Ji, Sutanay Choudhury

    Abstract: The discovery of new catalysts is essential for the design of new and more efficient chemical processes in order to transition to a sustainable future. We introduce an AI-guided computational screening framework unifying linguistic reasoning with quantum-chemistry based feedback from 3D atomistic representations. Our approach formulates catalyst discovery as an uncertain environment where an agent…

    Submitted 7 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 9 pages, accepted by ICML 2024, final version

  45. arXiv:2402.09463  [pdf

    eess.IV

    Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results

    Authors: Kelly Payette, Céline Steger, Roxane Licandro, Priscille de Dumast, Hongwei Bran Li, Matthew Barkovich, Liu Li, Maik Dannecker, Chen Chen, Cheng Ouyang, Niccolò McConnell, Alina Miron, Yongmin Li, Alena Uus, Irina Grigorescu, Paula Ramirez Gilliland, Md Mahfuzur Rahman Siddiquee, Daguang Xu, Andriy Myronenko, Haoyu Wang, Ziyan Huang, Jin Ye, Mireia Alenyà, Valentin Comte, Oscar Camara, et al. (42 additional authors not shown)

    Abstract: Segmentation is a critical step in analyzing the developing human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across dif…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Results from FeTA Challenge 2022, held at MICCAI; Manuscript submitted. Supplementary Info (including submission methods descriptions) available here: https://zenodo.org/records/10628648

  46. arXiv:2402.09369  [pdf, other

    cs.CL

    Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking

    Authors: Yi Fung, Ruining Zhao, Jae Doo, Chenkai Sun, Heng Ji

    Abstract: Pretrained large language models have revolutionized many applications but still face challenges related to cultural bias and a lack of cultural commonsense knowledge crucial for guiding cross-culture communication and interactions. Recognizing the shortcomings of existing methods in capturing the diverse and rich cultures across the world, this paper introduces a novel approach for massively mult…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: preprint

  47. arXiv:2402.07401  [pdf, other

    cs.CL

    Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate

    Authors: Kyungha Kim, Sangyun Lee, Kung-Hsiang Huang, Hou Pong Chan, Manling Li, Heng Ji

    Abstract: Fact-checking research has extensively explored verification but less so the generation of natural-language explanations, crucial for user trust. While Large Language Models (LLMs) excel in text generation, their capability for producing faithful explanations in fact-checking remains underexamined. Our study investigates LLMs' ability to generate such explanations, finding that zero-shot prompts o…

    Submitted 11 February, 2024; originally announced February 2024.

  48. arXiv:2402.07016  [pdf, other

    cs.AI

    REALM: RAG-Driven Enhancement of Multimodal Electronic Health Records Analysis via Large Language Models

    Authors: Yinghao Zhu, Changyu Ren, Shiyun Xie, Shukai Liu, Hangyuan Ji, Zixiang Wang, Tao Sun, Long He, Zhoujun Li, Xi Zhu, Chengwei Pan

    Abstract: The integration of multimodal Electronic Health Records (EHR) data has significantly improved clinical predictive capabilities. Leveraging clinical notes and multivariate time-series EHR, existing models often lack the medical context relevant to clinical tasks, prompting the incorporation of external knowledge, particularly from the knowledge graph (KG). Previous approaches with KG knowledge have…

    Submitted 10 February, 2024; originally announced February 2024.

  49. arXiv:2402.06193  [pdf, other

    astro-ph.SR physics.plasm-ph

    Experimental study of Alfvén wave reflection from an Alfvén-speed gradient relevant to the solar coronal holes

    Authors: Sayak Bose, Jason M. TenBarge, Troy Carter, Michael Hahn, Hantao Ji, James Juno, Daniel Wolf Savin, Shreekrishna Tripathi, Stephen Vincena

    Abstract: We report the first experimental detection of a reflected Alfvén wave from an Alfvén-speed gradient under conditions similar to those in coronal holes. The experiments were conducted in the Large Plasma Device at the University of California, Los Angeles. We present the experimentally measured dependence of the coefficient of reflection versus the wave inhomogeneity parameter, i.e., the ratio of t…

    Submitted 9 February, 2024; originally announced February 2024.

  50. arXiv:2402.06190  [pdf, other

    cs.CV cs.LG

    Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain

    Authors: Amin Karimi Monsefi, Payam Karisani, Mengxi Zhou, Stacey Choi, Nathan Doble, Heng Ji, Srinivasan Parthasarathy, Rajiv Ramnath

    Abstract: Standard modern machine-learning-based imaging methods have faced challenges in medical applications due to the high cost of dataset construction and, thereby, the limited labeled training data available. Additionally, upon deployment, these methods are usually used to process a large volume of data on a daily basis, imposing a high maintenance cost on medical facilities. In this paper, we introdu…

    Submitted 9 February, 2024; originally announced February 2024.