Skip to main content

Showing 1–50 of 145 results for author: Ni, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16715  [pdf, other

    cs.LG

    GC-Bench: A Benchmark Framework for Graph Condensation with New Insights

    Authors: Shengbo Gong, Juntong Ni, Noveen Sachdeva, Carl Yang, Wei **

    Abstract: Graph condensation (GC) is an emerging technique designed to learn a significantly smaller graph that retains the essential information of the original graph. This condensed graph has shown promise in accelerating graph neural networks while preserving performance comparable to those achieved with the original, larger graphs. Additionally, this technique facilitates downstream applications such as… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 9 pages

  2. arXiv:2406.15658  [pdf, other

    cs.CV cs.AI

    TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning

    Authors: Nemin Wu, Qian Cao, Zhangyu Wang, Ze** Liu, Yanlin Qi, Jielu Zhang, Joshua Ni, Xiaobai Yao, Hongxu Ma, Lan Mu, Stefano Ermon, Tanuja Ganu, Akshay Nambi, Ni Lao, Gengchen Mai

    Abstract: Spatial representation learning (SRL) aims at learning general-purpose neural network representations from various types of spatial data (e.g., points, polylines, polygons, networks, images, etc.) in their native formats. Learning good spatial representations is a fundamental problem for various downstream applications such as species distribution modeling, weather forecasting, trajectory generati… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures. Submitted to NeurIPS 2024 Datasets and Benchmarks Track. Under review

  3. arXiv:2406.14162  [pdf, other

    cs.IR cs.AI cs.CL

    DIRAS: Efficient LLM-Assisted Annotation of Document Relevance in Retrieval Augmented Generation

    Authors: **gwei Ni, Tobias Schimanski, Meihong Lin, Mrinmaya Sachan, Elliott Ash, Markus Leippold

    Abstract: Retrieval Augmented Generation (RAG) is widely employed to ground responses to queries on domain-specific documents. But do RAG implementations leave out important information or excessively include irrelevant information? To allay these concerns, it is necessary to annotate domain-specific benchmarks to evaluate information retrieval (IR) performance, as relevance definitions vary across queries… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.09818  [pdf, other

    cs.IR

    ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures

    Authors: Tobias Schimanski, **gwei Ni, Roberto Spacey, Nicola Ranger, Markus Leippold

    Abstract: To handle the vast amounts of qualitative data produced in corporate climate communication, stakeholders increasingly rely on Retrieval Augmented Generation (RAG) systems. However, a significant gap remains in evaluating domain-specific information retrieval - the basis for answer generation. To address this challenge, this work simulates the typical tasks of a sustainability analyst by examining… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  5. arXiv:2406.08380  [pdf, other

    cs.CL cs.SD eess.AS

    Towards Unsupervised Speech Recognition Without Pronunciation Models

    Authors: Junrui Ni, Liming Wang, Yang Zhang, Kaizhi Qian, Heting Gao, Mark Hasegawa-Johnson, Chang D. Yoo

    Abstract: Recent advancements in supervised automatic speech recognition (ASR) have achieved remarkable performance, largely due to the growing availability of large transcribed speech corpora. However, most languages lack sufficient paired speech and text data to effectively train these systems. In this article, we tackle the challenge of develo** ASR systems without paired speech and text corpora by pro… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  6. arXiv:2406.06565  [pdf, other

    cs.CL cs.AI cs.LG

    MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures

    Authors: **jie Ni, Fuzhao Xue, Xiang Yue, Yuntian Deng, Mahir Shah, Kabir Jain, Graham Neubig, Yang You

    Abstract: Evaluating large language models (LLMs) is challenging. Traditional ground-truth-based benchmarks fail to capture the comprehensiveness and nuance of real-world queries, while LLM-as-judge benchmarks suffer from grading biases and limited query quantity. Both of them may also become contaminated over time. User-facing evaluation, such as Chatbot Arena, provides reliable signals but is costly and s… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2405.15929  [pdf, other

    econ.GN cs.HC

    Product Design Using Generative Adversarial Network: Incorporating Consumer Preference and External Data

    Authors: Hui Li, Jian Ni, Fangzhu Yang

    Abstract: The development of generative artificial intelligence (AI) enables large-scale product design automation. However, this automated process usually does not incorporate consumer preference information from the internal dataset of a company. Furthermore, external sources such as social media and user-generated content (UGC) websites often contain rich product design and consumer preference informatio… ▽ More

    Submitted 2 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 46 pages, 26 figures, 5 tables

    ACM Class: I.2.6; I.5.1; I.5.4; H.2.8; J.4

  8. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  9. arXiv:2404.16666  [pdf, other

    cs.CV

    PhyRecon: Physically Plausible Neural Scene Reconstruction

    Authors: Junfeng Ni, Yixin Chen, Bohan **g, Nan Jiang, Bin Wang, Bo Dai, Puhao Li, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

    Abstract: Neural implicit representations have gained popularity in multi-view 3D reconstruction. However, most previous work struggles to yield physically plausible results, limiting their utility in domains requiring rigorous physical accuracy, such as embodied AI and robotics. This lack of plausibility stems from the absence of physics modeling in existing methods and their inability to recover intricate… ▽ More

    Submitted 2 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: project page: https://phyrecon.github.io/. arXiv admin note: text overlap with arXiv:2303.08605 by other authors

  10. arXiv:2404.15349  [pdf, other

    eess.SP cs.LG cs.MM

    A Survey on Multimodal Wearable Sensor-based Human Action Recognition

    Authors: Jianyuan Ni, Hao Tang, Syed Tousiful Haque, Yan Yan, Anne H. H. Ngu

    Abstract: The combination of increased life expectancy and falling birth rates is resulting in an aging population. Wearable Sensor-based Human Activity Recognition (WSHAR) emerges as a promising assistive technology to support the daily lives of older individuals, unlocking vast potential for human-centric applications. However, recent surveys in WSHAR have been limited, focusing either solely on deep lear… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Multimodal Survey for Wearable Sensor-based Human Action Recognition

  11. arXiv:2403.11391  [pdf, other

    cs.LG cs.CV

    Investigating the Benefits of Projection Head for Representation Learning

    Authors: Yihao Xue, Eric Gan, Jiayi Ni, Siddharth Joshi, Baharan Mirzasoleiman

    Abstract: An effective technique for obtaining high-quality representations is adding a projection head on top of the encoder during training, then discarding it and using the pre-projection representations. Despite its proven practical effectiveness, the reason behind the success of this technique is poorly understood. The pre-projection representations are not directly optimized by the loss function, rais… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Journal ref: ICLR 2024

  12. arXiv:2402.11073  [pdf, other

    cs.CL cs.AI

    AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators

    Authors: **gwei Ni, Min**g Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold

    Abstract: With the rise of generative AI, automated fact-checking methods to combat misinformation are becoming more and more important. However, factual claim detection, the first step in a fact-checking pipeline, suffers from two key issues that limit its scalability and generalizability: (1) inconsistency in definitions of the task and what a claim is, and (2) the high cost of manual annotation. To addre… ▽ More

    Submitted 2 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL2024 Main Conference

  13. arXiv:2402.09668  [pdf, other

    cs.LG cs.AI cs.CL

    How to Train Data-Efficient LLMs

    Authors: Noveen Sachdeva, Benjamin Coleman, Wang-Cheng Kang, Jianmo Ni, Lichan Hong, Ed H. Chi, James Caverlee, Julian McAuley, Derek Zhiyuan Cheng

    Abstract: The training of large language models (LLMs) is expensive. In this paper, we study data-efficient approaches for pre-training LLMs, i.e., techniques that aim to optimize the Pareto frontier of model quality and training resource/data consumption. We seek to understand the tradeoffs associated with data selection routines based on (i) expensive-to-compute data-quality estimates, and (ii) maximizati… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Under review. 44 pages, 30 figures

  14. arXiv:2402.08277  [pdf, other

    cs.CL cs.LG

    Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering

    Authors: Tobias Schimanski, **gwei Ni, Mathias Kraus, Elliott Ash, Markus Leippold

    Abstract: Advances towards more faithful and traceable answers of Large Language Models (LLMs) are crucial for various research and practical endeavors. One avenue in reaching this goal is basing the answers on reliable sources. However, this Evidence-Based QA has proven to work insufficiently with LLMs in terms of citing the correct sources (source quality) and truthfully representing the information withi… ▽ More

    Submitted 3 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  15. arXiv:2402.03358  [pdf, other

    cs.SI cs.AI cs.DS cs.LG

    A Comprehensive Survey on Graph Reduction: Sparsification, Coarsening, and Condensation

    Authors: Mohammad Hashemi, Shengbo Gong, Juntong Ni, Wenqi Fan, B. Aditya Prakash, Wei **

    Abstract: Many real-world datasets can be naturally represented as graphs, spanning a wide range of domains. However, the increasing complexity and size of graph datasets present significant challenges for analysis and computation. In response, graph reduction, or graph summarization, has gained prominence for simplifying large graphs while preserving essential properties. In this survey, we aim to provide… ▽ More

    Submitted 29 June, 2024; v1 submitted 28 January, 2024; originally announced February 2024.

    Comments: Accepted by IJCAI 2024 (This ArXiv version is a long version of our IJCAI paper)

  16. arXiv:2402.02036  [pdf, other

    cs.LG

    Generating In-Distribution Proxy Graphs for Explaining Graph Neural Networks

    Authors: Zhuomin Chen, Jiaxing Zhang, **gchao Ni, Xiaoting Li, Yuchen Bian, Md Mezbahul Islam, Ananda Mohan Mondal, Hua Wei, Dongsheng Luo

    Abstract: Graph Neural Networks (GNNs) have become a building block in graph data processing, with wide applications in critical domains. The growing needs to deploy GNNs in high-stakes applications necessitate explainability for users in the decision-making processes. A popular paradigm for the explainability of GNNs is to identify explainable subgraphs by comparing their labels with the ones of original g… ▽ More

    Submitted 29 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted to International Conference on Machine Learning (ICML 2024)

  17. arXiv:2402.01739  [pdf, other

    cs.CL cs.AI cs.DC cs.LG

    OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models

    Authors: Fuzhao Xue, Zian Zheng, Yao Fu, **jie Ni, Zangwei Zheng, Wangchunshu Zhou, Yang You

    Abstract: To help the open-source community have a better understanding of Mixture-of-Experts (MoE) based large language models (LLMs), we train and release OpenMoE, a series of fully open-sourced and reproducible decoder-only MoE LLMs, ranging from 650M to 34B parameters and trained on up to over 1T tokens. Our investigation confirms that MoE-based LLMs can offer a more favorable cost-effectiveness trade-o… ▽ More

    Submitted 27 March, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

  18. arXiv:2401.17865  [pdf, other

    cs.LG cs.AI

    Manipulating Predictions over Discrete Inputs in Machine Teaching

    Authors: Xiaodong Wu, Yufei Han, Hayssam Dahrouj, Jianbing Ni, Zhenwen Liang, Xiangliang Zhang

    Abstract: Machine teaching often involves the creation of an optimal (typically minimal) dataset to help a model (referred to as the `student') achieve specific goals given by a teacher. While abundant in the continuous domain, the studies on the effectiveness of machine teaching in the discrete domain are relatively limited. This paper focuses on machine teaching in the discrete domain, specifically on man… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 8 pages, 2 figures

    ACM Class: I.2.6

  19. arXiv:2401.12566  [pdf, other

    cs.CL

    Automated Fact-Checking of Climate Change Claims with Large Language Models

    Authors: Markus Leippold, Saeid Ashraf Vaghefi, Dominik Stammbach, Veruska Muccione, Julia Bingler, **gwei Ni, Chiara Colesanti-Senni, Tobias Wekhof, Tobias Schimanski, Glen Gostlow, Tingyu Yu, Juerg Luterbacher, Christian Huggel

    Abstract: This paper presents Climinator, a novel AI-based tool designed to automate the fact-checking of climate change claims. Utilizing an array of Large Language Models (LLMs) informed by authoritative sources like the IPCC reports and peer-reviewed scientific literature, Climinator employs an innovative Mediator-Advocate framework. This design allows Climinator to effectively synthesize varying scienti… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  20. arXiv:2401.10338  [pdf, ps, other

    cs.LG

    MELODY: Robust Semi-Supervised Hybrid Model for Entity-Level Online Anomaly Detection with Multivariate Time Series

    Authors: **gchao Ni, Gauthier Guinet, Peihong Jiang, Laurent Callot, Andrey Kan

    Abstract: In large IT systems, software deployment is a crucial process in online services as their code is regularly updated. However, a faulty code change may degrade the target service's performance and cause cascading outages in downstream services. Thus, software deployments should be comprehensively monitored, and their anomalies should be detected timely. In this paper, we study the problem of anomal… ▽ More

    Submitted 6 June, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  21. arXiv:2401.07237  [pdf, other

    cs.CL cs.AI

    Distilling Event Sequence Knowledge From Large Language Models

    Authors: Somin Wadhwa, Oktie Hassanzadeh, Debarun Bhattacharjya, Ken Barker, Jian Ni

    Abstract: Event sequence models have been found to be highly effective in the analysis and prediction of events. Building such models requires availability of abundant high-quality event sequence data. In certain applications, however, clean structured event sequences are not available, and automated sequence extraction results in data that is too noisy and incomplete. In this work, we explore the use of La… ▽ More

    Submitted 6 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: Under Review

  22. arXiv:2312.17337  [pdf, other

    cs.CL econ.GN

    Exploring Nature: Datasets and Models for Analyzing Nature-Related Disclosures

    Authors: Tobias Schimanski, Chiara Colesanti Senni, Glen Gostlow, **gwei Ni, Tingyu Yu, Markus Leippold

    Abstract: Nature is an amorphous concept. Yet, it is essential for the planet's well-being to understand how the economy interacts with it. To address the growing demand for information on corporate nature disclosure, we provide datasets and classifiers to detect nature communication by companies. We ground our approach in the guidelines of the Taskforce on Nature-related Financial Disclosures (TNFD). Parti… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  23. arXiv:2311.10255  [pdf, other

    cs.LG q-bio.PE

    FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems

    Authors: Shiyuan Luo, Juntong Ni, Shengyu Chen, Runlong Yu, Yiqun Xie, Licheng Liu, Zhenong **, Huaxiu Yao, Xiaowei Jia

    Abstract: Modeling environmental ecosystems is critical for the sustainability of our planet, but is extremely challenging due to the complex underlying processes driven by interactions amongst a large number of physical variables. As many variables are difficult to measure at large scales, existing works often utilize a combination of observable features and locally available measurements or modeled values… ▽ More

    Submitted 19 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  24. arXiv:2311.09114  [pdf, other

    cs.CL cs.AI cs.LG

    Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification

    Authors: Haoqiang Kang, Juntong Ni, Huaxiu Yao

    Abstract: Large Language Models (LLMs) have demonstrated remarkable proficiency in generating fluent text. However, they often encounter the challenge of generating inaccurate or hallucinated content. This issue is common in both non-retrieval-based generation and retrieval-augmented generation approaches, and existing post-hoc rectification methods may not address the accumulated hallucination errors that… ▽ More

    Submitted 24 February, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  25. arXiv:2311.07912  [pdf, other

    cs.CV eess.SP

    Detection of Small Targets in Sea Clutter Based on RepVGG and Continuous Wavelet Transform

    Authors: **gchen Ni, Haoru Li, Lilin Xu, **g Liang

    Abstract: Constructing a high-performance target detector under the background of sea clutter is always necessary and important. In this work, we propose a RepVGGA0-CWT detector, where RepVGG is a residual network that gains a high detection accuracy. Different from traditional residual networks, RepVGG keeps an acceptable calculation speed. Giving consideration to both accuracy and speed, the RepVGGA0 is s… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  26. arXiv:2311.05800  [pdf, other

    cs.IR cs.AI cs.CL

    Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval

    Authors: Nandan Thakur, Jianmo Ni, Gustavo Hernández Ábrego, John Wieting, Jimmy Lin, Daniel Cer

    Abstract: There has been limited success for dense retrieval models in multilingual retrieval, due to uneven and scarce training data available across multiple languages. Synthetic training data generation is promising (e.g., InPars or Promptagator), but has been investigated only for English. Therefore, to study model capabilities across both cross-lingual and monolingual retrieval tasks, we develop SWIM-I… ▽ More

    Submitted 15 April, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted at NAACL 2024. Data released at https://github.com/google-research-datasets/swim-ir

  27. arXiv:2311.00457  [pdf, other

    cs.CV

    Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture

    Authors: Yixin Chen, Junfeng Ni, Nan Jiang, Yaowei Zhang, Yixin Zhu, Siyuan Huang

    Abstract: Reconstructing detailed 3D scenes from single-view images remains a challenging task due to limitations in existing approaches, which primarily focus on geometric shape recovery, overlooking object appearances and fine shape details. To address these challenges, we propose a novel framework for simultaneous high-fidelity recovery of object shapes and textures from single-view images. Our approach… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 3DV 2024, project page: https://dali-jack.github.io/SSR/

  28. arXiv:2310.18345  [pdf, other

    cs.CL cs.AI

    A Survey on Semantic Processing Techniques

    Authors: Rui Mao, Kai He, Xulang Zhang, Guanyi Chen, **jie Ni, Zonglin Yang, Erik Cambria

    Abstract: Semantic processing is a fundamental research domain in computational linguistics. In the era of powerful pre-trained language models and large language models, the advancement of research in this domain appears to be decelerating. However, the study of semantics is multi-dimensional in linguistics. The research depth and breadth of computational semantic processing can be largely improved with ne… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Published at Information Fusion, Volume 101, 2024, 101988, ISSN 1566-2535. The equal contribution mark is missed in the published version due to the publication policies. Please contact Prof. Erik Cambria for details

  29. arXiv:2310.09983  [pdf, other

    cs.LG cs.AI cs.CL cs.IR

    Farzi Data: Autoregressive Data Distillation

    Authors: Noveen Sachdeva, Zexue He, Wang-Cheng Kang, Jianmo Ni, Derek Zhiyuan Cheng, Julian McAuley

    Abstract: We study data distillation for auto-regressive machine learning tasks, where the input and output have a strict left-to-right causal structure. More specifically, we propose Farzi, which summarizes an event sequence dataset into a small number of synthetic sequences -- Farzi Data -- which are optimized to maintain (if not improve) model performance compared to training on the full dataset. Under t… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Under review. 23 pages, 9 figures

  30. arXiv:2310.00402  [pdf, other

    cs.IR cs.DB

    DiskANN++: Efficient Page-based Search over Isomorphic Mapped Graph Index using Query-sensitivity Entry Vertex

    Authors: Jiongkang Ni, Xiaoliang Xu, Yuxiang Wang, Can Li, Jiajie Yao, Shihai Xiao, Xuecang Zhang

    Abstract: Given a vector dataset $\mathcal{X}$ and a query vector $\vec{x}_q$, graph-based Approximate Nearest Neighbor Search (ANNS) aims to build a graph index $G$ and approximately return vectors with minimum distances to $\vec{x}_q$ by searching over $G$. The main drawback of graph-based ANNS is that a graph index would be too large to fit into the memory especially for a large-scale $\mathcal{X}$. To s… ▽ More

    Submitted 30 November, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: 15 pages including references

  31. arXiv:2309.13604  [pdf, other

    cs.CV cs.AI

    Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation

    Authors: Jiayi Ni, Senqiao Yang, Ran Xu, Jiaming Liu, Xiaoqi Li, Wenyu Jiao, Zehui Chen, Yi Liu, Shanghang Zhang

    Abstract: Since autonomous driving systems usually face dynamic and ever-changing environments, continual test-time adaptation (CTTA) has been proposed as a strategy for transferring deployed models to continually changing target domains. However, the pursuit of long-term adaptation often introduces catastrophic forgetting and error accumulation problems, which impede the practical implementation of CTTA in… ▽ More

    Submitted 29 March, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

  32. Stochastic Deep Koopman Model for Quality Propagation Analysis in Multistage Manufacturing Systems

    Authors: Zhiyi Chen, Harshal Maske, Huanyi Shui, Devesh Upadhyay, Michael Hopka, Joseph Cohen, Xingjian Lai, Xun Huan, Jun Ni

    Abstract: The modeling of multistage manufacturing systems (MMSs) has attracted increased attention from both academia and industry. Recent advancements in deep learning methods provide an opportunity to accomplish this task with reduced cost and expertise. This study introduces a stochastic deep Koopman (SDK) framework to model the complex behavior of MMSs. Specifically, we present a novel application of K… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Journal ref: Journal of Manufacturing Systems 71 (2023) 609-619

  33. arXiv:2308.15027  [pdf, ps, other

    cs.IR cs.CL

    Improving Neural Ranking Models with Traditional IR Methods

    Authors: Anik Saha, Oktie Hassanzadeh, Alex Gittens, Jian Ni, Kavitha Srinivas, Bulent Yener

    Abstract: Neural ranking methods based on large transformer models have recently gained significant attention in the information retrieval community, and have been adopted by major commercial solutions. Nevertheless, they are computationally expensive to create, and require a great deal of labeled data for specialized corpora. In this paper, we explore a low resource alternative which is a bag-of-embedding… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Short paper, 4 pages

  34. arXiv:2308.03891  [pdf, other

    cs.CL

    A Cross-Domain Evaluation of Approaches for Causal Knowledge Extraction

    Authors: Anik Saha, Oktie Hassanzadeh, Alex Gittens, Jian Ni, Kavitha Srinivas, Bulent Yener

    Abstract: Causal knowledge extraction is the task of extracting relevant causes and effects from text by detecting the causal relation. Although this task is important for language understanding and knowledge discovery, recent works in this domain have largely focused on binary classification of a text segment as causal or non-causal. In this regard, we perform a thorough analysis of three sequence tagging… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  35. arXiv:2307.15770  [pdf, other

    cs.CL cs.AI

    CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools

    Authors: **gwei Ni, Julia Bingler, Chiara Colesanti-Senni, Mathias Kraus, Glen Gostlow, Tobias Schimanski, Dominik Stammbach, Saeid Ashraf Vaghefi, Qian Wang, Nicolas Webersinke, Tobias Wekhof, Tingyu Yu, Markus Leippold

    Abstract: In the face of climate change, are companies really taking substantial steps toward more sustainable operations? A comprehensive answer lies in the dense, information-rich landscape of corporate sustainability reports. However, the sheer volume and complexity of these reports make human analysis very costly. Therefore, only a few entities worldwide have the resources to analyze these reports at sc… ▽ More

    Submitted 11 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: 6 pages. arXiv admin note: text overlap with arXiv:2306.15518

  36. arXiv:2307.14192  [pdf, other

    cs.CR cs.AI

    Unveiling Security, Privacy, and Ethical Concerns of ChatGPT

    Authors: Xiaodong Wu, Ran Duan, Jianbing Ni

    Abstract: This paper delves into the realm of ChatGPT, an AI-powered chatbot that utilizes topic modeling and reinforcement learning to generate natural responses. Although ChatGPT holds immense promise across various industries, such as customer service, education, mental health treatment, personal productivity, and content creation, it is essential to address its security, privacy, and ethical implication… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  37. arXiv:2307.13259  [pdf, other

    cs.CV

    GaitFormer: Revisiting Intrinsic Periodicity for Gait Recognition

    Authors: Qian Wu, Ruixuan Xiao, Kaixin Xu, **gcheng Ni, Boxun Li, Ziyao Xu

    Abstract: Gait recognition aims to distinguish different walking patterns by analyzing video-level human silhouettes, rather than relying on appearance information. Previous research on gait recognition has primarily focused on extracting local or global spatial-temporal representations, while overlooking the intrinsic periodic features of gait sequences, which, when fully utilized, can significantly enhanc… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  38. arXiv:2307.10511  [pdf, other

    cs.CL

    General Debiasing for Multimodal Sentiment Analysis

    Authors: Teng Sun, Juntong Ni, Wenjie Wang, Liqiang **g, Yinwei Wei, Liqiang Nie

    Abstract: Existing work on Multimodal Sentiment Analysis (MSA) utilizes multimodal information for prediction yet unavoidably suffers from fitting the spurious correlations between multimodal features and sentiment labels. For example, if most videos with a blue background have positive labels in a dataset, the model will rely on such correlations for prediction, while "blue background" is not a sentiment-r… ▽ More

    Submitted 7 August, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted at ACM MM 2023

  39. arXiv:2307.04316  [pdf, ps, other

    cs.CR

    Accelerating Secure and Verifiable Data Deletion in Cloud Storage via SGX and Blockchain

    Authors: Xiangman Li, Jianbing Ni

    Abstract: Secure data deletion enables data owners to fully control the erasure of their data stored on local or cloud data centers and is essential for preventing data leakage, especially for cloud storage. However, traditional data deletion based on unlinking, overwriting, and cryptographic key management either ineffectiveness in cloud storage or rely on unpractical assumption. In this paper, we present… ▽ More

    Submitted 3 August, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

    Comments: It has some technical problems, we need to address it before publishing. Many thanks

  40. arXiv:2307.03638   

    cs.MM cs.HC

    Physical-aware Cross-modal Adversarial Network for Wearable Sensor-based Human Action Recognition

    Authors: Jianyuan Ni, Hao Tang, Anne H. H. Ngu, Gaowen Liu, Yan Yan

    Abstract: Wearable sensor-based Human Action Recognition (HAR) has made significant strides in recent times. However, the accuracy performance of wearable sensor-based HAR is currently still lagging behind that of visual modalities-based systems, such as RGB video and depth data. Although diverse input modalities can provide complementary cues and improve the accuracy performance of HAR, wearable devices ca… ▽ More

    Submitted 19 May, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: We will be making some significant changes to the paper, including the title and methodology. We therefore wish to withdraw the paper for now

  41. arXiv:2306.16645  [pdf, other

    cs.CV cs.MM

    Deep Equilibrium Multimodal Fusion

    Authors: **hong Ni, Yalong Bai, Wei Zhang, Ting Yao, Tao Mei

    Abstract: Multimodal fusion integrates the complementary information present in multiple modalities and has gained much attention recently. Most existing fusion approaches either learn a fixed fusion strategy during training and inference, or are only capable of fusing the information to a certain extent. Such solutions may fail to fully capture the dynamics of interactions across modalities especially when… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  42. arXiv:2306.15518   

    cs.CL

    Paradigm Shift in Sustainability Disclosure Analysis: Empowering Stakeholders with CHATREPORT, a Language Model-Based Tool

    Authors: **gwei Ni, Julia Bingler, Chiara Colesanti-Senni, Mathias Kraus, Glen Gostlow, Tobias Schimanski, Dominik Stammbach, Saeid Ashraf Vaghefi, Qian Wang, Nicolas Webersinke, Tobias Wekhof, Tingyu Yu, Markus Leippold

    Abstract: This paper introduces a novel approach to enhance Large Language Models (LLMs) with expert knowledge to automate the analysis of corporate sustainability reports by benchmarking them against the Task Force for Climate-Related Financial Disclosures (TCFD) recommendations. Corporate sustainability reports are crucial in assessing organizations' environmental and social risks and impacts. However, an… ▽ More

    Submitted 16 November, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: A new version of the ChatReport paper: arXiv:2307.15770

  43. arXiv:2305.17524  [pdf, other

    cs.CR

    Secure and Privacy-preserving Network Slicing in 3GPP 5G System Architecture

    Authors: Xiangman Li, Miao He, Jianbing Ni

    Abstract: Network slicing in 3GPP 5G system architecture has introduced significant improvements in the flexibility and efficiency of mobile communication. However, this new functionality poses challenges in maintaining the privacy of mobile users, especially in multi-hop environments. In this paper, we propose a secure and privacy-preserving network slicing protocol (SPNS) that combines 5G network slicing… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  44. arXiv:2305.17521  [pdf, other

    cs.CR

    Privacy-Preserving Model Aggregation for Asynchronous Federated Learning

    Authors: Jianxiang Zhao, Xiangman Li, Jianbing Ni

    Abstract: We present a novel privacy-preserving model aggregation for asynchronous federated learning, named PPA-AFL that removes the restriction of synchronous aggregation of local model updates in federated learning, while enabling the protection of the local model updates against the server. In PPA-AFL, clients can proactive decide when to engage in the training process, and sends local model updates to… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  45. arXiv:2305.14380  [pdf, other

    cs.LG cs.CL

    Finding the Pillars of Strength for Multi-Head Attention

    Authors: **jie Ni, Rui Mao, Zonglin Yang, Han Lei, Erik Cambria

    Abstract: Recent studies have revealed some issues of Multi-Head Attention (MHA), e.g., redundancy and over-parameterization. Specifically, the heads of MHA were originally designed to attend to information from different representation subspaces, whereas prior studies found that some attention heads likely learn similar features and can be pruned without harming performance. Inspired by the minimum-redunda… ▽ More

    Submitted 15 October, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL 2023)

    ACM Class: I.2.0; I.2.7

  46. arXiv:2305.14007  [pdf, other

    cs.CL

    When Does Aggregating Multiple Skills with Multi-Task Learning Work? A Case Study in Financial NLP

    Authors: **gwei Ni, Zhi**g **, Qian Wang, Mrinmaya Sachan, Markus Leippold

    Abstract: Multi-task learning (MTL) aims at achieving a better model by leveraging data and knowledge from multiple tasks. However, MTL does not always work -- sometimes negative transfer occurs between tasks, especially when aggregating loosely related skills, leaving it an open question when MTL works. Previous studies show that MTL performance can be improved by algorithmic tricks. However, what tasks an… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  47. arXiv:2305.06474  [pdf, other

    cs.IR cs.LG

    Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction

    Authors: Wang-Cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng

    Abstract: Large Language Models (LLMs) have demonstrated exceptional capabilities in generalizing to new tasks in a zero-shot or few-shot manner. However, the extent to which LLMs can comprehend user preferences based on their previous behavior remains an emerging and still unclear research question. Traditionally, Collaborative Filtering (CF) has been the most effective method for these tasks, predominantl… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  48. arXiv:2305.05432  [pdf, other

    cs.CL cs.CV

    WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset

    Authors: Andrea Burns, Krishna Srinivasan, Joshua Ainslie, Geoff Brown, Bryan A. Plummer, Kate Saenko, Jianmo Ni, Mandy Guo

    Abstract: Webpages have been a rich resource for language and vision-language tasks. Yet only pieces of webpages are kept: image-caption pairs, long text articles, or raw HTML, never all in one place. Webpage tasks have resultingly received little attention and structured image-text data underused. To study multimodal webpage understanding, we introduce the Wikipedia Webpage 2M (WikiWeb2M) suite; the first… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted at the WikiWorkshop 2023. Data is readily available at https://github.com/google-research-datasets/wit/blob/main/wikiweb2m.md. arXiv admin note: text overlap with arXiv:2305.03668

  49. arXiv:2305.03668  [pdf, other

    cs.CL cs.CV

    A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

    Authors: Andrea Burns, Krishna Srinivasan, Joshua Ainslie, Geoff Brown, Bryan A. Plummer, Kate Saenko, Jianmo Ni, Mandy Guo

    Abstract: Webpages have been a rich, scalable resource for vision-language and language only tasks. Yet only pieces of webpages are kept in existing datasets: image-caption pairs, long text articles, or raw HTML, never all in one place. Webpage tasks have resultingly received little attention and structured image-text data left underused. To study multimodal webpage understanding, we introduce the Wikipedia… ▽ More

    Submitted 20 October, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted in EMNLP 2023, revision contains camera ready edits. Data can be downloaded at https://github.com/google-research-datasets/wit/blob/main/wikiweb2m.md

  50. arXiv:2304.05510  [pdf, other

    cs.CL

    chatClimate: Grounding Conversational AI in Climate Science

    Authors: Saeid Ashraf Vaghefi, Qian Wang, Veruska Muccione, **gwei Ni, Mathias Kraus, Julia Bingler, Tobias Schimanski, Chiara Colesanti-Senni, Nicolas Webersinke, Christrian Huggel, Markus Leippold

    Abstract: Large Language Models (LLMs) have made significant progress in recent years, achieving remarkable results in question-answering tasks (QA). However, they still face two major challenges: hallucination and outdated information after the training phase. These challenges take center stage in critical domains like climate change, where obtaining accurate and up-to-date information from reliable source… ▽ More

    Submitted 28 April, 2023; v1 submitted 11 April, 2023; originally announced April 2023.