Showing 1–50 of 161 results for author: Bing, L

  1. arXiv:2406.17294  [pdf, other]

    cs.CL

    Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

    Authors: Wenhao Shi, Zhiqiang Hu, Yi Bin, Junhua Liu, Yang Yang, See-Kiong Ng, Lidong Bing, Roy Ka-Wei Lee

    Abstract: Large language models (LLMs) have demonstrated impressive reasoning capabilities, particularly in textual mathematical problem-solving. However, existing open-source image instruction fine-tuning datasets, containing limited question-answer pairs per image, do not fully exploit visual information to enhance the multimodal mathematical reasoning capabilities of Multimodal LLMs (MLLMs). To bridge th…

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 8 pages

  2. arXiv:2406.07476  [pdf, other]

    cs.CV cs.CL

    VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

    Authors: Zesen Cheng, Sicong Leng, Hang Zhang, Yifei Xin, Xin Li, Guanzheng Chen, Yongxin Zhu, Wenqi Zhang, Ziyang Luo, Deli Zhao, Lidong Bing

    Abstract: In this paper, we present the VideoLLaMA 2, a set of Video Large Language Models (Video-LLMs) designed to enhance spatial-temporal modeling and audio understanding in video and audio-oriented tasks. Building upon its predecessor, VideoLLaMA 2 incorporates a tailor-made Spatial-Temporal Convolution (STC) connector, which effectively captures the intricate spatial and temporal dynamics of video data…

    Submitted 17 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: ZC, SL, HZ, YX, and XL contributed equally to this project

  3. arXiv:2405.20267  [pdf, other]

    cs.CL

    Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions

    Authors: Ruochen Zhao, Wenxuan Zhang, Yew Ken Chia, Deli Zhao, Lidong Bing

    Abstract: As LLMs evolve on a daily basis, there is an urgent need for a trustworthy evaluation method that can provide robust evaluation results in a timely fashion. Currently, as static benchmarks are prone to contamination concerns, users tend to trust human voting platforms, such as Chatbot Arena. However, human annotations require extensive manual efforts. To provide an automatic, robust, and trustwort…

    Submitted 12 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  4. arXiv:2405.18290  [pdf, other]

    astro-ph.GA

    Probabilistic and progressive deblended far-infrared and sub-millimetre point source catalogues I. Methodology and first application in the COSMOS field

    Authors: Lingyu Wang, Antonio La Marca, Fangyou Gao, William J. Pearson, Berta Margalef-Bentabol, Matthieu Béthermin, Longji Bing, James Donnellan, Peter D. Hurley, Seb J. Oliver, Catherine L. Hale, Matt J. Jarvis, Lucia Marchetti, Mattia Vaccari, Imogen H. Whittam

    Abstract: Single-dish far-infrared (far-IR) and sub-millimetre (sub-mm) point source catalogues and their connections with catalogues at other wavelengths are of paramount importance. However, due to the large mismatch in spatial resolution, cross-matching galaxies at different wavelengths is challenging. This work aims to develop the next-generation deblended far-IR and sub-mm catalogues and present the fi…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 23 pages, 30 figures, accepted for publication in A&A. Catalogues can be downloaded from https://hedam.lam.fr/HELP/dataproducts/dmu26/dmu26_XID+COSMOS2024/

  5. arXiv:2404.12872  [pdf, other]

    cs.DB cs.CL

    LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency

    Authors: Zhaodonghui Li, Haitao Yuan, Huiming Wang, Gao Cong, Lidong Bing

    Abstract: Query rewrite, which aims to generate more efficient queries by altering a SQL query's structure without changing the query result, has been an important research problem. In order to maintain equivalence between the rewritten query and the original one during rewriting, traditional query rewrite methods always rewrite the queries following certain rewrite rules. However, some problems still remai…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 12 pages

  6. arXiv:2404.06935  [pdf, other]

    astro-ph.IM astro-ph.GA

    Overcoming Confusion Noise with Hyperspectral Imaging from PRIMAger

    Authors: James M. S. Donnellan, Seb J. Oliver, Matthieu Bethermin, Longji Bing, Alberto Bolatto, Charles M. Bradford, Denis Burgarella, Laure Ciesla, Jason Glenn, Alexandra Pope, Stephen Serjeant, Raphael Shirley, JD T. Smith, Chris Sorrell

    Abstract: The PRobe far-Infrared Mission for Astrophysics (PRIMA) concept aims to perform mapping with spectral coverage and sensitivities inaccessible to previous FIR space telescopes. PRIMA's imaging instrument, PRIMAger, provides unique hyperspectral imaging simultaneously covering 25-235 $μ$m. We synthesise images representing a deep, 1500 hr deg$^{-2}$ PRIMAger survey, with realistic instrumental and c…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 14 pages, 11 figures

  7. arXiv:2404.00570  [pdf, other]

    cs.CL

    ParaICL: Towards Robust Parallel In-Context Learning

    Authors: Xingxuan Li, Xuan-Phi Nguyen, Shafiq Joty, Lidong Bing

    Abstract: Large language models (LLMs) have become the norm in natural language processing (NLP), excelling in few-shot in-context learning (ICL) with their remarkable abilities. Nonetheless, the success of ICL largely hinges on the choice of few-shot demonstration examples, making the selection process increasingly crucial. Existing methods have delved into optimizing the quantity and semantic similarity o…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Work in progress

  8. arXiv:2403.13315  [pdf, other]

    cs.CV

    PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns

    Authors: Yew Ken Chia, Vernon Toh Yan Han, Deepanway Ghosal, Lidong Bing, Soujanya Poria

    Abstract: Large multimodal models extend the impressive capabilities of large language models by integrating multimodal understanding abilities. However, it is not clear how they can emulate the general intelligence and reasoning ability of humans. As recognizing patterns and abstracting concepts are key to general intelligence, we introduce PuzzleVQA, a collection of puzzles based on abstract patterns. Wit…

    Submitted 30 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  9. arXiv:2403.10258  [pdf, other]

    cs.CL

    Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models

    Authors: Chaoqun Liu, Wenxuan Zhang, Yiran Zhao, Anh Tuan Luu, Lidong Bing

    Abstract: Large language models (LLMs) have demonstrated multilingual capabilities; yet, they are mostly English-centric due to the imbalanced training corpora. Existing works leverage this phenomenon to improve their multilingual performances through translation, primarily on natural language processing (NLP) tasks. This work extends the evaluation from NLP tasks to real user queries and from English-centr…

    Submitted 20 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 19 pages

  10. arXiv:2403.00911  [pdf, other]

    astro-ph.GA

    Faint millimeter NIKA2 dusty star-forming galaxies: finding the high-redshift population

    Authors: L. -J. Bing, A. Beelen, G. Lagache, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, A. Benoît, S. Berta, M. Béthermin, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, A. Gomez, J. Goupy, F. Kéruzoré, C. Kramer, B. Ladjelate, S. Leclercq, et al. (24 additional authors not shown)

    Abstract: We develop a new framework to constrain the source redshift. The method jointly accounts for the detection/non-detection of spectral lines and the prior information from the photometric redshift and total infrared luminosity from spectral energy distribution analysis. The method uses the estimated total infrared luminosity to predict the line fluxes at given redshifts and generates model spectra.…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: A&A in press

  11. arXiv:2402.18913  [pdf, other]

    cs.CL cs.AI

    AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging

    Authors: Yiran Zhao, Wenxuan Zhang, Huiming Wang, Kenji Kawaguchi, Lidong Bing

    Abstract: As an effective alternative to the direct fine-tuning on target tasks in specific languages, cross-lingual transfer addresses the challenges of limited training data by decoupling "task ability" and "language ability" by fine-tuning on the target task in the source language and another selected task in the target language, respectively. However, they fail to fully separate the task ability fro…

    Submitted 29 February, 2024; originally announced February 2024.

  12. arXiv:2402.18815  [pdf, other]

    cs.CL cs.AI

    How do Large Language Models Handle Multilingualism?

    Authors: Yiran Zhao, Wenxuan Zhang, Guizhen Chen, Kenji Kawaguchi, Lidong Bing

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities across diverse languages. This study explores how LLMs handle multilingualism. Based on observed language ratio shifts among layers and the relationships between network structures and certain capabilities, we hypothesize the LLM's multilingual workflow ($\texttt{MWork}$): LLMs initially understand the query, converting multili…

    Submitted 24 May, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  13. arXiv:2312.00738  [pdf, other]

    cs.CL

    SeaLLMs -- Large Language Models for Southeast Asia

    Authors: Xuan-Phi Nguyen, Wenxuan Zhang, Xin Li, Mahani Aljunied, Zhiqiang Hu, Chenhui Shen, Yew Ken Chia, Xingxuan Li, Jianyu Wang, Qingyu Tan, Liying Cheng, Guanzheng Chen, Yue Deng, Sen Yang, Chaoqun Liu, Hang Zhang, Lidong Bing

    Abstract: Despite the remarkable achievements of large language models (LLMs) in various tasks, there remains a linguistic bias that favors high-resource languages, such as English, often at the expense of low-resource and regional languages. To address this imbalance, we introduce SeaLLMs, an innovative series of language models that specifically focuses on Southeast Asian (SEA) languages. SeaLLMs are buil…

    Submitted 1 July, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Technical report, ACL 2024 DEMO TRACK

  14. arXiv:2311.16922  [pdf, other]

    cs.CV cs.AI cs.CL

    Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

    Authors: Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing

    Abstract: Large Vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned. Despite their success, LVLMs still suffer from the issue of object hallucinations, where models generate plausible yet incorrect outputs that include objects that do not exist in the images. To mitig…

    Submitted 28 November, 2023; originally announced November 2023.

  15. arXiv:2311.09821  [pdf, other]

    cs.CL

    Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning

    Authors: Qingyu Tan, Hwee Tou Ng, Lidong Bing

    Abstract: Knowledge in the real world is being updated constantly. However, it is costly to frequently update large language models (LLMs). Therefore, it is crucial for LLMs to understand the concept of temporal knowledge. However, prior works on temporal question answering did not emphasize multi-answer and multi-hop types of temporal reasoning. In this paper, we propose a complex temporal question-answeri…

    Submitted 16 November, 2023; originally announced November 2023.

  16. arXiv:2311.09802  [pdf, other]

    cs.AI cs.CL

    Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs

    Authors: Sen Yang, Xin Li, Leyang Cui, Lidong Bing, Wai Lam

    Abstract: Though prompting LLMs with various reasoning structures produces reasoning proofs along with answers, these proofs are not ensured to be causal and reliable due to the inherent defects of LLMs. Tracking such deficiencies, we present a neuro-symbolic integration method, in which a neural LLM is used to represent the knowledge of the problem while an LLM-free symbolic solver is adopted to do deliber…

    Submitted 16 November, 2023; originally announced November 2023.

  17. arXiv:2311.09277  [pdf, other]

    cs.CL

    Contrastive Chain-of-Thought Prompting

    Authors: Yew Ken Chia, Guizhen Chen, Luu Anh Tuan, Soujanya Poria, Lidong Bing

    Abstract: Despite the success of chain of thought in enhancing language model reasoning, the underlying process remains less well understood. Although logically sound reasoning appears inherently crucial for chain of thought, prior studies surprisingly reveal minimal impact when using invalid demonstrations instead. Furthermore, the conventional chain of thought does not inform language models on what mista…

    Submitted 15 November, 2023; originally announced November 2023.

  18. arXiv:2311.09022  [pdf, other]

    cs.CL

    Exploring the Potential of Large Language Models in Computational Argumentation

    Authors: Guizhen Chen, Liying Cheng, Luu Anh Tuan, Lidong Bing

    Abstract: Computational argumentation has become an essential tool in various domains, including law, public policy, and artificial intelligence. It is an emerging research field in natural language processing that attracts increasing attention. Research on computational argumentation mainly involves two types of tasks: argument mining and argument generation. As large language models (LLMs) have demonstrat…

    Submitted 1 July, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted at ACL 2024 Main

  19. arXiv:2311.02205  [pdf, other]

    cs.CL

    An Introduction to Natural Language Processing Techniques and Framework for Clinical Implementation in Radiation Oncology

    Authors: Reza Khanmohammadi, Mohammad M. Ghassemi, Kyle Verdecchia, Ahmed I. Ghanem, Luo Bing, Indrin J. Chetty, Hassan Bagher-Ebadian, Farzan Siddiqui, Mohamed Elshaikh, Benjamin Movsas, Kundan Thind

    Abstract: Natural Language Processing (NLP) is a key technique for developing Medical Artificial Intelligence (AI) systems that leverage Electronic Health Record (EHR) data to build diagnostic and prognostic models. NLP enables the conversion of unstructured clinical text into structured data that can be fed into AI algorithms. The emergence of the transformer architecture and large language models (LLMs) h…

    Submitted 8 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

  20. arXiv:2310.17924  [pdf, other]

    cs.CL

    SOUL: Towards Sentiment and Opinion Understanding of Language

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: Sentiment analysis is a well-established natural language processing task, with sentiment polarity classification being one of its most popular and representative tasks. However, despite the success of pre-trained language models in this area, they often fall short of capturing the broader complexities of sentiment analysis. To address this issue, we propose a new task called Sentiment and Opinion…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference, Short Paper

  21. arXiv:2310.16526  [pdf, other]

    astro-ph.SR astro-ph.GA

    NIKA2 observations of dust grain evolution from star-forming filament to T-Tauri disk: Preliminary results from NIKA2 observations of the Taurus B211/B213 filament

    Authors: Q. Nguyen-Luong, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, G. Ejlali, A. Gomez, J. Goupy, C. Hanser, S. Katsioli, F. Kéruzoré, C. Kramer, et al. (29 additional authors not shown)

    Abstract: To understand the evolution of dust properties in molecular clouds in the course of the star formation process, we constrain the changes in the dust emissivity index from star-forming filaments to prestellar and protostellar cores to T Tauri stars. Using the NIKA2 continuum camera on the IRAM 30 m telescope, we observed the Taurus B211/B213 filament at 1.2 mm and 2 mm with unprecedented sensitiv…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Report number: 00035

    Journal ref: https://ui.adsabs.harvard.edu/abs/2023arXiv231016526N/abstract

  22. arXiv:2310.16450  [pdf, other]

    cs.CL

    CLEX: Continuous Length Extrapolation for Large Language Models

    Authors: Guanzheng Chen, Xin Li, Zaiqiao Meng, Shangsong Liang, Lidong Bing

    Abstract: Transformer-based Large Language Models (LLMs) are pioneering advances in many natural language processing tasks, however, their exceptional capabilities are restricted within the preset context window of Transformer. Position Embedding (PE) scaling methods, while effective in extending the context window to a specific length, demonstrate either notable limitations in their extrapolation abilities…

    Submitted 24 March, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  23. arXiv:2310.14709  [pdf, other]

    cs.CL

    Once Upon a $\textit{Time}$ in $\textit{Graph}$: Relative-Time Pretraining for Complex Temporal Reasoning

    Authors: Sen Yang, Xin Li, Lidong Bing, Wai Lam

    Abstract: Our physical world is constantly evolving over time, rendering challenges for pre-trained language models to understand and reason over the temporal contexts of texts. Existing work focuses on strengthening the direct association between a piece of text and its time-stamp. However, the knowledge-time association is usually insufficient for the downstream tasks that require reasoning over temporal…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 main

  24. arXiv:2310.10962  [pdf, other]

    cs.CL

    Large Language Models can Contrastively Refine their Generation for Better Sentence Representation Learning

    Authors: Huiming Wang, Zhaodonghui Li, Liying Cheng, Soh De Wen, Lidong Bing

    Abstract: Recently, large language models (LLMs) have emerged as a groundbreaking technology and their unparalleled text generation capabilities have sparked interest in their application to the fundamental sentence representation learning task. Existing methods have explored utilizing LLMs as data annotators to generate synthesized data for training contrastive learning based sentence embedding models such…

    Submitted 17 May, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: NAACL 2024

  25. arXiv:2310.07400  [pdf, other]

    astro-ph.CO

    Towards the first mean pressure profile estimate with the NIKA2 Sunyaev-Zeldovich Large Program

    Authors: C. Hanser, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, I. Bartalucci, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, G. Ejlali, A. Ferragamo, A. Gomez, J. Goupy, S. Katsioli, F. Kéruzoré, et al. (29 additional authors not shown)

    Abstract: High-resolution mapping of the hot gas in galaxy clusters is a key tool for cluster-based cosmological analyses. Taking advantage of the NIKA2 millimeter camera operated at the IRAM 30-m telescope, the NIKA2 SZ Large Program seeks to get a high-resolution follow-up of 38 galaxy clusters covering a wide mass range at intermediate to high redshift. The measured SZ fluxes will be essential to calibra…

    Submitted 13 December, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

  26. arXiv:2310.06474  [pdf, other]

    cs.CL

    Multilingual Jailbreak Challenges in Large Language Models

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: While large language models (LLMs) exhibit remarkable capabilities across a wide range of tasks, they pose potential safety concerns, such as the "jailbreak" problem, wherein malicious instructions can manipulate LLMs to exhibit undesirable behavior. Although several preventive measures have been developed to mitigate the potential risks associated with LLMs, they have primarily focused on Engli…

    Submitted 3 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  27. IAS/CEA Evolution of Dust in Nearby Galaxies (ICED): the spatially-resolved dust properties of NGC4254

    Authors: L. Pantoni, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, M. Baes, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, G. Ejlali, F. Galliano, A. Gomez, J. Goupy, A. P. Jones, C. Hanser, et al. (35 additional authors not shown)

    Abstract: We present the first preliminary results of the project \textit{ICED}, focusing on the face-on galaxy NGC4254. We use the millimetre maps observed with NIKA2 at IRAM-30m, as part of the IMEGIN Guaranteed Time Large Program, and of a wide collection of ancillary data (multi-wavelength photometry and gas phase spectral lines) that are publicly available. We derive the global and local properties of…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Journal ref: EPJ Web of Conferences 293 (2024) 00038

  28. arXiv:2310.06373  [pdf, other]

    astro-ph.CO

    NIKA2 observations of 3 low-mass galaxy clusters at $z \sim 1$: pressure profile and $Y_{\rm SZ}$-$M$ relation

    Authors: R. Adam, M. Ricci, D. Eckert, P. Ade, H. Ajeddig, B. Altieri, P. André, E. Artis, H. Aussel, A. Beelen, C. Benoist, A. Benoît, S. Berta, L. Bing, M. Birkinshaw, O. Bourrion, D. Boutigny, M. Bremer, M. Calvo, A. Cappi, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, et al. (42 additional authors not shown)

    Abstract: Three galaxy clusters selected from the XXL X-ray survey at high redshift and low mass ($z\sim1$ and $M_{500} \sim 1-2 \times 10^{14}$ M$_{\odot}$) were observed with NIKA2 to image their Sunyaev-Zel'dovich effect (SZ) signal. They all present an SZ morphology, together with the comparison with X-ray and optical data, that indicates dynamical activity related to merging events. Despite their distu…

    Submitted 13 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

  29. arXiv:2310.05819  [pdf, other]

    astro-ph.CO astro-ph.GA

    The XXL Survey LI. Pressure profile and $Y_{\rm SZ}$-$M$ scaling relation in three low-mass galaxy clusters at $z\sim1$ observed with NIKA2

    Authors: R. Adam, M. Ricci, D. Eckert, P. Ade, H. Ajeddig, B. Altieri, P. André, E. Artis, H. Aussel, A. Beelen, C. Benoist, A. Benoît, S. Berta, L. Bing, M. Birkinshaw, O. Bourrion, D. Boutigny, M. Bremer, M. Calvo, A. Cappi, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, et al. (42 additional authors not shown)

    Abstract: The thermodynamical properties of the intracluster medium (ICM) are driven by scale-free gravitational collapse, but they also reflect the rich astrophysical processes at play in galaxy clusters. At low masses ($\sim 10^{14}$ M$_{\odot}$) and high redshift ($z \gtrsim 1$), these properties remain poorly constrained observationally, due to the difficulty in obtaining resolved and sensitive data. Th…

    Submitted 28 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 24 pages, published in A&A. Note that the title number had to be changed. This version matches the one from A&A

    Journal ref: A&A 684, A18 (2024)

  30. The NIKA2 Sunyaev-Zeldovich Large Program: Sample and upcoming product public release

    Authors: L. Perotto, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, R. Barrena, I. Bartalucci, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, G. Ejlali, A. Ferragamo, A. Gomez, J. Goupy, C. Hanser, et al. (30 additional authors not shown)

    Abstract: The NIKA2 camera operating at the IRAM 30 m telescope excels in high-angular resolution mapping of the thermal Sunyaev-Zeldovich effect towards galaxy clusters at intermediate and high-redshift. As part of the NIKA2 guaranteed time, the SZ Large Program (LPSZ) aims at tSZ-mapping a representative sample of SZ-selected galaxy clusters in the catalogues of the Planck satellite and of the Atacama Cos…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Journal ref: EPJ Web of Conferences 293 (2024) 00040

  31. Exploring the interstellar medium of NGC 891 at millimeter wavelengths using the NIKA2 camera

    Authors: S. Katsioli, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, M. Baes, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, C. J. R. Clark, I. De Looze, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, G. Ejlali, M. Galametz, F. Galliano, A. Gomez, et al. (39 additional authors not shown)

    Abstract: In the framework of the IMEGIN Large Program, we used the NIKA2 camera on the IRAM 30-m telescope to observe the edge-on galaxy NGC 891 at 1.15 mm and 2 mm and at a FWHM of 11.1" and 17.6", respectively. Multiwavelength data enriched with the new NIKA2 observations fitted by the HerBIE SED code (coupled with the THEMIS dust model) were used to constrain the physical properties of the ISM. Emission…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: To appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Journal ref: EPJ Web of conferences 293 (2024) 00026

  32. Constraining Millimeter Dust Emission in Nearby Galaxies with NIKA2: the case of NGC2146 and NGC2976

    Authors: G. Ejlali, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, M. Baes, A. Beelen, Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, F. Galliano, A. Gomez, J. Goupy, A. P. Jones, C. Hanser, A. Hughes, et al. (35 additional authors not shown)

    Abstract: This study presents the first millimeter continuum mapping observations of two nearby galaxies, the starburst spiral galaxy NGC2146 and the dwarf galaxy NGC2976, at 1.15 mm and 2 mm using the NIKA2 camera on the IRAM 30m telescope, as part of the Guaranteed Time Large Project IMEGIN. These observations provide robust resolved information about the physical properties of dust in nearby galaxies by…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: To appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Journal ref: EPJ Web of conferences 293 (2024) 00016

  33. Systematic effects on the upcoming NIKA2 LPSZ scaling relation

    Authors: A. Moyer-Anin, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, I. Bartalucci, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, G. Ejlali, A. Gomez, J. Goupy, C. Hanser, S. Katsioli, F. Kéruzoré, et al. (27 additional authors not shown)

    Abstract: In cluster cosmology, cluster masses are the main parameter of interest. They are needed to constrain cosmological parameters through the cluster number count. As the mass is not an observable, a scaling relation is needed to link cluster masses to the integrated Compton parameters Y, i.e. the Sunyaev-Zeldovich observable (SZ). Planck cosmological results obtained with cluster number counts are ba…

    Submitted 7 December, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Journal ref: EPJ Web of Conferences 293 (2024) 00032

  34. NIKA2 observations of starless cores in Taurus and Perseus

    Authors: C. Kramer, R. Adam, P. Ade, H. Ajeddig, P. Andre, E. Artis, H. Aussel, A. Beelen, A. Beno, S. Berta, L. Bing, O. Bourrion, M. Calvo, P. Caselli, A. Catalano, M. DePetris, F. -X. Desert, S. Doyle, E. F. C. Driessen, G. Ejlali, A. Fuente, A. Gomez, J. Goupy, C. Hanser, S. Katsioli, et al. (27 additional authors not shown)

    Abstract: Dusty starless cores play an important role in regulating the initial phases of the formation of stars and planets. In their interiors, dust grains coagulate and ice mantles form, thereby changing the millimeter emissivities and hence the ability to cool. We mapped four regions with more than a dozen cores in the nearby Galactic filaments of Taurus and Perseus using the NIKA2 camera at the IRAM 30…

    Submitted 4 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Journal ref: EPJ Web of conferences 293 (2024) 00027

  35. The stratification of ISM properties in the edge-on galaxy NGC 891 revealed by NIKA2

    Authors: S. Katsioli, E. M. Xilouris, C. Kramer, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, M. Baes, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, C. J. R. Clark, I. De Looze, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, G. Ejlali, M. Galametz, et al. (38 additional authors not shown)

    Abstract: As the millimeter wavelength range remains a largely unexplored spectral region for galaxies, the IMEGIN large program aims to map the millimeter continuum emission of 22 nearby galaxies at 1.15 and 2 mm. Using the high-resolution maps produced by the NIKA2 camera, we explore the existence of very cold dust and take possible contamination by free-free and synchrotron emission into account. We stud…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 15 pages, 10 figures, Accepted for publication in A&A

    Journal ref: A&A 679, A7 (2023)

  36. arXiv:2309.02492  [pdf, other]

    astro-ph.GA

    Massive Optically Dark Galaxies Unveiled by JWST Challenge Galaxy Formation Models

    Authors: Mengyuan Xiao, Pascal Oesch, David Elbaz, Longji Bing, Erica Nelson, Andrea Weibel, Rohan Naidu, Emanuele Daddi, Rychard Bouwens, Jorryt Matthee, Stijn Wuyts, John Chisholm, Gabriel Brammer, Mark Dickinson, Benjamin Magnelli, Lucas Leroy, Pieter van Dokkum, Daniel Schaerer, Thomas Herard-Demanche, Laia Barrufet, Ryan Endsley, Yoshinobu Fudamoto, Carlos Gómez-Guijarro, Rashmi Gottumukkala, Garth Illingworth, et al. (12 additional authors not shown)

    Abstract: Over the past decade, the existence of a substantial population of optically invisible, massive galaxies at $z\gtrsim3$ has been implied from mid-infrared to millimeter observations. With the unprecedented sensitivity of the JWST, such extremely massive galaxy candidates have immediately been identified even at $z>7$, in much larger numbers than expected. These discoveries raised a hot debate. If…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Submitted to Nature. 22 pages, 4 main figures, 7 supplementary figures, 3 supplementary tables. Comments are welcome

  37. arXiv:2306.11372  [pdf, other]

    cs.CL cs.AI

    Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts

    Authors: Xuan-Phi Nguyen, Sharifah Mahani Aljunied, Shafiq Joty, Lidong Bing

    Abstract: Large language models (LLMs) are known to effectively perform tasks by simply observing few exemplars. However, in low-resource languages, obtaining such hand-picked exemplars can still be challenging, where unsupervised techniques may be necessary. Moreover, competent generative capabilities of LLMs are observed only in high-resource languages, while their performances among under-represented lan…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Pre-print

  38. arXiv:2306.09697  [pdf, other]

    cs.CL

    Class-Adaptive Self-Training for Relation Extraction with Incompletely Annotated Training Data

    Authors: Qingyu Tan, Lu Xu, Lidong Bing, Hwee Tou Ng

    Abstract: Relation extraction (RE) aims to extract relations from sentences and documents. Existing relation extraction models typically rely on supervised machine learning. However, recent studies showed that many RE datasets are incompletely annotated. This is known as the false negative problem in which valid relations are falsely annotated as 'no_relation'. Models trained with such data inevitably make…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Findings

  39. arXiv:2306.08952  [pdf, other]

    cs.CL cs.AI

    Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models

    Authors: Qingyu Tan, Hwee Tou Ng, Lidong Bing

    Abstract: Reasoning about time is of fundamental importance. Many facts are time-dependent. For example, athletes change teams from time to time, and different government officials are elected periodically. Previous time-dependent question answering (QA) datasets tend to be biased in either their coverage of time spans or question types. In this paper, we introduce a comprehensive probing dataset \tempreaso…

    Submitted 27 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  40. arXiv:2306.05179  [pdf, other]

    cs.CL cs.CV

    M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models

    Authors: Wenxuan Zhang, Sharifah Mahani Aljunied, Chang Gao, Yew Ken Chia, Lidong Bing

    Abstract: Despite the existence of various benchmarks for evaluating natural language processing models, we argue that human exams are a more suitable means of evaluating general intelligence for large language models (LLMs), as they inherently demand a much wider range of abilities such as language understanding, domain knowledge, and problem-solving skills. To this end, we introduce M3Exam, a novel benchm…

    Submitted 9 November, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 (Datasets and Benchmarks)

  41. arXiv:2306.04757  [pdf, other]

    cs.CL cs.AI

    INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models

    Authors: Yew Ken Chia, Pengfei Hong, Lidong Bing, Soujanya Poria

    Abstract: Instruction-tuned large language models have revolutionized natural language processing and have shown great potential in applications such as conversational agents. These models, such as GPT-4, can not only master language but also solve complex tasks in areas like mathematics, coding, medicine, and law. Despite their impressive capabilities, there is still a lack of comprehensive understanding r…

    Submitted 15 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: GitHub: https://github.com/declare-lab/instruct-eval Leaderboard: https://declare-lab.github.io/instruct-eval/

  42. arXiv:2306.02858  [pdf, other]

    cs.CL cs.CV cs.SD eess.AS

    Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

    Authors: Hang Zhang, Xin Li, Lidong Bing

    Abstract: We present Video-LLaMA, a multi-modal framework that empowers Large Language Models (LLMs) with the capability of understanding both visual and auditory content in the video. Video-LLaMA bootstraps cross-modal training from the frozen pre-trained visual and audio encoders and the frozen LLMs. Unlike previous works that complement LLMs to process the visual or audio signals only, Video-LLaMA enables…

    Submitted 25 October, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by EMNLP 2023's demo track; Code, Pretrained Model, and Dataset: https://github.com/DAMO-NLP-SG/Video-LLaMA

  43. arXiv:2305.19902  [pdf, other]

    cs.CL

    AQE: Argument Quadruplet Extraction via a Quad-Tagging Augmented Generative Approach

    Authors: Jia Guo, Liying Cheng, Wenxuan Zhang, Stanley Kok, Xin Li, Lidong Bing

    Abstract: Argument mining involves multiple sub-tasks that automatically identify argumentative elements, such as claim detection, evidence extraction, stance classification, etc. However, each subtask alone is insufficient for a thorough understanding of the argumentative structure and reasoning process. To learn a complete view of an argument essay and capture the interdependence among argumentative compo…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  44. arXiv:2305.15038  [pdf, other]

    cs.CL

    Is GPT-4 a Good Data Analyst?

    Authors: Liying Cheng, Xingxuan Li, Lidong Bing

    Abstract: As large language models (LLMs) have demonstrated their powerful capabilities in plenty of domains and tasks, including context understanding, code generation, language generation, data storytelling, etc., many data analysts may raise concerns if their jobs will be replaced by artificial intelligence (AI). This controversial topic has drawn great attention in public. However, we are still at a sta…

    Submitted 22 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 19 pages, 2 figures

  45. arXiv:2305.15014  [pdf, other]

    cs.CL

    Unlocking Temporal Question Answering for Large Language Models Using Code Execution

    Authors: Xingxuan Li, Liying Cheng, Qingyu Tan, Hwee Tou Ng, Shafiq Joty, Lidong Bing

    Abstract: Large language models (LLMs) have made significant progress in natural language processing (NLP), and are utilized extensively in various applications. Recent works, such as chain-of-thought (CoT), have shown that intermediate reasoning steps can improve the performance of LLMs for complex reasoning tasks, such as math problems and symbolic question-answering tasks. However, we notice the challeng…

    Submitted 24 May, 2023; originally announced May 2023.

  46. arXiv:2305.15005  [pdf, other]

    cs.CL

    Sentiment Analysis in the Era of Large Language Models: A Reality Check

    Authors: Wenxuan Zhang, Yue Deng, Bing Liu, Sinno Jialin Pan, Lidong Bing

    Abstract: Sentiment analysis (SA) has been a long-standing research area in natural language processing. It can offer rich insights into human sentiments and opinions and has thus seen considerable interest from both academia and industry. With the advent of large language models (LLMs) such as ChatGPT, there is a great potential for their employment on SA problems. However, the extent to which existing LLM…

    Submitted 24 May, 2023; originally announced May 2023.

  47. arXiv:2305.14434  [pdf, other]

    cs.CL

    Domain-Expanded ASTE: Rethinking Generalization in Aspect Sentiment Triplet Extraction

    Authors: Yew Ken Chia, Hui Chen, Wei Han, Guizhen Chen, Sharifah Mahani Aljunied, Soujanya Poria, Lidong Bing

    Abstract: Aspect Sentiment Triplet Extraction (ASTE) is a subtask of Aspect-Based Sentiment Analysis (ABSA) that considers each opinion term, their expressed sentiment, and the corresponding aspect targets. However, existing methods are limited to the in-domain setting with two domains. Hence, we propose a domain-expanded benchmark to address the in-domain, out-of-domain and cross-domain settings. We suppor…

    Submitted 23 May, 2023; originally announced May 2023.

  48. arXiv:2305.13645  [pdf, other]

    cs.CL

    mPMR: A Multilingual Pre-trained Machine Reader at Scale

    Authors: Weiwen Xu, Xin Li, Wai Lam, Lidong Bing

    Abstract: We present multilingual Pre-trained Machine Reader (mPMR), a novel method for multilingual machine reading comprehension (MRC)-style pre-training. mPMR aims to guide multilingual pre-trained language models (mPLMs) to perform natural language understanding (NLU) including both sequence classification and span extraction in multiple languages. To achieve cross-lingual generalization when only sourc…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: To appear at ACL 2023 main conference

  49. arXiv:2305.13628  [pdf, other]

    cs.CL

    Improving Self-training for Cross-lingual Named Entity Recognition with Contrastive and Prototype Learning

    Authors: Ran Zhou, Xin Li, Lidong Bing, Erik Cambria, Chunyan Miao

    Abstract: In cross-lingual named entity recognition (NER), self-training is commonly used to bridge the linguistic gap by training on pseudo-labeled target-language data. However, due to sub-optimal performance on target languages, the pseudo labels are often noisy and limit the overall performance. In this work, we aim to improve self-training for cross-lingual NER by combining representation learning and…

    Submitted 4 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023

  50. arXiv:2305.13269  [pdf, other]

    cs.CL

    Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources

    Authors: Xingxuan Li, Ruochen Zhao, Yew Ken Chia, Bosheng Ding, Shafiq Joty, Soujanya Poria, Lidong Bing

    Abstract: We present chain-of-knowledge (CoK), a novel framework that augments large language models (LLMs) by dynamically incorporating grounding information from heterogeneous sources. It results in more factual rationales and reduced hallucination in generation. Specifically, CoK consists of three stages: reasoning preparation, dynamic knowledge adapting, and answer consolidation. Given a knowledge-inten…

    Submitted 21 February, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by ICLR 2024