-
Better Sampling of Negatives for Distantly Supervised Named Entity Recognition
Authors:
Lu Xu,
Lidong Bing,
Wei Lu
Abstract:
Distantly supervised named entity recognition (DS-NER) has been proposed to exploit the automatically labeled training data instead of human annotations. The distantly annotated datasets are often noisy and contain a considerable number of false negatives. The recent approach uses a weighted sampling approach to select a subset of negative samples for training. However, it requires a good classifier to assign weights to the negative samples. In this paper, we propose a simple and straightforward approach for selecting the top negative samples that have high similarities with all the positive samples for training. Our method achieves consistent performance improvements on four distantly supervised NER datasets. Our analysis also shows that it is critical to differentiate the true negatives from the false negatives.
Submitted 22 May, 2023;
originally announced May 2023.
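A minimal sketch of the selection idea in the abstract above: keep the negative samples that are most similar to the positive samples. The abstract does not state the similarity measure, how similarities to "all the positive samples" are aggregated, or how many negatives are kept. Cosine similarity, mean aggregation, the parameter `k`, and the function names `cosine` and `top_negatives` are therefore assumptions made for illustration, not the authors' implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def top_negatives(negatives, positives, k):
    """Return indices of the k negatives with the highest mean similarity to the positives.

    Assumption: similarity to "all positives" is aggregated with a plain mean.
    """
    if not positives:
        return []
    scored = []
    for idx, neg in enumerate(negatives):
        mean_sim = sum(cosine(neg, pos) for pos in positives) / len(positives)
        scored.append((mean_sim, idx))
    # Highest similarity first; ties broken by original index for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored[:k]]


if __name__ == "__main__":
    # Toy 2-d "span representations"; real ones would come from an encoder.
    positives = [[1.0, 0.0], [1.0, 0.1]]
    negatives = [[0.0, 1.0], [1.0, 0.0], [-1.0, 0.0], [0.9, 0.2]]
    print(top_negatives(negatives, positives, k=2))
```

In the toy data, the two negatives pointing in roughly the same direction as the positives are selected, while the orthogonal and opposite vectors are left out.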
-
Large Language Models are Not Yet Human-Level Evaluators for Abstractive Summarization
Authors:
Chenhui Shen,
Liying Cheng,
Xuan-Phi Nguyen,
Yang You,
Lidong Bing
Abstract:
With the recent undeniable advancement in reasoning abilities in large language models (LLMs) like ChatGPT and GPT-4, there is a growing trend for using LLMs on various tasks. One area where LLMs can be employed is as an alternative evaluation metric for complex generative tasks, which generally demands expensive human judges to complement the traditional automatic metrics for various evaluation dimensions such as fluency and consistency. In this work, we conduct extensive analysis to investigate the stability and reliability of LLMs as automatic evaluators for abstractive summarization. We found that while ChatGPT and GPT-4 outperform the commonly used automatic metrics, they are not ready as human replacements due to significant limitations. That is, LLM evaluators rate each candidate system inconsistently and are dimension-dependent. They also struggle to compare candidates with close performance and become more unreliable with higher-quality summaries by obtaining a lower correlation with humans. In other words, with better abstractive summarization systems being introduced at a fast pace, LLMs may result in misleading and unreliable evaluations.
Submitted 19 October, 2023; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction
Authors:
Thong Nguyen,
Xiaobao Wu,
Xinshuai Dong,
Anh Tuan Luu,
Cong-Duy Nguyen,
Zhen Hai,
Lidong Bing
Abstract:
Multimodal Review Helpfulness Prediction (MRHP) aims to rank product reviews based on predicted helpfulness scores and has been widely applied in e-commerce via presenting customers with useful reviews. Previous studies commonly employ fully-connected neural networks (FCNNs) as the final score predictor and pairwise loss as the training objective. However, FCNNs have been shown to perform inefficient splitting for review features, making the model difficult to clearly differentiate helpful from unhelpful reviews. Furthermore, pairwise objective, which works on review pairs, may not completely capture the MRHP goal to produce the ranking for the entire review list, and possibly induces low generalization during testing. To address these issues, we propose a listwise attention network that clearly captures the MRHP ranking context and a listwise optimization objective that enhances model generalization. We further propose gradient-boosted decision tree as the score predictor to efficaciously partition product reviews' representations. Extensive experiments demonstrate that our method achieves state-of-the-art results and polished generalization performance on two large-scale MRHP benchmark datasets.
Submitted 25 May, 2023; v1 submitted 21 May, 2023;
originally announced May 2023.
-
Enhancing Few-shot NER with Prompt Ordering based Data Augmentation
Authors:
Huiming Wang,
Liying Cheng,
Wenxuan Zhang,
De Wen Soh,
Lidong Bing
Abstract:
Recently, data augmentation (DA) methods have been proven to be effective for pre-trained language models (PLMs) in low-resource settings, including few-shot named entity recognition (NER). However, conventional NER DA methods are mostly aimed at sequence labeling models, i.e., token-level classification, and few are compatible with unified autoregressive generation frameworks, which can handle a wider range of NER tasks, such as nested NER. Furthermore, these generation frameworks have a strong assumption that the entities will appear in the target sequence with the same left-to-right order as the source sequence. In this paper, we claim that there is no need to keep this strict order, and more diversified but reasonable target entity sequences can be provided during the training stage as a novel DA method. Nevertheless, a naive mixture of augmented data can confuse the model since one source sequence will then be paired with different target sequences. Therefore, we propose a simple but effective Prompt Ordering based Data Augmentation (PODA) method to improve the training of unified autoregressive generation frameworks under few-shot NER scenarios. Experimental results on three public NER datasets and further analyses demonstrate the effectiveness of our approach.
Submitted 19 May, 2023;
originally announced May 2023.
-
Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling
Authors:
Shengqiong Wu,
Hao Fei,
Yixin Cao,
Lidong Bing,
Tat-Seng Chua
Abstract:
Existing research on multimodal relation extraction (MRE) faces two co-existing challenges, internal-information over-utilization and external-information under-exploitation. To combat that, we propose a novel framework that simultaneously implements the idea of internal-information screening and external-information exploiting. First, we represent the fine-grained semantic structures of the input image and text with the visual and textual scene graphs, which are further fused into a unified cross-modal graph (CMG). Based on CMG, we perform structure refinement with the guidance of the graph information bottleneck principle, actively denoising the less-informative features. Next, we perform topic modeling over the input image and text, incorporating latent multimodal topic features to enrich the contexts. On the benchmark MRE dataset, our system outperforms the current best model significantly. With further in-depth analyses, we reveal the great potential of our method for the MRE task. Our codes are open at https://github.com/ChocoWu/MRE-ISE.
Submitted 25 May, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Zero-Shot Text Classification via Self-Supervised Tuning
Authors:
Chaoqun Liu,
Wenxuan Zhang,
Guizhen Chen,
Xiaobao Wu,
Anh Tuan Luu,
Chip Hong Chang,
Lidong Bing
Abstract:
Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pre-trained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning .
Submitted 25 May, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Reasoning Implicit Sentiment with Chain-of-Thought Prompting
Authors:
Hao Fei,
Bobo Li,
Qian Liu,
Lidong Bing,
Fei Li,
Tat-Seng Chua
Abstract:
While sentiment analysis systems try to determine the sentiment polarities of given targets based on the key opinion expressions in input texts, in implicit sentiment analysis (ISA) the opinion cues come in an implicit and obscure manner. Thus detecting implicit sentiment requires the common-sense and multi-hop reasoning ability to infer the latent intent of opinion. Inspired by the recent chain-of-thought (CoT) idea, in this work we introduce a Three-hop Reasoning (THOR) CoT framework to mimic the human-like reasoning process for ISA. We design a three-step prompting principle for THOR to step-by-step induce the implicit aspect, opinion, and finally the sentiment polarity. Our THOR+Flan-T5 (11B) pushes the state-of-the-art (SoTA) by over 6% F1 on supervised setup. More strikingly, THOR+GPT3 (175B) boosts the SoTA by over 50% F1 on zero-shot setting. Our code is open at https://github.com/scofield7419/THOR-ISA.
Submitted 8 June, 2023; v1 submitted 18 May, 2023;
originally announced May 2023.
-
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment Analysis
Authors:
Yue Deng,
Wenxuan Zhang,
Sinno Jialin Pan,
Lidong Bing
Abstract:
Cross-domain aspect-based sentiment analysis (ABSA) aims to perform various fine-grained sentiment analysis tasks on a target domain by transferring knowledge from a source domain. Since labeled data only exists in the source domain, a model is expected to bridge the domain gap for tackling cross-domain ABSA. Though domain adaptation methods have proven to be effective, most of them are based on a discriminative model, which needs to be specifically designed for different ABSA tasks. To offer a more general solution, we propose a unified bidirectional generative framework to tackle various cross-domain ABSA tasks. Specifically, our framework trains a generative model in both text-to-label and label-to-text directions. The former transforms each task into a unified format to learn domain-agnostic features, and the latter generates natural sentences from noisy labels for data augmentation, with which a more accurate model can be trained. To investigate the effectiveness and generality of our framework, we conduct extensive experiments on four cross-domain ABSA tasks and present new state-of-the-art results on all tasks. Our data and code are publicly available at https://github.com/DAMO-NLP-SG/BGCA.
Submitted 16 May, 2023;
originally announced May 2023.
-
Easy-to-Hard Learning for Information Extraction
Authors:
Chang Gao,
Wenxuan Zhang,
Wai Lam,
Lidong Bing
Abstract:
Information extraction (IE) systems aim to automatically extract structured information, such as named entities, relations between entities, and events, from unstructured texts. While most existing work addresses a particular IE task, universally modeling various IE tasks with one model has achieved great success recently. Despite their success, they employ a one-stage learning strategy, i.e., directly learning to extract the target structure given the input text, which contradicts the human learning process. In this paper, we propose a unified easy-to-hard learning framework consisting of three stages, i.e., the easy stage, the hard stage, and the main stage, for IE by mimicking the human learning process. By breaking down the learning process into multiple stages, our framework facilitates the model to acquire general IE task knowledge and improve its generalization ability. Extensive experiments across four IE tasks demonstrate the effectiveness of our framework. We achieve new state-of-the-art results on 13 out of 17 datasets. Our code is available at https://github.com/DAMO-NLP-SG/IE-E2H.
Submitted 19 May, 2023; v1 submitted 16 May, 2023;
originally announced May 2023.
-
A Hierarchical Encoding-Decoding Scheme for Abstractive Multi-document Summarization
Authors:
Chenhui Shen,
Liying Cheng,
Xuan-Phi Nguyen,
Yang You,
Lidong Bing
Abstract:
Pre-trained language models (PLMs) have achieved outstanding achievements in abstractive single-document summarization (SDS). However, such benefits may not fully extend to multi-document summarization (MDS), where the handling of cross-document information is more complex. Previous works either design new MDS architectures or apply PLMs bluntly with concatenated source documents as a reformulated SDS task. While the former does not utilize previous pre-training efforts and may not generalize well across different domains, the latter may not sufficiently attend to the intricate cross-document relationships unique to MDS tasks. Instead, we enforce hierarchy on both the encoder and decoder to better utilize a PLM to facilitate multi-document interactions for the MDS task. Across 10 MDS benchmarks from various domains, our method outperforms or is competitive with the previous best models, including those with additional MDS pre-training or with more parameters. It outperforms its corresponding PLM backbone by up to 3 Rouge-L and is favored by humans.
Submitted 1 November, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
NIKA2 Cosmological Legacy Survey: Survey Description and Galaxy Number Counts
Authors:
L. Bing,
M. Béthermin,
G. Lagache,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
E. Artis,
H. Aussel,
A. Beelen,
A. Benoît,
S. Berta,
N. Billot,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
D. Elbaz,
A. Gkogkou,
A. Gomez,
J. Goupy,
C. Hanser
, et al. (26 additional authors not shown)
Abstract:
Aims. Deep millimeter surveys are necessary to probe the dust-obscured galaxies at high redshift. We conducted a large observing program at 1.2 and 2 mm with the NIKA2 camera installed on the IRAM 30-meter telescope. This NIKA2 Cosmological Legacy Survey (N2CLS) covers two emblematic fields: GOODS-N and COSMOS. We introduce the N2CLS survey and present new 1.2 and 2 mm number count measurements based on the tiered N2CLS observations from October 2017 to May 2021.
Methods. We develop an end-to-end simulation that combines an input sky model with the instrument noise and data reduction pipeline artifacts. This simulation is used to compute the sample purity, flux boosting, pipeline transfer function, completeness, and effective area of the survey. We used the 117 deg$^2$ SIDES simulations as the sky model, which include the galaxy clustering. Our formalism allows us to correct the source number counts to obtain galaxy number counts, the difference between the two being due to resolution effects caused by the blending of several galaxies inside the large beam of single-dish instruments.
Results. The N2CLS-May2021 survey reaches an average 1-$σ$ noise level of 0.17 and 0.048 mJy on GOODS-N over 159 arcmin$^2$, and 0.46 and 0.14 mJy on COSMOS over 1010 arcmin$^2$, at 1.2 and 2 mm, respectively. For a purity threshold of 80%, we detect 120 and 67 sources in GOODS-N and 195 and 76 sources in COSMOS, at 1.2 and 2 mm, respectively. Our measurement connects the bright single-dish to the deep interferometric number counts. After correcting for resolution effects, our results reconcile the single-dish and interferometric number counts and are further accurately compared with model predictions.
Submitted 11 May, 2023;
originally announced May 2023.
-
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework
Authors:
Ruochen Zhao,
Xingxuan Li,
Shafiq Joty,
Chengwei Qin,
Lidong Bing
Abstract:
As large language models (LLMs) have become the norm in NLP, demonstrating good performance in generation and reasoning tasks, one of their most fatal disadvantages is the lack of factual correctness. Generating unfactual texts not only leads to lower performances but also degrades the trust and validity of their applications. Chain-of-Thought (CoT) prompting improves trust and model performance on complex reasoning tasks by generating interpretable reasoning chains, but still suffers from factuality concerns in knowledge-intensive tasks. In this paper, we propose the Verify-and-Edit framework for CoT prompting, which seeks to increase prediction factuality by post-editing reasoning chains according to external knowledge. Building on top of GPT-3, our framework leads to accuracy improvements in multiple open-domain question-answering tasks.
Submitted 4 May, 2023;
originally announced May 2023.
-
Can ChatGPT-like Generative Models Guarantee Factual Accuracy? On the Mistakes of New Generation Search Engines
Authors:
Ruochen Zhao,
Xingxuan Li,
Yew Ken Chia,
Bosheng Ding,
Lidong Bing
Abstract:
Although large conversational AI models such as OpenAI's ChatGPT have demonstrated great potential, we question whether such models can guarantee factual accuracy. Recently, technology companies such as Microsoft and Google have announced new services which aim to combine search engines with conversational AI. However, we have found numerous mistakes in the public demonstrations that suggest we should not easily trust the factual claims of the AI models. Rather than criticizing specific models or companies, we hope to call on researchers and developers to improve AI models' transparency and factual correctness.
Submitted 2 March, 2023;
originally announced April 2023.
-
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
Authors:
Zhiqiang Hu,
Lei Wang,
Yihuai Lan,
Wanyu Xu,
Ee-Peng Lim,
Lidong Bing,
Xing Xu,
Soujanya Poria,
Roy Ka-Wei Lee
Abstract:
The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by finetuning open-access LLMs with task-specific data (e.g., ChatDoctor) or instruction data (e.g., Alpaca). Among the various fine-tuning methods, adapter-based parameter-efficient fine-tuning (PEFT) is undoubtedly one of the most attractive topics, as it only requires fine-tuning a few external parameters instead of the entire LLMs while achieving comparable or even better performance. To enable further research on PEFT methods of LLMs, this paper presents LLM-Adapters, an easy-to-use framework that integrates various adapters into LLMs and can execute these adapter-based PEFT methods of LLMs for different tasks. The framework includes state-of-the-art open-access LLMs such as LLaMA, BLOOM, and GPT-J, as well as widely used adapters such as Series adapters, Parallel adapter, Prompt-based learning and Reparametrization-based methods. Moreover, we conduct extensive empirical studies on the impact of adapter types, placement locations, and hyper-parameters to the best design for each adapter-based methods. We evaluate the effectiveness of the adapters on fourteen datasets from two different reasoning tasks, Arithmetic Reasoning and Commonsense Reasoning. The results demonstrate that using adapter-based PEFT in smaller-scale LLMs (7B) with few extra trainable parameters yields comparable, and in some cases superior, performance to powerful LLMs (175B) in zero-shot inference on both reasoning tasks.
Submitted 9 October, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
Towards Integration of Discriminability and Robustness for Document-Level Relation Extraction
Authors:
Jia Guo,
Stanley Kok,
Lidong Bing
Abstract:
Document-level relation extraction (DocRE) predicts relations for entity pairs that rely on long-range context-dependent reasoning in a document. As a typical multi-label classification problem, DocRE faces the challenge of effectively distinguishing a small set of positive relations from the majority of negative ones. This challenge becomes even more difficult to overcome when there exists a significant number of annotation errors in the dataset. In this work, we aim to achieve better integration of both the discriminability and robustness for the DocRE problem. Specifically, we first design an effective loss function to endow high discriminability to both probabilistic outputs and internal representations. We innovatively customize entropy minimization and supervised contrastive learning for the challenging multi-label and long-tailed learning problems. To ameliorate the impact of label errors, we equipped our method with a novel negative label sampling strategy to strengthen the model robustness. In addition, we introduce two new data regimes to mimic more realistic scenarios with annotation errors and evaluate our sampling strategy. Experimental results verify the effectiveness of each component and show that our method achieves new state-of-the-art results on the DocRED dataset, its recently cleaned version, Re-DocRED, and the proposed data regimes.
Submitted 3 April, 2023;
originally announced April 2023.
-
Evaluating Psychological Safety of Large Language Models
Authors:
Xingxuan Li,
Yutong Li,
Lin Qiu,
Shafiq Joty,
Lidong Bing
Abstract:
In this work, we designed unbiased prompts to systematically evaluate the psychological safety of large language models (LLMs). First, we tested five different LLMs by using two personality tests: Short Dark Triad (SD-3) and Big Five Inventory (BFI). All models scored higher than the human average on SD-3, suggesting a relatively darker personality pattern. Despite being instruction fine-tuned with safety metrics to reduce toxicity, InstructGPT, GPT-3.5, and GPT-4 still showed dark personality patterns; these models scored higher than self-supervised GPT-3 on the Machiavellianism and narcissism traits on SD-3. Then, we evaluated the LLMs in the GPT series by using well-being tests to study the impact of fine-tuning with more training data. We observed a continuous increase in the well-being scores of GPT models. Following these observations, we showed that fine-tuning Llama-2-chat-7B with responses from BFI using direct preference optimization could effectively reduce the psychological toxicity of the model. Based on the findings, we recommended the application of systematic and comprehensive psychological metrics to further evaluate and improve the safety of LLMs.
Submitted 29 February, 2024; v1 submitted 20 December, 2022;
originally announced December 2022.
-
Is GPT-3 a Good Data Annotator?
Authors:
Bosheng Ding,
Chengwei Qin,
Linlin Liu,
Yew Ken Chia,
Shafiq Joty,
Boyang Li,
Lidong Bing
Abstract:
Data annotation is the process of labeling data that could be used to train machine learning models. Having high-quality annotation is crucial, as it allows the model to learn the relationship between the input data and the desired output. GPT-3, a large-scale language model developed by OpenAI, has demonstrated impressive zero- and few-shot performance on a wide range of NLP tasks. It is therefore natural to wonder whether it can be used to effectively annotate data for NLP tasks. In this paper, we evaluate the performance of GPT-3 as a data annotator by comparing it with traditional data annotation methods and analyzing its output on a range of tasks. Through this analysis, we aim to provide insight into the potential of GPT-3 as a general-purpose data annotator in NLP.
Submitted 14 June, 2023; v1 submitted 20 December, 2022;
originally announced December 2022.
-
From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader
Authors:
Weiwen Xu,
Xin Li,
Wenxuan Zhang,
Meng Zhou,
Wai Lam,
Luo Si,
Lidong Bing
Abstract:
We present Pre-trained Machine Reader (PMR), a novel method for retrofitting pre-trained masked language models (MLMs) to pre-trained machine reading comprehension (MRC) models without acquiring labeled data. PMR can resolve the discrepancy between model pre-training and downstream fine-tuning of existing MLMs. To build the proposed PMR, we constructed a large volume of general-purpose and high-quality MRC-style training data by using Wikipedia hyperlinks and designed a Wiki Anchor Extraction task to guide the MRC-style pre-training. Apart from its simplicity, PMR effectively solves extraction tasks, such as Extractive Question Answering and Named Entity Recognition. PMR shows tremendous improvements over existing approaches, especially in low-resource scenarios. When applied to the sequence classification task in the MRC formulation, PMR enables the extraction of high-quality rationales to explain the classification process, thereby providing greater prediction explainability. PMR also has the potential to serve as a unified model for tackling various extraction and classification tasks in the MRC formulation.
Submitted 16 October, 2023; v1 submitted 9 December, 2022;
originally announced December 2022.
-
On the Effectiveness of Parameter-Efficient Fine-Tuning
Authors:
Zihao Fu,
Haoran Yang,
Anthony Man-Cho So,
Wai Lam,
Lidong Bing,
Nigel Collier
Abstract:
Fine-tuning pre-trained models has been ubiquitously proven to be effective in a wide range of NLP tasks. However, fine-tuning the whole model is parameter inefficient as it always yields an entirely new model for each task. Currently, many research works propose to only fine-tune a small portion of the parameters while keeping most of the parameters shared across different tasks. These methods achieve surprisingly good performance and are shown to be more stable than their corresponding fully fine-tuned counterparts. However, such kind of methods is still not well understood. Some natural questions arise: How does the parameter sparsity lead to promising performance? Why is the model more stable than the fully fine-tuned models? How to choose the tunable parameters? In this paper, we first categorize the existing methods into random approaches, rule-based approaches, and projection-based approaches based on how they choose which parameters to tune. Then, we show that all of the methods are actually sparse fine-tuned models and conduct a novel theoretical analysis of them. We indicate that the sparsity is actually imposing a regularization on the original model by controlling the upper bound of the stability. Such stability leads to better generalization capability which has been empirically observed in a lot of recent research works. Despite the effectiveness of sparsity grounded by our theory, it still remains an open problem of how to choose the tunable parameters. To better choose the tunable parameters, we propose a novel Second-order Approximation Method (SAM) which approximates the original problem with an analytically solvable optimization function. The tunable parameters are determined by directly optimizing the approximation function. The experimental results show that our proposed SAM model outperforms many strong baseline models and it also verifies our theoretical analysis.
Submitted 28 November, 2022;
originally announced November 2022.
-
A Dataset for Hyper-Relational Extraction and a Cube-Filling Approach
Authors:
Yew Ken Chia,
Lidong Bing,
Sharifah Mahani Aljunied,
Luo Si,
Soujanya Poria
Abstract:
Relation extraction has the potential for large-scale knowledge graph construction, but current methods do not consider the qualifier attributes for each relation triplet, such as time, quantity or location. The qualifiers form hyper-relational facts which better capture the rich and complex knowledge graph structure. For example, the relation triplet (Leonard Parker, Educated At, Harvard University) can be factually enriched by including the qualifier (End Time, 1967). Hence, we propose the task of hyper-relational extraction to extract more specific and complete facts from text. To support the task, we construct HyperRED, a large-scale and general-purpose dataset. Existing models cannot perform hyper-relational extraction as it requires a model to consider the interaction between three entities. Hence, we propose CubeRE, a cube-filling model inspired by table-filling approaches and explicitly considers the interaction between relation triplets and qualifiers. To improve model scalability and reduce negative class imbalance, we further propose a cube-pruning method. Our experiments show that CubeRE outperforms strong baselines and reveal possible directions for future research. Our code and data are available at github.com/declare-lab/HyperRED.
Submitted 17 November, 2022;
originally announced November 2022.
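The hyper-relational fact in the abstract above (a relation triplet enriched with a qualifier) can be pictured as a small data structure. The following is only an illustrative sketch: the class names `Qualifier` and `HyperRelationalFact` and the method `as_tuples` are hypothetical and are not taken from the HyperRED code or data format.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(frozen=True)
class Qualifier:
    """A qualifier attached to a relation triplet (hypothetical structure)."""
    label: str   # e.g. "End Time"
    value: str   # e.g. "1967"


@dataclass
class HyperRelationalFact:
    """A relation triplet plus zero or more qualifiers (hypothetical structure)."""
    head: str
    relation: str
    tail: str
    qualifiers: List[Qualifier] = field(default_factory=list)

    def as_tuples(self) -> List[Tuple[str, str, str, str, str]]:
        # Flatten into one 5-tuple per qualifier: the triplet followed by
        # the qualifier label and value.
        return [
            (self.head, self.relation, self.tail, q.label, q.value)
            for q in self.qualifiers
        ]


# The example given in the abstract.
fact = HyperRelationalFact(
    head="Leonard Parker",
    relation="Educated At",
    tail="Harvard University",
    qualifiers=[Qualifier(label="End Time", value="1967")],
)
print(fact.as_tuples())
```

Each flattened tuple involves the head entity, the tail entity, and the qualifier value, which matches the abstract's point that a model has to consider the interaction between three entities.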
-
ConNER: Consistency Training for Cross-lingual Named Entity Recognition
Authors:
Ran Zhou,
Xin Li,
Lidong Bing,
Erik Cambria,
Luo Si,
Chunyan Miao
Abstract:
Cross-lingual named entity recognition (NER) suffers from data scarcity in the target languages, especially under zero-shot settings. Existing translate-train or knowledge distillation methods attempt to bridge the language gap, but often introduce a high level of noise. To solve this problem, consistency training methods regularize the model to be robust towards perturbations on data or hidden states. However, such methods are likely to violate the consistency hypothesis, or mainly focus on coarse-grain consistency. We propose ConNER as a novel consistency training framework for cross-lingual NER, which comprises of: (1) translation-based consistency training on unlabeled target-language data, and (2) dropout-based consistency training on labeled source-language data. ConNER effectively leverages unlabeled target-language data and alleviates overfitting on the source language to enhance the cross-lingual adaptability. Experimental results show our ConNER achieves consistent improvement over various baseline methods.
Submitted 17 November, 2022;
originally announced November 2022.
-
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Authors:
Linlin Liu,
Xingxuan Li,
Megh Thakkar,
Xin Li,
Shafiq Joty,
Luo Si,
Lidong Bing
Abstract:
Due to the huge amount of parameters, fine-tuning of pretrained language models (PLMs) is prone to overfitting in the low resource scenarios. In this work, we present a novel method that operates on the hidden representations of a PLM to reduce overfitting. During fine-tuning, our method inserts random autoencoders between the hidden layers of a PLM, which transform activations from the previous layers into multi-view compressed representations before feeding them into the upper layers. The autoencoders are plugged out after fine-tuning, so our method does not add extra parameters or increase computation cost during inference. Our method demonstrates promising performance improvement across a wide range of sequence- and token-level low-resource NLP tasks.
Submitted 26 May, 2023; v1 submitted 16 November, 2022;
originally announced November 2022.
-
Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions
Authors:
Thong Nguyen,
Xiaobao Wu,
Anh-Tuan Luu,
Cong-Duy Nguyen,
Zhen Hai,
Lidong Bing
Abstract:
Modern Review Helpfulness Prediction systems are dependent upon multiple modalities, typically texts and images. Unfortunately, those contemporary approaches pay scarce attention to polish representations of cross-modal relations and tend to suffer from inferior optimization. This might cause harm to model's predictions in numerous cases. To overcome the aforementioned issues, we propose Multimodal Contrastive Learning for Multimodal Review Helpfulness Prediction (MRHP) problem, concentrating on mutual information between input modalities to explicitly elaborate cross-modal relations. In addition, we introduce Adaptive Weighting scheme for our contrastive learning approach in order to increase flexibility in optimization. Lastly, we propose Multimodal Interaction module to address the unalignment nature of multimodal data, thereby assisting the model in producing more reasonable multimodal representations. Experimental results show that our method outperforms prior baselines and achieves state-of-the-art results on two publicly available benchmark datasets for MRHP problem.
Submitted 7 November, 2022;
originally announced November 2022.
-
SentBS: Sentence-level Beam Search for Controllable Summarization
Authors:
Chenhui Shen,
Liying Cheng,
Lidong Bing,
Yang You,
Luo Si
Abstract:
A wide range of control perspectives have been explored in controllable text generation. Structure-controlled summarization is recently proposed as a useful and interesting research direction. However, current structure-controlling methods have limited effectiveness in enforcing the desired structure. To address this limitation, we propose a sentence-level beam search generation method (SentBS), where evaluation is conducted throughout the generation process to select suitable sentences for subsequent generations. We experiment with different combinations of decoding methods to be used as subcomponents by SentBS and evaluate results on the structure-controlled dataset MReD. Experiments show that all explored combinations for SentBS can improve the agreement between the generated text and the desired structure, with the best method significantly reducing the structural discrepancies suffered by the existing model, by approximately 68%.
Submitted 23 February, 2023; v1 submitted 26 October, 2022;
originally announced October 2022.
-
Retrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation
Authors:
Deng Cai,
Xin Li,
Jackie Chun-Sing Ho,
Lidong Bing,
Wai Lam
Abstract:
We introduce a new method to improve existing multilingual sentence embeddings with Abstract Meaning Representation (AMR). Compared with the original textual input, AMR is a structured semantic representation that presents the core concepts and relations in a sentence explicitly and unambiguously. It also helps reduce surface variations across different expressions and languages. Unlike most prior work that only evaluates the ability to measure semantic similarity, we present a thorough evaluation of existing multilingual sentence embeddings and our improved versions, which include a collection of five transfer tasks in different downstream applications. Experiment results show that retrofitting multilingual sentence embeddings with AMR leads to better state-of-the-art performance on both semantic textual similarity and transfer tasks. Our codebase and evaluation scripts can be found at https://github.com/jcyk/MSE-AMR.
Submitted 18 October, 2022;
originally announced October 2022.
-
PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks
Authors:
Weiwen Xu,
Xin Li,
Yang Deng,
Wai Lam,
Lidong Bing
Abstract:
Span identification aims at identifying specific text spans from text input and classifying them into pre-defined categories. Different from previous works that merely leverage the Subordinate (SUB) relation (i.e. if a span is an instance of a certain category) to train models, this paper for the first time explores the Peer (PR) relation, which indicates that two spans are instances of the same category and share similar features. Specifically, a novel Peer Data Augmentation (PeerDA) approach is proposed which employs span pairs with the PR relation as the augmentation data for training. PeerDA has two unique advantages: (1) There are a large number of PR span pairs for augmenting the training data. (2) The augmented data can prevent the trained model from over-fitting the superficial span-category mapping by pushing the model to leverage the span semantics. Experimental results on ten datasets over four diverse tasks across seven domains demonstrate the effectiveness of PeerDA. Notably, PeerDA achieves state-of-the-art results on six of them.
Submitted 18 May, 2023; v1 submitted 17 October, 2022;
originally announced October 2022.
-
The hidden side of cosmic star formation at z > 3: Bridging optically-dark and Lyman break galaxies with GOODS-ALMA
Authors:
Mengyuan Xiao,
David Elbaz,
Carlos Gómez-Guijarro,
Lucas Leroy,
Longji Bing,
Emanuele Daddi,
Benjamin Magnelli,
Maximilien Franco,
Luwenjia Zhou,
Mark Dickinson,
Tao Wang,
Wiphu Rujopakarn,
Georgios E. Magdis,
Ezequiel Treister,
Hanae Inami,
Ricardo Demarco,
Mark T. Sargent,
Xinwen Shu,
Jeyhan S. Kartaltepe,
David M. Alexander,
Matthieu Béthermin,
Frederic Bournaud,
Laure Ciesla,
Henry C. Ferguson,
Steven L. Finkelstein
, et al. (15 additional authors not shown)
Abstract:
Our current understanding of the cosmic star formation history at z>3 is primarily based on UV-selected galaxies (i.e., LBGs). Recent studies of H-dropouts have revealed that we may be missing a large proportion of star formation that is taking place in massive galaxies at z>3. In this work, we extend the H-dropout criterion to lower masses to select optically dark/faint galaxies (OFGs), in order to complete the census between LBGs and H-dropouts. Our criterion (H> 26.5 mag & [4.5] < 25 mag) combined with a de-blending technique is designed to select not only extremely dust-obscured massive galaxies but also normal star-forming galaxies. In total, we identified 27 OFGs at z_phot > 3 (z_med=4.1) in the GOODS-ALMA field, covering a wide distribution of stellar masses with log($M_{\star}$/$M_{\odot}$) = 9.4-11.1. We find that up to 75% of the OFGs with log($M_{\star}$/$M_{\odot}$) = 9.5-10.5 were neglected by previous LBGs and H-dropout selection techniques. After performing stacking analyses, the OFGs exhibit shorter gas depletion timescales, slightly lower gas fractions, and lower dust temperatures than typical star-forming galaxies. Their SFR_tot (SFR_IR+SFR_UV) is much larger than SFR_UVcorr (corrected for dust extinction), with SFR_tot/SFR_UVcorr = $8\pm1$, suggesting the presence of hidden dust regions in the OFGs that absorb all UV photons. The average dust size measured by a circular Gaussian model fit is R_e(1.13 mm)=1.01$\pm$0.05 kpc. We find that the cosmic SFRD at z>3 contributed by massive OFGs is at least two orders of magnitude higher than the one contributed by equivalently massive LBGs. Finally, we calculate the combined contribution of OFGs and LBGs to the cosmic SFRD at z=4-5 to be 4 $\times$ 10$^{-2}$ $M_{\odot}$ yr$^{-1}$Mpc$^{-3}$, which is about 0.15 dex (43%) higher than the SFRD derived from UV-selected samples alone at the same redshift.
Submitted 10 February, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
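The abstract above quotes an SFRD excess of "about 0.15 dex (43%)". A dex is a factor of ten in logarithmic units, so an offset of d dex corresponds to a multiplicative factor of 10^d. The short sketch below converts between the two; the function names are illustrative only and do not come from the paper.

```python
import math


def dex_to_fractional_increase(dex: float) -> float:
    """Fractional increase corresponding to an offset of `dex` in log10 space."""
    return 10.0 ** dex - 1.0


def fractional_increase_to_dex(fraction: float) -> float:
    """Offset in dex corresponding to a fractional increase `fraction`."""
    return math.log10(1.0 + fraction)


# Exactly 0.15 dex corresponds to an increase of roughly 41%.
rounded = dex_to_fractional_increase(0.15)

# A 43% increase corresponds to roughly 0.155 dex, which rounds to the
# "about 0.15 dex" quoted in the abstract.
unrounded = fractional_increase_to_dex(0.43)

print(f"0.15 dex -> +{rounded:.1%}")
print(f"+43%     -> {unrounded:.3f} dex")
```

Under this reading the two quoted numbers are consistent with each other once rounding of the dex value is taken into account; the exact unrounded value is not stated in the abstract.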
-
Candidate cosmic filament in the GJ526 field, mapped with the NIKA2 camera
Authors:
J. -F. Lestrade,
F. -X. Desert,
G. Lagache,
R. Adam,
P. Ade,
H. Ajeddig,
P. Andre,
E. Artis,
H. Aussel,
A. Beelen,
A. Benoit,
S. Berta,
M. Bethermin,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
A. Coulais,
M. De Petris,
S. Doyle,
E. F. C. Driessen,
A. Gomez,
J. Goupy,
F. Keruzore,
C. Kramer
, et al. (22 additional authors not shown)
Abstract:
Distinctive large-scale structures have been identified in the spatial distribution of optical galaxies up to redshift z ~ 1. In the more distant universe, the relationship between the dust-obscured population of star-forming galaxies observed at millimetre wavelengths and the network of cosmic filaments of dark matter apparent in all cosmological hydrodynamical simulations is still under study. Using the NIKA2 dual-band millimetre camera, we mapped a field of ~ 90 arcminutes^2 in the direction of the star GJ526 simultaneously in its 1.15-mm and 2.0-mm continuum wavebands to investigate the nature of the quasi-alignment of five sources found ten years earlier with the MAMBO camera at 1.2 mm. We find that these sources are not clumps of a circumstellar debris disc around this star as initially hypothesized. Rather, they must be dust-obscured star-forming galaxies, or sub-millimetre galaxies (SMGs), in the distant background. The new NIKA2 map at 1.15 mm reveals a total of seven SMGs distributed in projection on the sky along a filament-like structure crossing the whole observed field. Furthermore, we show that the NIKA2 and supplemental Herschel photometric data are compatible with a model of the spectral energy distributions (SEDs) of these sources when a common redshift of 2.5 and typical values of the dust parameters for SMGs are adopted. Hence, we speculate that these SMGs might be located in a filament of the distant `cosmic web'. The length of this candidate cosmic filament crossing the whole map is at least 4 cMpc (comoving), and the separations between sources are between 0.25 cMpc and 1.25 cMpc at this redshift, in line with expectations from cosmological simulations. Nonetheless, further observations to determine the precise spectroscopic redshifts of these sources are required to definitively support this hypothesis of SMGs embedded in a cosmic filament of dark matter.
Submitted 26 September, 2022;
originally announced September 2022.
-
Informative Text Generation from Knowledge Triples
Authors:
Zihao Fu,
Yijiang River Dong,
Lidong Bing,
Wai Lam
Abstract:
As the development of the encoder-decoder architecture, researchers are able to study the text generation tasks with broader types of data. Among them, KB-to-text aims at converting a set of knowledge triples into human readable sentences. In the original setting, the task assumes that the input triples and the text are exactly aligned in the perspective of the embodied knowledge/information. In this paper, we extend this setting and explore how to facilitate the trained model to generate more informative text, namely, containing more information about the triple entities but not conveyed by the input triples. To solve this problem, we propose a novel memory augmented generator that employs a memory network to memorize the useful knowledge learned during the training and utilizes such information together with the input triples to generate text in the operational or testing phase. We derive a dataset from WebNLG for our new setting and conduct extensive experiments to investigate the effectiveness of our model as well as uncover the intrinsic characteristics of the setting.
Submitted 26 September, 2022;
originally announced September 2022.
-
Multi-probe analysis of the galaxy cluster CL J1226.9+3332: Hydrostatic mass and hydrostatic-to-lensing bias
Authors:
M. Muñoz-Echeverría,
J. F. Macías-Pérez,
G. W. Pratt,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
M. Arnaud,
E. Artis,
H. Aussel,
I. Bartalucci,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
A. Ferragamo,
A. Gomez,
J. Goupy
, et al. (28 additional authors not shown)
Abstract:
The precise estimation of the mass of galaxy clusters is a major issue for cosmology. Large galaxy cluster surveys rely on scaling laws that relate cluster observables to their masses. From the high resolution observations of ~ 45 galaxy clusters with NIKA2 and XMM-Newton instruments, the NIKA2 SZ Large Program should provide an accurate scaling relation between the thermal Sunyaev-Zel'dovich effect and the hydrostatic mass. In this paper, we present an exhaustive analysis of the hydrostatic mass of the well known galaxy cluster CL J1226.9+3332, the highest-redshift cluster in the NIKA2 SZ Large Program at z = 0.89. We combine the NIKA2 observations with thermal Sunyaev-Zel'dovich data from NIKA, Bolocam and MUSTANG instruments and XMM-Newton X-ray observations and test the impact of the systematic effects on the mass reconstruction. We conclude that slight differences in the shape of the mass profile can be crucial when defining the integrated mass at R500, which demonstrates the importance of the modeling in the mass determination. We prove the robustness of our hydrostatic mass estimates by showing the agreement with all the results found in the literature. Another key information for cosmology is the bias of the masses estimated assuming hydrostatic equilibrium hypothesis. Based on the lensing convergence maps from the Cluster Lensing And Supernova survey with Hubble (CLASH) data, we obtain the lensing mass estimate for CL J1226.9+3332. From this we are able to measure the hydrostatic-to-lensing mass bias for this cluster, that spans from 1 - bHSE/lens ~ 0.7 to 1, presenting the impact of data-sets and mass reconstruction models on the bias.
Submitted 15 September, 2022;
originally announced September 2022.
-
SANCL: Multimodal Review Helpfulness Prediction with Selective Attention and Natural Contrastive Learning
Authors:
Wei Han,
Hui Chen,
Zhen Hai,
Soujanya Poria,
Lidong Bing
Abstract:
With the boom of e-commerce, Multimodal Review Helpfulness Prediction (MRHP), which aims to sort product reviews according to the predicted helpfulness scores has become a research hotspot. Previous work on this task focuses on attention-based modality fusion, information integration, and relation modeling, which primarily exposes the following drawbacks: 1) the model may fail to capture the really essential information due to its indiscriminate attention formulation; 2) lack appropriate modeling methods that take full advantage of correlation among provided data. In this paper, we propose SANCL: Selective Attention and Natural Contrastive Learning for MRHP. SANCL adopts a probe-based strategy to enforce high attention weights on the regions of greater significance. It also constructs a contrastive learning framework based on natural matching properties in the dataset. Experimental results on two benchmark datasets with three categories show that SANCL achieves state-of-the-art baseline performance with lower memory consumption.
Submitted 5 October, 2022; v1 submitted 12 September, 2022;
originally announced September 2022.
-
Revisiting DocRED -- Addressing the False Negative Problem in Relation Extraction
Authors:
Qingyu Tan,
Lu Xu,
Lidong Bing,
Hwee Tou Ng,
Sharifah Mahani Aljunied
Abstract:
The DocRED dataset is one of the most popular and widely used benchmarks for document-level relation extraction (RE). It adopts a recommend-revise annotation scheme so as to have a large-scale annotated dataset. However, we find that the annotation of DocRED is incomplete, i.e., false negative samples are prevalent. We analyze the causes and effects of the overwhelming false negative problem in the DocRED dataset. To address the shortcoming, we re-annotate 4,053 documents in the DocRED dataset by adding the missed relation triples back to the original DocRED. We name our revised DocRED dataset Re-DocRED. We conduct extensive experiments with state-of-the-art neural models on both datasets, and the experimental results show that the models trained and evaluated on our Re-DocRED achieve performance improvements of around 13 F1 points. Moreover, we conduct a comprehensive analysis to identify the potential areas for further improvement. Our dataset is publicly available at https://github.com/tonytan48/Re-DocRED.
Submitted 16 June, 2023; v1 submitted 25 May, 2022;
originally announced May 2022.
-
Starbursts with suppressed velocity dispersion revealed in a forming cluster at z=2.51
Authors:
Mengyuan Xiao,
Tao Wang,
David Elbaz,
Daisuke Iono,
Xing Lu,
Longji Bing,
Emanuele Daddi,
Benjamin Magnelli,
Carlos Gómez-Guijarro,
Frederic Bournaud,
Qiusheng Gu,
Shuowen Jin,
Francesco Valentino,
Anita Zanella,
Raphael Gobat,
Sergio Martin,
Gabriel Brammer,
Kotaro Kohno,
Corentin Schreiber,
Laure Ciesla,
Xiaoling Yu,
Koryo Okumura
Abstract:
One of the most prominent features of galaxy clusters is the presence of a dominant population of massive ellipticals in their cores. Stellar archaeology suggests that these gigantic beasts assembled most of their stars in the early Universe via starbursts. However, the role of dense environments and their detailed physical mechanisms in triggering starburst activities remain unknown. Here we report spatially resolved Atacama Large Millimeter/submillimeter Array (ALMA) observations of the CO $J= 3-2$ emission line, with a resolution of about 2.5 kiloparsecs, toward a forming galaxy cluster core with starburst galaxies at $z=2.51$. In contrast to starburst galaxies in the field often associated with galaxy mergers or highly turbulent gaseous disks, our observations show that the two starbursts in the cluster exhibit dynamically cold (rotation-dominated) gas-rich disks. Their gas disks have extremely low velocity dispersion ($σ_{\mathrm{0}} \sim 20-30$ km s$^{-1}$), which is three times lower than their field counterparts at similar redshifts. The high gas fraction and suppressed velocity dispersion yield gravitationally unstable gas disks, which enables highly efficient star formation. The suppressed velocity dispersion, likely induced by the accretion of corotating and coplanar cold gas, might serve as an essential avenue to trigger starbursts in massive halos at high redshifts.
Submitted 16 May, 2022;
originally announced May 2022.
-
Massive merging cluster PSZ2G091 as seen by the NIKA2 camera
Authors:
E. Artis,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
M. Arnaud,
H. Aussel,
I. Bartalucci,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
A. Ferragamo,
A. Gomez,
J. Goupy,
C. Hanser,
F. Kéruzoré,
C. Kramer
, et al. (27 additional authors not shown)
Abstract:
PSZ2 G091.83+26.11 is a galaxy cluster with M500 = 7.43 x 10^14 Msun at z = 0.822. This object exhibits a complex morphology with a clear bimodality observed in X-rays. However, it was detected and analysed in the Planck sample as a single, spherical cluster following a universal profile. This model can lead to miscalculations of thermodynamical quantities, like the pressure profile. As future multiwavelength cluster experiments will detect more and more objects at high redshifts, it is crucial to quantify this systematic effect. In this work, we use high-resolution observations of the NIKA2 camera to integrate the morphological characteristics of the cluster in our modelling. This is achieved by fitting a two-halo model to the SZ image and then by reconstruction of the resulting projected pressure profile. We then compare these results with the spherical assumption.
Submitted 29 April, 2022;
originally announced April 2022.
-
IAM: A Comprehensive and Large-Scale Dataset for Integrated Argument Mining Tasks
Authors:
Liying Cheng,
Lidong Bing,
Ruidan He,
Qian Yu,
Yan Zhang,
Luo Si
Abstract:
Traditionally, a debate requires a manual preparation process, including reading plenty of articles, selecting the claims, identifying the stances of the claims, seeking the evidence for the claims, etc. As AI debate has attracted more attention in recent years, it is worth exploring methods to automate the tedious process involved in the debating system. In this work, we introduce a comprehensive and large dataset named IAM, which can be applied to a series of argument mining tasks, including claim extraction, stance classification, evidence extraction, etc. Our dataset is collected from over 1k articles related to 123 topics. Nearly 70k sentences in the dataset are fully annotated based on their argument properties (e.g., claims, stances, evidence, etc.). We further propose two new integrated argument mining tasks associated with the debate preparation process: (1) claim extraction with stance classification (CESC) and (2) claim-evidence pair extraction (CEPE). We adopt a pipeline approach and an end-to-end method for each integrated task separately. Promising experimental results are reported to show the value and challenges of our proposed tasks, and to motivate future research on argument mining.
Submitted 16 July, 2022; v1 submitted 23 March, 2022;
originally announced March 2022.
-
Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge Distillation
Authors:
Qingyu Tan,
Ruidan He,
Lidong Bing,
Hwee Tou Ng
Abstract:
Document-level Relation Extraction (DocRE) is a more challenging task compared to its sentence-level counterpart. It aims to extract relations from multiple sentences at once. In this paper, we propose a semi-supervised framework for DocRE with three novel components. Firstly, we use an axial attention module for learning the interdependency among entity-pairs, which improves the performance on two-hop relations. Secondly, we propose an adaptive focal loss to tackle the class imbalance problem of DocRE. Lastly, we use knowledge distillation to overcome the differences between human annotated data and distantly supervised data. We conducted experiments on two DocRE datasets. Our model consistently outperforms strong baselines and its performance exceeds the previous SOTA by 1.36 F1 and 1.46 Ign_F1 score on the DocRED leaderboard. Our code and data will be released at https://github.com/tonytan48/KD-DocRE.
Submitted 21 March, 2022;
originally announced March 2022.
-
RelationPrompt: Leveraging Prompts to Generate Synthetic Data for Zero-Shot Relation Triplet Extraction
Authors:
Yew Ken Chia,
Lidong Bing,
Soujanya Poria,
Luo Si
Abstract:
Despite the importance of relation extraction in building and representing knowledge, less research is focused on generalizing to unseen relation types. We introduce the task setting of Zero-Shot Relation Triplet Extraction (ZeroRTE) to encourage further research in low-resource relation extraction methods. Given an input sentence, each extracted triplet consists of the head entity, relation label, and tail entity, where the relation label is not seen at the training stage. To solve ZeroRTE, we propose to synthesize relation examples by prompting language models to generate structured texts. Concretely, we unify language model prompts and structured text approaches to design a structured prompt template for generating synthetic relation samples when conditioning on relation label prompts (RelationPrompt). To overcome the limitation in extracting multiple relation triplets in a sentence, we design a novel Triplet Search Decoding method. Experiments on FewRel and Wiki-ZSL datasets show the efficacy of RelationPrompt for the ZeroRTE task and zero-shot relation classification. Our code and data are available at github.com/declare-lab/RelationPrompt.
Submitted 17 March, 2022;
originally announced March 2022.
-
A Survey on Aspect-Based Sentiment Analysis: Tasks, Methods, and Challenges
Authors:
Wenxuan Zhang,
Xin Li,
Yang Deng,
Lidong Bing,
Wai Lam
Abstract:
As an important fine-grained sentiment analysis problem, aspect-based sentiment analysis (ABSA), aiming to analyze and understand people's opinions at the aspect level, has been attracting considerable interest in the last decade. To handle ABSA in different scenarios, various tasks are introduced for analyzing different sentiment elements and their relations, including the aspect term, aspect category, opinion term, and sentiment polarity. Unlike early ABSA works focusing on a single sentiment element, many compound ABSA tasks involving multiple elements have been studied in recent years for capturing more complete aspect-level sentiment information. However, a systematic review of various ABSA tasks and their corresponding solutions is still lacking, a gap we aim to fill in this survey. More specifically, we provide a new taxonomy for ABSA which organizes existing studies along the axes of the sentiment elements concerned, with an emphasis on recent advances in compound ABSA tasks. From the perspective of solutions, we summarize the utilization of pre-trained language models for ABSA, which has brought the performance of ABSA to a new level. In addition, techniques for building more practical ABSA systems in cross-domain/lingual scenarios are discussed. Finally, we review some emerging topics and discuss some open challenges to outline potential future directions of ABSA.
Submitted 6 November, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Enhancing Cross-lingual Prompting with Dual Prompt Augmentation
Authors:
Meng Zhou,
Xin Li,
Yue Jiang,
Lidong Bing
Abstract:
Prompting shows promising results in few-shot scenarios. However, its strength for multilingual/cross-lingual problems has not been fully exploited. Zhao and Schütze (2021) made initial explorations in this direction by showing that cross-lingual prompting outperforms cross-lingual finetuning. In this paper, we conduct an empirical exploration of the effect of each component in cross-lingual prompting and derive language-agnostic Universal Prompting, which helps alleviate the discrepancies between source-language training and target-language inference. Based on this, we propose DPA, a dual prompt augmentation framework, aiming at relieving the data scarcity issue in few-shot cross-lingual prompting. Notably, for XNLI, our method achieves 46.54% with only 16 English training examples per class, significantly better than the 34.99% of finetuning. Our code is available at https://github.com/DAMO-NLP-SG/DPA.
Submitted 24 May, 2023; v1 submitted 15 February, 2022;
originally announced February 2022.
-
WISE view of changing-look AGNs: evidence for a transitional stage of AGNs
Authors:
Lyu Bing,
Wu Qingwen,
Yan Zhen,
Yu Wenfei,
Liu Hao
Abstract:
The discovery of changing-look active galactic nuclei (CLAGNs) with a significant change of optical broad emission lines (optical CLAGNs) and/or strong variation of line-of-sight column densities (X-ray CLAGNs) challenges the orientation-based AGN unification model. We explore mid-infrared (mid-IR) properties for a sample of 57 optical CLAGNs and 11 X-ray CLAGNs based on the Wide-field Infrared Survey Explorer (WISE) archive data. We find that Eddington-scaled mid-IR luminosities of both optical and X-ray CLAGNs lie between those of low-luminosity AGNs (LLAGNs) and luminous QSOs. The average Eddington-scaled mid-IR luminosities for optical and X-ray CLAGNs are $\sim 0.4\%$ and $\sim 0.5\%$, respectively, which roughly correspond to the bolometric luminosity of the transition between a radiatively inefficient accretion flow (RIAF) and a Shakura-Sunyaev disk (SSD). We estimate the time lags of the variation in the mid-IR behind that in the optical band for 13 CLAGNs with strong mid-IR variability, where the tight correlation between the time lag and the bolometric luminosity ($\tau - L$) for CLAGNs roughly follows that found in the luminous QSOs.
Submitted 6 February, 2022;
originally announced February 2022.
-
Probing the role of magnetic fields in star-forming filaments: NIKA2-Pol commissioning results toward OMC-1
Authors:
H. Ajeddig,
R. Adam,
P. Ade,
P. André,
E. Artis,
H. Aussel,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
A. Gomez,
J. Goupy,
F. Kéruzoré,
C. Kramer,
B. Ladjelate,
G. Lagache,
S. Leclercq,
J. -F. Lestrade
, et al. (21 additional authors not shown)
Abstract:
Dust polarization observations are a powerful, practical tool to probe the geometry (and to some extent, the strength) of magnetic fields in star-forming regions. In particular, Planck polarization data have revealed the importance of magnetic fields on large scales in molecular clouds. However, due to insufficient resolution, Planck observations are unable to constrain the B-field geometry on prestellar and protostellar scales. The high angular resolution of 11.7 arcsec provided by NIKA2-Pol 1.15 mm polarimetric imaging, corresponding to $\sim$ 0.02 pc at the distance of the Orion molecular cloud (OMC), makes it possible to advance our understanding of the B-field morphology in star-forming filaments and dense cores (IRAM 30m large program B-FUN). The commissioning of the NIKA2-Pol instrument has led to several challenging issues, in particular, the instrumental polarization or intensity-to-polarization (leakage) effect. In the present paper, we illustrate how this effect can be corrected for, leading to reliable exploitable data in a structured, extended source such as OMC-1. We present a statistical comparison between NIKA2-Pol and SCUBA2-Pol2 results in the OMC-1 region. We also present tentative evidence of local pinching of the B-field lines near Orion-KL, in the form of a new small-scale hourglass pattern, in addition to the larger-scale hourglass already seen by other instruments such as Pol2.
Submitted 29 November, 2021;
originally announced November 2021.
-
Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples
Authors:
Linlin Liu,
Xin Li,
Ruidan He,
Lidong Bing,
Shafiq Joty,
Luo Si
Abstract:
Knowledge-enhanced language representation learning has shown promising results across various knowledge-intensive NLP tasks. However, prior methods are limited in efficient utilization of multilingual knowledge graph (KG) data for language model (LM) pretraining. They often train LMs with KGs in indirect ways, relying on extra entity/relation embeddings to facilitate knowledge injection. In this work, we explore methods to make better use of the multilingual annotation and language agnostic property of KG triples, and present novel knowledge based multilingual language models (KMLMs) trained directly on the knowledge triples. We first generate a large amount of multilingual synthetic sentences using the Wikidata KG triples. Then based on the intra- and inter-sentence structures of the generated data, we design pretraining tasks to enable the LMs to not only memorize the factual knowledge but also learn useful logical patterns. Our pretrained KMLMs demonstrate significant performance improvements on a wide range of knowledge-intensive cross-lingual tasks, including named entity recognition (NER), factual knowledge retrieval, relation classification, and a newly designed logical reasoning task.
Submitted 18 October, 2022; v1 submitted 21 November, 2021;
originally announced November 2021.
-
PSZ2G091: A massive double cluster at z=0.822 observed by the NIKA2 camera
Authors:
E. Artis,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
M. Arnaud,
H. Aussel,
I. Bartalucci,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
A. Ferragamo,
A. Gomez,
J. Goupy,
F. Kéruzoré,
C. Kramer,
B. Ladjelate
, et al. (26 additional authors not shown)
Abstract:
PSZ2 G091.83+26.11 is a massive galaxy cluster with M500 = 7.43 x 10^14 Msun at z = 0.822. This object exhibits a complex morphology with a clear bimodality observed in X-rays. However, it was detected and analysed in the Planck sample as a single, spherical cluster following a universal profile [1]. This model can lead to miscalculations of thermodynamical quantities, like the pressure profile. As future multiwavelength cluster experiments will detect more and more objects at higher redshifts (where we expect the fraction of merging objects to be higher), it is crucial to quantify this systematic effect. In this work, we use high-resolution observations of PSZ2 G091.83+26.11 by the NIKA2 camera to integrate the morphological characteristics of the cluster in our modelling. This is achieved by fitting a two-halo model to the SZ image and then by reconstruction of the resulting projected pressure profile. We then compare these results with the spherical assumption.
Submitted 9 November, 2021;
originally announced November 2021.
-
Dust Emission in Galaxies at Millimeter Wavelengths: Cooling of star forming regions in NGC6946
Authors:
G. Ejlali,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
E. Artis,
H. Ausse,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
I. de Looze,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
M. Galametz,
F. Galliano,
A. Gomez,
J. Goupy,
A. P. Jones,
A. Hughes
, et al. (32 additional authors not shown)
Abstract:
Interstellar dust plays an important role in the formation of molecular gas and the heating and cooling of the interstellar medium. The spatial distribution of the mm-wavelength dust emission from galaxies is largely unexplored. The NIKA2 Guaranteed Time Project IMEGIN (Interpreting the Millimeter Emission of Galaxies with IRAM and NIKA2) has recently mapped the mm emission in the grand design spiral galaxy NGC6946. By subtracting the contributions from the free-free, synchrotron, and CO line emission, we map the distribution of the pure dust emission at 1.15 mm and 2 mm. Separating the arm/interarm regions, we find a dominant 2 mm emission from interarms, indicating the significant role of the general interstellar radiation field in heating the cold dust. Finally, we present maps of the dust mass, temperature, and emissivity index using Bayesian MCMC modeling of the spectral energy distribution in NGC6946.
Submitted 6 November, 2021;
originally announced November 2021.
-
Galactic star formation with NIKA2 (GASTON): Filament convergence and its link to star formation
Authors:
N. Peretto,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
E. Artis,
H. Aussel,
A. Bacmann,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
A. Gomez,
J. Goupy,
F. Kéruzoré,
C. Kramer,
B. Ladjelate,
G. Lagache
, et al. (23 additional authors not shown)
Abstract:
In the past decade, filaments have been recognised as a major structural element of the interstellar medium, with the densest of these filaments hosting the formation of most stars. In some star-forming molecular clouds, converging networks of filaments, also known as hub filament systems, can be found. These hubs are believed to be preferentially associated with massive star formation. As of today, there are no metrics that allow the systematic quantification of filament network convergence. Here, we used the IRAM 30m NIKA2 observations of the Galactic plane from the GASTON large programme to systematically identify filaments and produce a filament convergence parameter map. We use such a map to show that: i. hub filaments represent a small fraction of the global filament population; ii. hubs host, in proportion, more massive and more luminous compact sources than non-hubs; iii. hub-hosting clumps are more evolved than non-hubs; iv. no discontinuities are observed in the properties of compact sources as a function of convergence parameter. We propose that the rapid global collapse of clumps is responsible for (re)organising filament networks into hubs and, in parallel, enhancing the mass growth of compact sources.
Submitted 5 November, 2021;
originally announced November 2021.
-
Crab nebula at 260 GHz with the NIKA2 polarimeter. Implications for the polarization angle calibration of future CMB experiments
Authors:
A. Ritacco,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
E. Artis,
J. Aumont,
H. Aussel,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
A. Gomez,
J. Goupy,
F. Kéruzoré,
C. Kramer,
B. Ladjelate,
G. Lagache
, et al. (21 additional authors not shown)
Abstract:
The quest for primordial gravitational waves enclosed in the Cosmic Microwave Background (CMB) polarization B-mode signal motivates the development of a new generation of highly sensitive experiments (e.g. CMB-S4, LiteBIRD) that would be able to detect their imprint. Nevertheless, this will only be possible by ensuring a high level of control of the instrumental systematic effects and an accurate absolute calibration of the polarization angle. The Crab nebula is known to be a polarization calibrator on the sky for CMB experiments; already used for the Planck satellite, it exhibits a highly polarized signal at microwave wavelengths. In this work we present Crab polarization observations obtained at the central frequency of 260 GHz with the NIKA2 instrument and discuss the accuracy needed on such a measurement to improve the constraints on the absolute angle calibration for CMB experiments.
Submitted 3 November, 2021;
originally announced November 2021.
-
Overdensity of SubMillimiter Galaxies in the GJ526 Field mapped with the NIKA2 Camera
Authors:
J. -F. Lestrade,
R. Adam,
P. Ade,
H. Ajeddig,
P. Andre,
E. Artis,
H. Aussel,
A. Beelen,
A. Benoit,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
A. Coulais,
M. De Petris,
F. -X. Desert,
S. Doyle,
E. F. C. Driessen,
A. Gomez,
J. Goupy,
F. Keruzore,
C. Kramer,
B. Ladjelate,
G. Lagache
, et al. (21 additional authors not shown)
Abstract:
Using the NIKA2 dual band millimeter camera installed on the IRAM30m telescope, we have mapped a relatively large field (~70 arcmin^2) in the direction of the star GJ526 to investigate the nature of the sources found with the MAMBO camera at 1.2 mm ten years earlier. We have found that they must be dust-obscured galaxies (SMGs) in the background beyond the star. The new NIKA2 map at 1.15 mm reveals additional sources and, in fact, an overdensity of SMGs predominantly distributed along a filament-like structure in projection on the sky across the whole observed field. We speculate this might be a cosmic filament at high redshift as revealed in cosmological hydrodynamical simulations. Measurement of spectroscopic redshifts of the SMGs in the candidate filament is required now for a definitive confirmation of the nature of the structure.
Submitted 2 November, 2021;
originally announced November 2021.
-
Exploring the millimetre emission in nearby galaxies: analysis of the edge-on galaxy NGC 891
Authors:
S. Katsioli,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
E. Artis,
H. Aussel,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
I. De Looze,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
G. Ejlali,
M. Galametz,
F. Galliano,
A. Gomez,
J. Goupy,
A. P. Jones
, et al. (32 additional authors not shown)
Abstract:
New observations of the edge-on galaxy NGC 891, at 1.15 and 2 mm obtained with the IRAM 30-m telescope and the NIKA2 camera, within the framework of the IMEGIN (Interpreting the Millimetre Emission of Galaxies with IRAM and NIKA2) Large Program, are presented in this work. By using multiwavelength maps (from the mid-IR to the cm wavelengths) we perform SED fitting in order to extract the physical properties of the galaxy on both global and local ($\sim$kpc) scales. For the interpretation of the observations we make use of a state-of-the-art SED fitting code, HerBIE (HiERarchical Bayesian Inference for dust Emission). The observations indicate a galaxy morphology, at mm wavelengths, similar to that of the cold dust emission traced by sub-mm observations and to that of the molecular gas. The contribution of the radio emission at the NIKA2 bands is very small (negligible at 1.15 mm and $\sim10\%$ at 2 mm) while it dominates the total energy budget at longer wavelengths (beyond 5 mm). On local scales, the distribution of the free-free emission resembles that of the dust thermal emission while the distribution of the synchrotron emission shows a deficiency along the major axis of the disc of the galaxy.
Submitted 2 November, 2021;
originally announced November 2021.
-
The NIKA2 Sunyaev-Zeldovich Large Program
Authors:
L. Perotto,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
M. Arnaud,
E. Artis,
H. Aussel,
I. Bartalucci,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
A. Ferragamo,
A. Gomez,
J. Goupy,
F. Kéruzoré,
C. Kramer
, et al. (26 additional authors not shown)
Abstract:
The NIKA2 Guaranteed-Time SZ Large Program (LPSZ) is dedicated to the high-angular resolution SZ mapping of a representative sample of 45 SZ-selected galaxy clusters drawn from the catalogues of the Planck satellite, or of the Atacama Cosmology Telescope. The LPSZ sample spans a mass range from $3$ to $11 \times 10^{14} M_{\odot}$ and a redshift range from $0.5$ to $0.9$, extending to higher redshift and lower mass the previous samples dedicated to the cluster mass calibration and universal properties estimation. The main goals of the LPSZ are the measurement of the average radial profile of the ICM pressure up to $R_{500}$ by combining NIKA2 with Planck or ACT data, and the estimation of the scaling law between the SZ observable and the mass using NIKA2, XMM-Newton and Planck/ACT data. Furthermore, combining LPSZ data with existing or forthcoming public data in lensing, optical/NIR or radio domains, we will build a consistent picture of the cluster physics and further gain knowledge on the mass estimate as a function of the cluster morphology and dynamical state.
Submitted 2 November, 2021;
originally announced November 2021.
-
The LPSZ-CLASH galaxy cluster sample: combining lensing and hydrostatic mass estimates
Authors:
M. Muñoz-Echeverría,
R. Adam,
P. Ade,
H. Ajeddig,
P. André,
M. Arnaud,
E. Artis,
H. Aussel,
I. Bartalucci,
A. Beelen,
A. Benoît,
S. Berta,
L. Bing,
O. Bourrion,
M. Calvo,
A. Catalano,
M. De Petris,
F. -X. Désert,
S. Doyle,
E. F. C. Driessen,
A. Ferragamo,
A. Gomez,
J. Goupy,
F. Kéruzoré,
C. Kramer
, et al. (26 additional authors not shown)
Abstract:
Starting from the clusters included in the NIKA sample and in the NIKA2 Sunyaev-Zel'dovich Large Program (LPSZ) we have selected a sample of six common objects with the Cluster Lensing And Supernova survey with Hubble (CLASH) lensing data. For the LPSZ clusters we have at our disposal both high-angular resolution observations of the thermal SZ with NIKA and NIKA2 and X-ray observations with XMM-Newton from which hydrostatic mass estimates can be derived. In addition, the CLASH dataset includes lensing convergence maps that can be converted into lensing estimates of the total mass of the cluster. One-dimensional mass profiles are used to derive integrated mass estimates accounting for systematic effects (data processing, modeling, etc.). Two-dimensional analysis of the maps can reveal substructures in the cluster and, therefore, inform us about the dynamical state of each system. Moreover, we are able to study the hydrostatic mass to lensing mass bias, across different morphology and a range of redshift clusters to give more insight on the hydrostatic mass bias. The analysis presented in this proceeding follows the study discussed in Ferragamo et al. 2021.
Submitted 2 November, 2021;
originally announced November 2021.