-
Efficient Grammatical Error Correction Via Multi-Task Training and Optimized Training Schedule
Authors:
Andrey Bout,
Alexander Podolskiy,
Sergey Nikolenko,
Irina Piontkovskaya
Abstract:
Progress in neural grammatical error correction (GEC) is hindered by the lack of annotated training data. Sufficient amounts of high-quality manually annotated data are not available, so recent research has relied on generating synthetic data, pretraining on it, and then fine-tuning on real datasets; performance gains have been achieved either by ensembling or by using huge pretrained models such…
▽ More
Progress in neural grammatical error correction (GEC) is hindered by the lack of annotated training data. Sufficient amounts of high-quality manually annotated data are not available, so recent research has relied on generating synthetic data, pretraining on it, and then fine-tuning on real datasets; performance gains have been achieved either by ensembling or by using huge pretrained models such as XXL-T5 as the backbone. In this work, we explore an orthogonal direction: how to use available data more efficiently. First, we propose auxiliary tasks that exploit the alignment between the original and corrected sentences, such as predicting a sequence of corrections. We formulate each task as a sequence-to-sequence problem and perform multi-task training. Second, we discover that the order of datasets used for training and even individual instances within a dataset may have important effects on the final performance, so we set out to find the best training schedule. Together, these two ideas lead to significant improvements, producing results that improve state of the art with much smaller models; in particular, we outperform the best models based on T5-XXL (11B parameters) with a BART-based model (400M parameters).
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding
Authors:
Konstantin Yakovlev,
Alexander Podolskiy,
Andrey Bout,
Sergey Nikolenko,
Irina Piontkovskaya
Abstract:
Grammatical error correction (GEC) is an important NLP task that is currently usually solved with autoregressive sequence-to-sequence models. However, approaches of this class are inherently slow due to one-by-one token generation, so non-autoregressive alternatives are needed. In this work, we propose a novel non-autoregressive approach to GEC that decouples the architecture into a permutation ne…
▽ More
Grammatical error correction (GEC) is an important NLP task that is currently usually solved with autoregressive sequence-to-sequence models. However, approaches of this class are inherently slow due to one-by-one token generation, so non-autoregressive alternatives are needed. In this work, we propose a novel non-autoregressive approach to GEC that decouples the architecture into a permutation network that outputs a self-attention weight matrix that can be used in beam search to find the best permutation of input tokens (with auxiliary {ins} tokens) and a decoder network based on a step-unrolled denoising autoencoder that fills in specific tokens. This allows us to find the token permutation after only one forward pass of the permutation network, avoiding autoregressive constructions. We show that the resulting network improves over previously known non-autoregressive methods for GEC and reaches the level of autoregressive methods that do not use language-specific synthetic data generation methods. Our results are supported by a comprehensive experimental validation on the ConLL-2014 and Write&Improve+LOCNESS datasets and an extensive ablation study that supports our architectural and algorithmic choices.
△ Less
Submitted 14 November, 2023;
originally announced November 2023.
-
Sinkhorn Transformations for Single-Query Postprocessing in Text-Video Retrieval
Authors:
Konstantin Yakovlev,
Gregory Polyakov,
Ilseyar Alimova,
Alexander Podolskiy,
Andrey Bout,
Sergey Nikolenko,
Irina Piontkovskaya
Abstract:
A recent trend in multimodal retrieval is related to postprocessing test set results via the dual-softmax loss (DSL). While this approach can bring significant improvements, it usually presumes that an entire matrix of test samples is available as DSL input. This work introduces a new postprocessing approach based on Sinkhorn transformations that outperforms DSL. Further, we propose a new postproc…
▽ More
A recent trend in multimodal retrieval is related to postprocessing test set results via the dual-softmax loss (DSL). While this approach can bring significant improvements, it usually presumes that an entire matrix of test samples is available as DSL input. This work introduces a new postprocessing approach based on Sinkhorn transformations that outperforms DSL. Further, we propose a new postprocessing setting that does not require access to multiple test queries. We show that our approach can significantly improve the results of state of the art models such as CLIP4Clip, BLIP, X-CLIP, and DRL, thus achieving a new state-of-the-art on several standard text-video retrieval datasets both with access to the entire test set and in the single-query setting.
△ Less
Submitted 14 November, 2023;
originally announced November 2023.
-
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Authors:
Xiaozhe Ren,
**yi Zhou,
Xinfan Meng,
Xin**g Huang,
Yadao Wang,
Weichao Wang,
Pengfei Li,
Xiaoda Zhang,
Alexander Podolskiy,
Grigory Arshinov,
Andrey Bout,
Irina Piontkovskaya,
Jiansheng Wei,
Xin Jiang,
Teng Su,
Qun Liu,
Jun Yao
Abstract:
The scaling of large language models has greatly improved natural language understanding, generation, and reasoning. In this work, we develop a system that trained a trillion-parameter language model on a cluster of Ascend 910 AI processors and MindSpore framework, and present the language model with 1.085T parameters named PanGu-Σ. With parameter inherent from PanGu-α, we extend the dense Transfo…
▽ More
The scaling of large language models has greatly improved natural language understanding, generation, and reasoning. In this work, we develop a system that trained a trillion-parameter language model on a cluster of Ascend 910 AI processors and MindSpore framework, and present the language model with 1.085T parameters named PanGu-Σ. With parameter inherent from PanGu-α, we extend the dense Transformer model to sparse one with Random Routed Experts (RRE), and efficiently train the model over 329B tokens by using Expert Computation and Storage Separation(ECSS). This resulted in a 6.3x increase in training throughput through heterogeneous computing. Our experimental findings show that PanGu-Σ provides state-of-the-art performance in zero-shot learning of various Chinese NLP downstream tasks. Moreover, it demonstrates strong abilities when fine-tuned in application data of open-domain dialogue, question answering, machine translation and code generation.
△ Less
Submitted 19 March, 2023;
originally announced March 2023.
-
Template-based Approach to Zero-shot Intent Recognition
Authors:
Dmitry Lamanov,
Pavel Burnyshev,
Ekaterina Artemova,
Valentin Malykh,
Andrey Bout,
Irina Piontkovskaya
Abstract:
The recent advances in transfer learning techniques and pre-training of large contextualized encoders foster innovation in real-life applications, including dialog assistants. Practical needs of intent recognition require effective data usage and the ability to constantly update supported intents, adopting new ones, and abandoning outdated ones. In particular, the generalized zero-shot paradigm, i…
▽ More
The recent advances in transfer learning techniques and pre-training of large contextualized encoders foster innovation in real-life applications, including dialog assistants. Practical needs of intent recognition require effective data usage and the ability to constantly update supported intents, adopting new ones, and abandoning outdated ones. In particular, the generalized zero-shot paradigm, in which the model is trained on the seen intents and tested on both seen and unseen intents, is taking on new importance. In this paper, we explore the generalized zero-shot setup for intent recognition. Following best practices for zero-shot text classification, we treat the task with a sentence pair modeling approach. We outperform previous state-of-the-art f1-measure by up to 16\% for unseen intents, using intent labels and user utterances and without accessing external sources (such as knowledge bases). Further enhancement includes lexicalization of intent labels, which improves performance by up to 7\%. By using task transferring from other sentence pair tasks, such as Natural Language Inference, we gain additional improvements.
△ Less
Submitted 22 June, 2022;
originally announced June 2022.
-
A Single Example Can Improve Zero-Shot Data Generation
Authors:
Pavel Burnyshev,
Valentin Malykh,
Andrey Bout,
Ekaterina Artemova,
Irina Piontkovskaya
Abstract:
Sub-tasks of intent classification, such as robustness to distribution shift, adaptation to specific user groups and personalization, out-of-domain detection, require extensive and flexible datasets for experiments and evaluation. As collecting such datasets is time- and labor-consuming, we propose to use text generation methods to gather datasets. The generator should be trained to generate utter…
▽ More
Sub-tasks of intent classification, such as robustness to distribution shift, adaptation to specific user groups and personalization, out-of-domain detection, require extensive and flexible datasets for experiments and evaluation. As collecting such datasets is time- and labor-consuming, we propose to use text generation methods to gather datasets. The generator should be trained to generate utterances that belong to the given intent. We explore two approaches to generating task-oriented utterances. In the zero-shot approach, the model is trained to generate utterances from seen intents and is further used to generate utterances for intents unseen during training. In the one-shot approach, the model is presented with a single utterance from a test intent. We perform a thorough automatic, and human evaluation of the dataset generated utilizing two proposed approaches. Our results reveal that the attributes of the generated data are close to original test sets, collected via crowd-sourcing.
△ Less
Submitted 16 August, 2021;
originally announced August 2021.
-
Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection
Authors:
Alexander Podolskiy,
Dmitry Lipin,
Andrey Bout,
Ekaterina Artemova,
Irina Piontkovskaya
Abstract:
Real-life applications, heavily relying on machine learning, such as dialog systems, demand out-of-domain detection methods. Intent classification models should be equipped with a mechanism to distinguish seen intents from unseen ones so that the dialog agent is capable of rejecting the latter and avoiding undesired behavior. However, despite increasing attention paid to the task, the best practic…
▽ More
Real-life applications, heavily relying on machine learning, such as dialog systems, demand out-of-domain detection methods. Intent classification models should be equipped with a mechanism to distinguish seen intents from unseen ones so that the dialog agent is capable of rejecting the latter and avoiding undesired behavior. However, despite increasing attention paid to the task, the best practices for out-of-domain intent detection have not yet been fully established.
This paper conducts a thorough comparison of out-of-domain intent detection methods. We prioritize the methods, not requiring access to out-of-domain data during training, gathering of which is extremely time- and labor-consuming due to lexical and stylistic variation of user utterances. We evaluate multiple contextual encoders and methods, proven to be efficient, on three standard datasets for intent classification, expanded with out-of-domain utterances. Our main findings show that fine-tuning Transformer-based encoders on in-domain data leads to superior results. Mahalanobis distance, together with utterance representations, derived from Transformer-based encoders, outperforms other methods by a wide margin and establishes new state-of-the-art results for all datasets.
The broader analysis shows that the reason for success lies in the fact that the fine-tuned Transformer is capable of constructing homogeneous representations of in-domain utterances, revealing geometrical disparity to out of domain utterances. In turn, the Mahalanobis distance captures this disparity easily.
The code is available in our GitHub repo: https://github.com/huawei-noah/noah-research/tree/master/Maha_OOD .
△ Less
Submitted 23 May, 2022; v1 submitted 11 January, 2021;
originally announced January 2021.
-
The University of Texas Millimeter Wave Observatory
Authors:
Paul A. Vanden Bout,
John H. Davis,
Robert B. Loren
Abstract:
This is an account of the Millimeter Wave Observatory, a 4.9 meter diameter antenna facility that pioneered continuum observations of planets and interstellar molecular spectroscopy from 1971 to 1988. The circumstances of its founding, development of its instrumentation, and major research contributions are discussed. The MWO role in training of personnel in this new field is illustrated by a list…
▽ More
This is an account of the Millimeter Wave Observatory, a 4.9 meter diameter antenna facility that pioneered continuum observations of planets and interstellar molecular spectroscopy from 1971 to 1988. The circumstances of its founding, development of its instrumentation, and major research contributions are discussed. The MWO role in training of personnel in this new field is illustrated by a listing of student and postdoctoral observers, with titles of PhD theses that included MWO data.
△ Less
Submitted 10 January, 2013;
originally announced January 2013.
-
CO(1-0) in z>2 Quasar Host Galaxies: No Evidence for Extended Molecular Gas Reservoirs
Authors:
Dominik A. Riechers,
Christopher L. Carilli,
Ronald J. Maddalena,
Jacqueline Hodge,
Andrew I. Harris,
Andrew J. Baker,
Fabian Walter,
Jeff Wagg,
Paul A. Vanden Bout,
Axel Weiss,
Chelsea E. Sharon
Abstract:
We report the detection of CO(1-0) emission in the strongly lensed high-redshift quasars IRAS F10214+4724 (z=2.286), the Cloverleaf (z=2.558), RX J0911+0551 (z=2.796), SMM J04135+10277 (z=2.846), and MG 0751+2716 (z=3.200), using the Expanded Very Large Array and the Green Bank Telescope. We report lensing-corrected CO(1-0) line luminosities of L'(CO) = 0.34-18.4 x 10^10 K km/s pc^2 and total mole…
▽ More
We report the detection of CO(1-0) emission in the strongly lensed high-redshift quasars IRAS F10214+4724 (z=2.286), the Cloverleaf (z=2.558), RX J0911+0551 (z=2.796), SMM J04135+10277 (z=2.846), and MG 0751+2716 (z=3.200), using the Expanded Very Large Array and the Green Bank Telescope. We report lensing-corrected CO(1-0) line luminosities of L'(CO) = 0.34-18.4 x 10^10 K km/s pc^2 and total molecular gas masses of M(H2) = 0.27-14.7 x 10^10 Msun for the sources in our sample. Based on CO line ratios relative to previously reported observations in J>=3 rotational transitions and line excitation modeling, we find that the CO(1-0) line strengths in our targets are consistent with single, highly-excited gas components with constant brightness temperature up to mid-J levels. We thus do not find any evidence for luminous extended, low excitation, low surface brightness molecular gas components. These properties are comparable to those found in z>4 quasars with existing CO(1-0) observations. These findings stand in contrast to recent CO(1-0) observations of z~2-4 submillimeter galaxies (SMGs), which have lower CO excitation and show evidence for multiple excitation components, including some low-excitation gas. These findings are consistent with the picture that gas-rich quasars and SMGs represent different stages in the early evolution of massive galaxies.
△ Less
Submitted 13 June, 2011;
originally announced June 2011.
-
Enhanced Dense Gas Fraction in Ultra-Luminous Infrared Galaxies
Authors:
S. Juneau,
D. T. Narayanan,
J. Moustakas,
Y. L. Shirley,
R. S. Bussmann,
R. C. Kennicutt Jr,
P. A. Vanden Bout
Abstract:
We present a detailed analysis of the relation between infrared luminosity and molecular line luminosity, for a variety of molecular transitions, using a sample of 34 nearby galaxies spanning a broad range of infrared luminosities (10^{10} < L_{IR} < 10^{12.5} L_sun). We show that the power-law index of the relation is sensitive to the critical density of the molecular gas tracer used, and that…
▽ More
We present a detailed analysis of the relation between infrared luminosity and molecular line luminosity, for a variety of molecular transitions, using a sample of 34 nearby galaxies spanning a broad range of infrared luminosities (10^{10} < L_{IR} < 10^{12.5} L_sun). We show that the power-law index of the relation is sensitive to the critical density of the molecular gas tracer used, and that the dominant driver in observed molecular line ratios in galaxies is the gas density. As most nearby ultraluminous infrared galaxies (ULIRGs) exhibit strong signatures of active galactic nuclei (AGN) in their center, we revisit previous claims questioning the reliability of HCN as a probe of the dense gas responsible for star formation in the presence of AGN. We find that the enhanced HCN(1-0)/CO(1-0) luminosity ratio observed in ULIRGs can be successfully reproduced using numerical models with fixed chemical abundances and without AGN-induced chemistry effects. We extend this analysis to a total of ten molecular line ratios by combining the following transitions: CO(1-0), HCO+(1-0), HCO+(3-2), HCN(1-0), and HCN(3-2). Our results suggest that AGNs reside in systems with higher dense gas fraction, and that chemistry or other effects associated with their hard radiation field may not dominate (NGC 1068 is one exception). Galaxy merger could be the underlying cause of increased dense gas fraction and the evolutionary stage of such mergers may be another determinant of the HCN/CO luminosity ratio.
△ Less
Submitted 3 December, 2009; v1 submitted 2 November, 2009;
originally announced November 2009.
-
The Star Formation Rate - Dense Gas Relation in Galaxies as Measured by HCN (3-2) Emission
Authors:
R. S. Bussmann,
D. Narayanan,
Y. L. Shirley,
S. Juneau,
J. Wu,
P. M. Solomon,
P. A. Vanden Bout,
J. Moustakas,
C. K. Walker
Abstract:
We present observations made with the 10m Heinrich Hertz Sub-Millimeter Telescope of HCN (3-2) emission from a sample of 30 nearby galaxies ranging in infrared luminosity from 10^10 - 10^12.5 L_sun and HCN (3-2) luminosity from 10^6 - 10^9 K km s^-1 pc^2. We examine the correlation between the infrared luminosity and HCN (3-2) luminosity and find that the best fit linear regression has a slope (…
▽ More
We present observations made with the 10m Heinrich Hertz Sub-Millimeter Telescope of HCN (3-2) emission from a sample of 30 nearby galaxies ranging in infrared luminosity from 10^10 - 10^12.5 L_sun and HCN (3-2) luminosity from 10^6 - 10^9 K km s^-1 pc^2. We examine the correlation between the infrared luminosity and HCN (3-2) luminosity and find that the best fit linear regression has a slope (in log-log space) of 0.74+/-0.12. Including recently published data from Gracia-Carpio et al. tightens the constraints on the best-fit slope to 0.79+/-0.09. This slope below unity suggests that the HCN (3-2) molecular line luminosity is not linearly tracing the amount of dense gas. Our results are consistent with predictions from recent theoretical models that find slopes below unity when the line luminosity depends upon the average gas density with a power-law index greater than a Kennicutt-Schmidt index of 1.5.
△ Less
Submitted 28 May, 2008;
originally announced May 2008.
-
VLA Limits for Intermediate Mass Black Holes in Three Globular Clusters
Authors:
F. N. Bash,
K. Gebhardt,
W. M. Goss,
P. A. Vanden Bout
Abstract:
The observational evidence for central black holes in globular clusters has been argued extensively, and their existence has important consequences for both the formation and evolution of the cluster. Most of the evidence comes from dynamical arguments, but the interpretation is difficult, given the short relaxation times and old ages of the clusters. One of the most robust signatures for the ex…
▽ More
The observational evidence for central black holes in globular clusters has been argued extensively, and their existence has important consequences for both the formation and evolution of the cluster. Most of the evidence comes from dynamical arguments, but the interpretation is difficult, given the short relaxation times and old ages of the clusters. One of the most robust signatures for the existence of a black hole is radio and/or X-ray emission. We observed three globular clusters, NGC6093 (M80), NGC6266 (M62), and NGC7078 (M15), with the VLA in the A and C configuration with a 3-sigma noise of 36, 36 and 25 microJy, respectively. We find no statistically-significant evidence for radio emission from the central region for any of the three clusters. NGC6266 shows a 2-sigma detection. It is difficult to infer a mass from these upper limits due to uncertainty about the central gas density, accretion rate, and accretion model.
△ Less
Submitted 1 October, 2007;
originally announced October 2007.
-
HCN Observations of Dense Star-Forming Gas in High Redshift Galaxies
Authors:
Yu Gao,
Chris L. Carilli,
Philip M. Solomon,
Paul A. Vanden Bout
Abstract:
We present here the sensitive HCN(1-0) observations made with the VLA of two submillimeter galaxies and two QSOs at high-redshift. HCN emission is the signature of dense molecular gas found in GMC cores, the actual sites of massive star formation. We have made the first detection of HCN in a submillimeter galaxy, SMM J16359+6612. The HCN emission is seen with a signal to noise ratio of 4$σ$ and…
▽ More
We present here the sensitive HCN(1-0) observations made with the VLA of two submillimeter galaxies and two QSOs at high-redshift. HCN emission is the signature of dense molecular gas found in GMC cores, the actual sites of massive star formation. We have made the first detection of HCN in a submillimeter galaxy, SMM J16359+6612. The HCN emission is seen with a signal to noise ratio of 4$σ$ and appears to be resolved as a double-source of $\approxlt 2''$ separation. Our new HCN observations, combined with previous HCN detections and upper limits, show that the FIR/HCN ratios in these high redshift sources lie systematically above the FIR/HCN correlation established for nearby galaxies by about a factor of 2. Even considering the scatter in the data and the presence of upper limits, this is an indication that the FIR/HCN ratios for the early Universe molecular emission line galaxies (EMGs) deviate from the correlation that fits Galactic giant molecular cloud cores, normal spirals, LIRGs, and ULIRGs. This indicates that the star formation rate per solar mass of dense molecular gas is higher in the high-$z$ objects than in local galaxies including normal spirals LIRGs and ULIRGs. The limited HCN detections at high-redshift show that the HCN/CO ratios for the high-$z$ objects are high and are comparable to those of the local ULIRGs rather than those of normal spirals. This indicates that EMGs have a high fraction of dense molecular gas compared to total molecular gas traced by CO emission.
△ Less
Submitted 21 March, 2007;
originally announced March 2007.
-
A New Probe of Dense Gas at High Redshift: Detection of HCO+(5-4) Line Emission in APM 08279+5255
Authors:
S. Garcia-Burillo,
J. Gracia-Carpio,
M. Guelin,
R. Neri,
P. Cox,
P. Planesas,
P. M. Solomon,
L. J. Tacconi,
P. A. Vanden Bout
Abstract:
We report the detection of HCO+(5-4) emission from the Broad Absorption Line (BAL) quasar APM08279+5255 at z=3.911 based on observations conducted at the IRAM Plateau de Bure interferometer. This represents the first detection of this molecular ion at such a high redshift. The inferred line luminosity, uncorrected for lensing, is L'(HCO+)=(3.5+-0.6)x10^10 Kkms^-1pc^2. The HCO+ J=5-4 source posit…
▽ More
We report the detection of HCO+(5-4) emission from the Broad Absorption Line (BAL) quasar APM08279+5255 at z=3.911 based on observations conducted at the IRAM Plateau de Bure interferometer. This represents the first detection of this molecular ion at such a high redshift. The inferred line luminosity, uncorrected for lensing, is L'(HCO+)=(3.5+-0.6)x10^10 Kkms^-1pc^2. The HCO+ J=5-4 source position coincides within the errors with that reported from previous HCN J=5-4 and high-J CO line observations of this quasar. The HCO+ line profile central velocity and width are consistent with those derived from HCN. This result suggests that HCO+(5-4) emission comes roughly from the same circumnuclear region probed by HCN. However, the HCN(5-4)/HCO+(5-4) intensity ratio measured in APM08279+5255 is significantly larger than that predicted by simple radiative transfer models, which assume collisional excitation and equal molecular abundances. This could imply that the [HCN]/[HCO^+] abundance ratio is particularly large in this source, or that the J=5 rotational levels are predominantly excited by IR fluorescent radiation.
△ Less
Submitted 25 May, 2006;
originally announced May 2006.
-
Connecting Dense Gas Tracers of Star Formation in our Galaxy to High-z Star Formation
Authors:
**gwen Wu,
Neal J. Evans II,
Yu Gao,
Philip M. Solomon,
Yancy L. Shirley,
Paul A. Vanden Bout
Abstract:
Observations have revealed prodigious amounts of star formation in starburst galaxies as traced by dust and molecular emission, even at large redshifts. Recent work shows that for both nearby spiral galaxies and distant starbursts, the global star formation rate, as indicated by the infrared luminosity, has a tight and almost linear correlation with the amount of dense gas as traced by the lumin…
▽ More
Observations have revealed prodigious amounts of star formation in starburst galaxies as traced by dust and molecular emission, even at large redshifts. Recent work shows that for both nearby spiral galaxies and distant starbursts, the global star formation rate, as indicated by the infrared luminosity, has a tight and almost linear correlation with the amount of dense gas as traced by the luminosity of HCN. Our surveys of Galactic dense cores in HCN 1-0 emission show that this correlation continues to a much smaller scale, with nearly the same ratio of infrared luminosity to HCN luminosity found over 7-8 orders of magnitude in L_IR, with a lower cutoff around 10^{4.5} L_sun of infrared luminosity. The linear correlation suggests that we may understand distant star formation in terms of the known properties of local star-forming regions. Both the correlation and the luminosity cutoff can be explained if the basic unit of star formation in galaxies is a dense core, similar to those studied in our Galaxy.
△ Less
Submitted 5 December, 2005; v1 submitted 15 November, 2005;
originally announced November 2005.
-
Molecular Gas at High Redshift
Authors:
P. M. Solomon,
P. A. Vanden Bout
Abstract:
The Early Universe Molecular Emission Line Galaxies (EMGs) are a population of galaxies with only 36 examples that hold great promise for the study of galaxy formation and evolution at high redshift. The classification, luminosity of molecular line emission, molecular mass, far-infrared (FIR) luminosity, star formation efficiency, morphology, and dynamical mass of the currently known sample are…
▽ More
The Early Universe Molecular Emission Line Galaxies (EMGs) are a population of galaxies with only 36 examples that hold great promise for the study of galaxy formation and evolution at high redshift. The classification, luminosity of molecular line emission, molecular mass, far-infrared (FIR) luminosity, star formation efficiency, morphology, and dynamical mass of the currently known sample are presented and discussed. The star formation rates derived from the FIR luminosity range from about 300 to 5000 M(sun)per year and the molecular mass from 4 x 10^9 to 1 x 10^{11} M(sun). At the lower end, these star formation rates, gas masses, and diameters are similar to those of local ultraluminous infrared galaxies, and represent starbursts in centrally concentrated disks, sometimes, but not always, associated with active galactic nuclei. The evidence for large (> 5 kpc) molecular disks is limited. Morphology and several high angular resolution images suggest that some EMGs are mergers with a massive molecular interstellar medium in both components. A critical question is whether the EMGs, in particular those at the higher end of the gas mass and luminosity distribution, represent the formation of massive, giant elliptical galaxies in the early Universe. The sample size is expected to grow explosively in the era of the Atacama Large Millimeter Array (ALMA).
△ Less
Submitted 23 August, 2005;
originally announced August 2005.
-
Origins of the ALMA Project in the scientific visions of the North American, European, and Japanese astronomical communities
Authors:
Paul A. Vanden Bout
Abstract:
ALMA is a worldwide project, the synthesis of early visions of astronomers in its three partner communities, Europe, North America, and Japan. The evolution of these concepts and their eventual merger into ALMA are discussed, setting the background for the papers which follow on the scientific requirements and expected performance of ALMA for extra-galactic, galactic, and solar system research.
ALMA is a worldwide project, the synthesis of early visions of astronomers in its three partner communities, Europe, North America, and Japan. The evolution of these concepts and their eventual merger into ALMA are discussed, setting the background for the papers which follow on the scientific requirements and expected performance of ALMA for extra-galactic, galactic, and solar system research.
△ Less
Submitted 2 November, 2004;
originally announced November 2004.
-
High Redshift HCN Emission: Dense Star-Forming Molecular Gas in IRAS F10214+4724
Authors:
P. A. Vanden Bout,
P. M. Solomon,
R. J. Maddalena
Abstract:
Hydrogen cyanide emission in the J=1-0 transition has been detected at redshift z=2.2858 in IRAS F10214+4724 using the Green Bank Telescope . This is the second detection of HCN emission at high redshift. The large HCN line luminosity in F10214 is similar to that in the Cloverleaf (z=2.6) and the ultra-luminous infrared galaxies Mrk231 and Arp220. This is also true of the ratio of HCN to CO lumi…
▽ More
Hydrogen cyanide emission in the J=1-0 transition has been detected at redshift z=2.2858 in IRAS F10214+4724 using the Green Bank Telescope . This is the second detection of HCN emission at high redshift. The large HCN line luminosity in F10214 is similar to that in the Cloverleaf (z=2.6) and the ultra-luminous infrared galaxies Mrk231 and Arp220. This is also true of the ratio of HCN to CO luminosities. The ratio of far-infrared luminosity to HCN luminosity, an indicator of the star formation rate per solar mass of dense gas, follows the correlation found for normal spirals and infrared luminous starburst galaxies. F10214 clearly contains a starburst that contributes, together with its embedded quasar, to its overall infrared luminosity. A new technique for removing spectral baselines in the search for weak, broad emission lines is presented.
△ Less
Submitted 9 September, 2004;
originally announced September 2004.