-
Shared Model of Sense-making for Human-Machine Collaboration
Authors:
Gheorghe Tecuci,
Dorin Marcu,
Louis Kaiser,
Mihai Boicu
Abstract:
We present a model of sense-making that greatly facilitates the collaboration between an intelligent analyst and a knowledge-based agent. It is a general model grounded in the science of evidence and the scientific method of hypothesis generation and testing, where sense-making hypotheses that explain an observation are generated, relevant evidence is then discovered, and the hypotheses are tested based on the discovered evidence. We illustrate how the model enables an analyst to directly instruct the agent to understand situations involving the possible production of weapons (e.g., chemical warfare agents) and how the agent becomes increasingly competent in understanding other situations from that domain (e.g., possible production of centrifuge-enriched uranium or of stealth fighter aircraft).
Submitted 5 November, 2021;
originally announced November 2021.
-
Toward a Computational Theory of Evidence-Based Reasoning for Instructable Cognitive Agents
Authors:
Gheorghe Tecuci,
Dorin Marcu,
Mihai Boicu,
Steven Meckl,
Chirag Uttamsingh
Abstract:
Evidence-based reasoning is at the core of many problem-solving and decision-making tasks in a wide variety of domains. Generalizing from the research and development of cognitive agents in several such domains, this paper presents progress toward a computational theory for the development of instructable cognitive agents for evidence-based reasoning tasks. The paper also illustrates the application of this theory to the development of four prototype cognitive agents in domains that are critical to the government and the public sector. Two agents function as cognitive assistants, one in intelligence analysis, and the other in science education. The other two agents operate autonomously, one in cybersecurity and the other in intelligence, surveillance, and reconnaissance. The paper concludes with the directions of future research on the proposed computational theory.
Submitted 9 October, 2019;
originally announced October 2019.
-
Co-Arg: Cogent Argumentation with Crowd Elicitation
Authors:
Mihai Boicu,
Dorin Marcu,
Gheorghe Tecuci,
Lou Kaiser,
Chirag Uttamsingh,
Navya Kalale
Abstract:
This paper presents Co-Arg, a new type of cognitive assistant to an intelligence analyst that enables the synergistic integration of analyst imagination and expertise, computer knowledge and critical reasoning, and crowd wisdom, to draw defensible and persuasive conclusions from masses of evidence of all types, in a world that is changing all the time. Co-Arg's goal is to improve the quality of the analytic results and enhance their understandability for both experts and novices. The performed analysis is based on a sound and transparent argumentation that links evidence to conclusions in a way that shows very clearly how the conclusions have been reached, what evidence was used and how, what is not known, and what assumptions have been made. The analytic results are presented in a report that describes the analytic conclusion and its probability, the main favoring and disfavoring arguments, the justification of the key judgments and assumptions, and the missing information that might increase the accuracy of the solution.
Submitted 2 October, 2018;
originally announced October 2018.
-
Learning Interpretable Spatial Operations in a Rich 3D Blocks World
Authors:
Yonatan Bisk,
Kevin J. Shih,
Yejin Choi,
Daniel Marcu
Abstract:
In this paper, we study the problem of mapping natural language instructions to complex spatial actions in a 3D blocks world. We first introduce a new dataset that pairs complex 3D spatial operations to rich natural language descriptions that require complex spatial and pragmatic interpretations such as "mirroring", "twisting", and "balancing". This dataset, built on the simulation environment of Bisk, Yuret, and Marcu (2016), attains language that is significantly richer and more complex, while also doubling the size of the original dataset in the 2D environment with 100 new world configurations and 250,000 tokens. In addition, we propose a new neural architecture that achieves competitive results while automatically discovering an inventory of interpretable spatial operations (Figure 5).
Submitted 24 December, 2017; v1 submitted 9 December, 2017;
originally announced December 2017.
-
Unsupervised Neural Hidden Markov Models
Authors:
Ke Tran,
Yonatan Bisk,
Ashish Vaswani,
Daniel Marcu,
Kevin Knight
Abstract:
In this work, we present the first results for neuralizing an Unsupervised Hidden Markov Model. We evaluate our approach on tag induction. Our approach outperforms existing generative models and is competitive with the state-of-the-art though with a simpler model easily extended to include additional context.
Submitted 28 September, 2016;
originally announced September 2016.
-
Extracting Biomolecular Interactions Using Semantic Parsing of Biomedical Text
Authors:
Sahil Garg,
Aram Galstyan,
Ulf Hermjakob,
Daniel Marcu
Abstract:
We advance the state of the art in biomolecular interaction extraction with three contributions: (i) We show that deep, Abstract Meaning Representations (AMR) significantly improve the accuracy of a biomolecular interaction extraction system when compared to a baseline that relies solely on surface- and syntax-based features; (ii) In contrast with previous approaches that infer relations on a sentence-by-sentence basis, we expand our framework to enable consistent predictions over sets of sentences (documents); (iii) We further modify and expand a graph kernel learning framework to enable concurrent exploitation of automatically induced AMR (semantic) and dependency structure (syntactic) representations. Our experiments show that our approach yields interaction extraction systems that are more robust in environments where there is a significant mismatch between training and test conditions.
Submitted 4 December, 2015;
originally announced December 2015.
-
Using Syntax-Based Machine Translation to Parse English into Abstract Meaning Representation
Authors:
Michael Pust,
Ulf Hermjakob,
Kevin Knight,
Daniel Marcu,
Jonathan May
Abstract:
We present a parser for Abstract Meaning Representation (AMR). We treat English-to-AMR conversion within the framework of string-to-tree, syntax-based machine translation (SBMT). To make this work, we transform the AMR structure into a form suitable for the mechanics of SBMT and useful for modeling. We introduce an AMR-specific language model and add data and features drawn from semantic resources. Our resulting AMR parser improves upon state-of-the-art results by 7 Smatch points.
Submitted 28 April, 2015; v1 submitted 24 April, 2015;
originally announced April 2015.
-
Looking into the Theory of Pulsar Accretion: Cen X-3 and XTE J1946+274
Authors:
Diana M. Marcu,
Katja Pottschmidt,
Amy M. Gottlieb,
Michael T. Wolff,
Peter A. Becker,
Joern Wilms,
Carlo Ferrigno,
Kent S. Wood
Abstract:
This is an overview of pulsar accretion modeling. The physics of pulsar accretion, i.e., the process of plasma flow onto the neutron star surface, can be constrained from the spectral properties of the X-ray source. We discuss a new implementation of the physical continuum model developed by Becker and Wolff (2007, ApJ 654, 435). The model incorporates Comptonized blackbody, bremsstrahlung, and cyclotron emission. We discuss preliminary results of applying the new tool to the test cases of Suzaku data of Cen X-3 and XTE J1946+274. Cen X-3 is a persistent accreting pulsar with an O-star companion observed during a bright period. XTE J1946+274 is a transient accreting pulsar with a Be companion observed during a dim period. Both sources show spectra that are well described with an empirical Fermi-Dirac cutoff power law model. We extend the spectral analysis by making the first steps towards a physical description of Cen X-3 and XTE J1946+274.
Submitted 11 February, 2015;
originally announced February 2015.
-
Spectral and Timing Nature of the Symbiotic X-ray Binary 4U 1954+319: The Slowest Rotating Neutron Star in an X-ray Binary System
Authors:
Teruaki Enoto,
Makoto Sasano,
Shin'ya Yamada,
Toru Tamagawa,
Kazuo Makishima,
Katja Pottschmidt,
Diana Marcu,
Robin H. D. Corbet,
Felix Fuerst,
Jorn Wilms
Abstract:
The symbiotic X-ray binary 4U 1954+319 is a rare system hosting a peculiar neutron star (NS) and an M-type optical companion. Its ~5.4h NS spin period is the longest among all known accretion-powered pulsars and exhibited large (~7%) fluctuations over 8 years. A spin trend transition was detected with Swift/BAT around an X-ray brightening in 2012. The source was in quiescent and bright states before and after this outburst based on 60 ks Suzaku observations in 2011 and 2012. The observed continuum is well described by a Comptonized model with the addition of a narrow 6.4 keV Fe Kalpha line during the outburst. Spectral similarities to slowly rotating pulsars in high-mass X-ray binaries, its high pulsed fraction (~60-80%), and the location in the Corbet diagram favor high B-field (>~1e+12 G) over a weak field as in low-mass X-ray binaries. The observed low X-ray luminosity (1e+33-1e+35 erg/s), probable wide orbit, and a slow stellar wind of this SyXB make quasi-spherical accretion in the subsonic settling regime a plausible model. Assuming a ~1e+13 G NS, this scheme can explain the ~5.4 h equilibrium rotation without employing the magnetar-like field (~1e+16 G) required in the disk accretion case. The time-scales of multiple irregular flares (~50 s) can also be attributed to the free-fall time from the Alfven shell for a ~1e+13 G field. A physical interpretation of SyXBs beyond the canonical binary classifications is discussed.
Submitted 1 April, 2014;
originally announced April 2014.
-
Cygnus X-1: shedding light on the spectral variability of a black hole
Authors:
V. Grinberg,
N. Hell,
J. Wilms,
J. Rodriguez,
K. Pottschmidt,
M. A. Nowak,
M. Böck,
A. Bodaghee,
M. Cadolle Bel,
F. Fürst,
M. Hanke,
M. Kühnel,
P. Laurent,
S. B. Markoff,
A. Markowitz,
D. M. Marcu,
G. G. Pooley,
A. Popp,
R. E. Rothschild,
J. A. Tomsick
Abstract:
The knowledge of the spectral state of a black hole is essential for the interpretation of data from black holes in terms of their emission models. Based on pointed observations of Cyg X-1 with the Rossi X-ray Timing Explorer (RXTE) that are used to classify simultaneous RXTE-ASM observations, we develop a scheme based on RXTE-ASM colors and count rates that can be used to classify all observations of this canonical black hole that were performed between 1996 and 2011. We show that a simple count rate criterion, as used previously, leads to a significantly higher fraction of misclassified observations. This scheme enables us to classify single INTEGRAL-IBIS science windows and to obtain summed spectra for the soft, intermediate and hard state with low contamination by other states.
Submitted 11 March, 2013;
originally announced March 2013.
-
A double-peaked outburst of A 0535+26 observed with INTEGRAL, RXTE, and Suzaku
Authors:
I. Caballero,
K. Pottschmidt,
D. M. Marcu,
L. Barragan,
C. Ferrigno,
D. Klochkov,
J. A. Zurita Heras,
S. Suchy,
J. Wilms,
P. Kretschmar,
A. Santangelo,
I. Kreykenbohm,
F. Fürst,
R. Rothschild,
R. Staubert,
M. H. Finger,
A. Camero-Arranz,
K. Makishima,
T. Enoto,
W. Iwakiri,
Y. Terada
Abstract:
The Be/X-ray binary A 0535+26 showed a normal (type I) outburst in August 2009. It is the fourth in a series of normal outbursts associated with the periastron, but is unusual by presenting a double-peaked light curve. The two peaks reached a flux of ~450 mCrab in the 15-50 keV range. We present results of the timing and spectral analysis of INTEGRAL, RXTE, and Suzaku observations of the outburst. The energy dependent pulse profiles and their evolution during the outburst are studied. No significant differences with respect to other normal outbursts are observed. The centroid energy of the fundamental cyclotron line shows no significant variation during the outburst. A spectral hardening with increasing luminosity is observed. We conclude that the source is accreting in the sub-critical regime. We discuss possible explanations for the double-peaked outburst.
Submitted 21 January, 2013;
originally announced January 2013.
-
4U 1626-67 as seen by Suzaku before and after the 2008 torque reversal
Authors:
A. Camero-Arranz,
K. Pottschmidt,
M. H. Finger,
N. R. Ikhsanov,
C. A. Wilson-Hodge,
D. M. Marcu
Abstract:
Aims. The accretion-powered pulsar 4U 1626-67 experienced a new torque reversal at the beginning of 2008, after about 18 years of steadily spinning down. The main goal of the present work is to study this recent torque reversal that occurred in 2008 February.
Methods. We present a spectral analysis of this source using two pointed observations performed by Suzaku in 2006 March and in 2010 September.
Results. We confirm with Suzaku the presence of a strong emission-line complex centered on 1 keV, with the strongest line being the hydrogen-like Ne Ly-alpha at 1.025(3) keV. We were able to resolve this complex with up to seven emission lines. A dramatic increase of the intensity of the Ne Ly-alpha line after the 2008 torque reversal occurred, with the equivalent width of this line reaching almost the same value measured by ASCA in 1993. We also report on the detection of a cyclotron line feature centered at ~37 keV. In spite of the fact that an increase of the X-ray luminosity (0.5-100 keV) of a factor of ~2.8 occurred between these two observations, no significant change in the energy of the cyclotron line feature was observed. However, the intensity of the ~1 keV line complex increased by an overall factor of ~8.
Conclusions. Our results favor a scenario in which the neutron star in 4U 1626-67 accretes material from a geometrically thin disk during both the spin-up and spin-down phases.
Submitted 6 September, 2012; v1 submitted 18 November, 2011;
originally announced November 2011.
-
A Suzaku View of Cyclotron Line Sources and Candidates
Authors:
K. Pottschmidt,
S. Suchy,
E. Rivers,
R. E. Rothschild,
D. M. Marcu,
L. Barragán,
M. Kühnel,
F. Fürst,
F. Schwarm,
I. Kreykenbohm,
J. Wilms,
G. Schönherr,
I. Caballero,
A. Camero-Arranz,
A. Bodaghee,
V. Doroshenko,
D. Klochkov,
A. Santangelo,
R. Staubert,
P. Kretschmar,
C. Wilson-Hodge,
M. H. Finger,
Y. Terada
Abstract:
Seventeen accreting neutron star pulsars, mostly high mass X-ray binaries with half of them Be-type transients, are known to exhibit Cyclotron Resonance Scattering Features (CRSFs) in their X-ray spectra, with characteristic line energies from 10 to 60 keV. To date about two thirds of them, plus a few similar systems without known CRSFs, have been observed with Suzaku. We present an overview of results from these observations, including the discovery of a CRSF in the transient 1A 1118-61 and pulse phase resolved spectroscopy of GX 301-2. These observations allow for the determination of cyclotron line parameters to an unprecedented degree of accuracy within a moderate amount of observing time. This is important since these parameters vary - e.g., with orbital phase, pulse phase, or luminosity - depending on the geometry of the magnetic field of the pulsar and the properties of the accretion column at the magnetic poles. We briefly introduce a spectral model for CRSFs that is currently being developed and that for the first time is based on these physical properties. In addition to cyclotron line measurements, selected highlights from the Suzaku analyses include dip and flare studies, e.g., of 4U 1907+09 and Vela X-1, which show clumpy wind effects (like partial absorption and/or a decrease in the mass accretion rate supplied by the wind) and may also display magnetospheric gating effects.
Submitted 7 November, 2011;
originally announced November 2011.
-
The 5 hr pulse period and broadband spectrum of the Symbiotic X-ray Binary 3A 1954+319
Authors:
Diana M. Marcu,
Felix Fuerst,
Katja Pottschmidt,
Victoria Grinberg,
Sebastian Mueller,
Joern Wilms,
Konstantin A. Postnov,
Robin H. D. Corbet,
Craig B. Markwardt,
Marion Cadolle Bel
Abstract:
We present an analysis of the highly variable accreting X-ray pulsar 3A 1954+319 using 2005-2009 monitoring data obtained with INTEGRAL and Swift. This considerably extends the pulse period history and covers flaring episodes in 2005 and 2008. In 2006 the source was identified as one of only a few known symbiotic X-ray binaries (SyXBs), i.e., systems composed of a neutron star accreting from the inhomogeneous medium around an M-giant star. The extremely long pulse period of 5.3 hr is directly visible in the 2008 INTEGRAL-ISGRI outburst light curve. The pulse profile is double peaked and generally not significantly energy dependent although there is an indication of possible softening during the main pulse. During the outburst a strong spin-up of -1.8 x 10^(-4) hr hr^(-1) occurred. Between 2005 and 2008 a long-term spin-down trend of 2.1 x 10^(-5) hr hr^(-1) was observed for the first time for this source. The 3-80 keV pulse peak spectrum of 3A 1954+319 during the 2008 flare could be well described by a thermal Comptonization model. We interpret the results within the framework of a recently developed quasi-spherical accretion model for SyXBs.
Submitted 3 November, 2011;
originally announced November 2011.
-
Domain Adaptation for Statistical Classifiers
Authors:
H. Daume III,
D. Marcu
Abstract:
The most basic assumption used in statistical learning theory is that training data and test data are drawn from the same underlying distribution. Unfortunately, in many applications, the "in-domain" test data is drawn from a distribution that is related, but not identical, to the "out-of-domain" distribution of the training data. We consider the common case in which labeled out-of-domain data is plentiful, but labeled in-domain data is scarce. We introduce a statistical formulation of this problem in terms of a simple mixture model and present an instantiation of this framework to maximum entropy classifiers and their linear chain counterparts. We present efficient inference algorithms for this special case based on the technique of conditional expectation maximization. Our experimental results show that our approach leads to improved performance on three real world tasks on four different data sets from the natural language processing domain.
Submitted 28 September, 2011;
originally announced September 2011.
-
The Be/X-ray binary A0535+26 during its recent 2009/2010 outbursts
Authors:
I. Caballero,
K. Pottschmidt,
A. Santangelo,
L. Barragan,
D. Klochkov,
C. Ferrigno,
J. Rodriguez,
P. Kretschmar,
S. Suchy,
D. M. Marcu,
D. Mueller,
J. Wilms,
I. Kreykenbohm,
R. E. Rothschild,
R. Staubert,
M. H. Finger,
A. Camero-Arranz,
K. Makishima,
T. Mihara,
M. Nakajima,
T. Enoto,
W. Iwakiri,
Y. Terada
Abstract:
The Be/X-ray binary A0535+26 showed a giant outburst in December 2009 that reached ~5.14 Crab in the 15-50 keV range. Unfortunately, due to Sun constraints it could not be observed by most X-ray satellites. The outburst was preceded by four weaker outbursts associated with the periastron passage of the neutron star. The fourth of them, in August 2009, presented a peculiar double-peaked light curve, with a first peak lasting about 9 days that reached a (15-50 keV) flux of 440 mCrab. The flux then decreased to less than 220 mCrab, and increased again reaching 440 mCrab around the periastron. The outburst was monitored with INTEGRAL, RXTE, and Suzaku TOO observations. One orbital period (~111 days) after the 2009 giant outburst, a new and unexpectedly bright outburst took place (~1.4 Crab in the 15-50 keV range). It was monitored with TOO observations with INTEGRAL, RXTE, Suzaku, and Swift. First results of the spectral and timing analysis of these observations are presented, with a specific focus on the cyclotron lines present in the system and their variation with the mass accretion rate.
Submitted 18 July, 2011;
originally announced July 2011.
-
Spinning-up: the case of the symbiotic X-ray binary 3A 1954+319
Authors:
F. Fürst,
D. M. Marcu,
K. Pottschmidt,
V. Grinberg,
J. Wilms,
M. Cadolle Bel
Abstract:
We present a timing and spectral analysis of the variable X-ray source 3A 1954+319. Our analysis is mainly based on an outburst serendipitously observed during INTEGRAL Key Program observations of the Cygnus region in fall 2008 and on the Swift/BAT long-term light curve. Previous observations, though sparse, have identified the source to be one of only nine known symbiotic X-ray binaries, i.e., systems composed of an accreting neutron star orbiting in a highly inhomogeneous medium around an M-giant companion. The spectrum of 3A 1954+319 above 20 keV can be best described by a broken power law model. The extremely long pulse period of ~5.3 hours is clearly visible in the INTEGRAL/ISGRI light curve and confirmed through an epoch folding period search. Furthermore, the light curve allows us to determine a very strong spin up of -2x10^-4 h/h during the outburst. This spin up is confirmed by the pulse period evolution calculated from Swift/BAT data. The Swift/BAT data also show a long spin-down trend prior to the 2008 outburst, which is confirmed in archival INTEGRAL/ISGRI data. We discuss possible accretion models and geometries allowing for the transfer of such large amounts of angular momentum and investigate the harder spectrum of this outburst compared to previously published results.
Submitted 14 June, 2011;
originally announced June 2011.
-
Learning as Search Optimization: Approximate Large Margin Methods for Structured Prediction
Authors:
Hal Daumé III,
Daniel Marcu
Abstract:
Mappings to structured output spaces (strings, trees, partitions, etc.) are typically learned using extensions of classification algorithms to simple graphical structures (e.g., linear chains) in which search and parameter estimation can be performed exactly. Unfortunately, in many complex problems, it is rare that exact search or parameter estimation is tractable. Instead of learning exact models and searching via heuristic means, we embrace this difficulty and treat the structured output problem in terms of approximate search. We present a framework for learning as search optimization, and two parameter updates with convergence theorems and bounds. Empirical evidence shows that our integrated approach to learning and decoding can outperform exact models at smaller computational cost.
Submitted 4 July, 2009;
originally announced July 2009.
-
A Bayesian Model for Supervised Clustering with the Dirichlet Process Prior
Authors:
Hal Daumé III,
Daniel Marcu
Abstract:
We develop a Bayesian framework for tackling the supervised clustering problem, the generic problem encountered in tasks such as reference matching, coreference resolution, identity uncertainty and record linkage. Our clustering model is based on the Dirichlet process prior, which enables us to define distributions over the countably infinite sets that naturally arise in this problem. We add supervision to our model by positing the existence of a set of unobserved random variables (we call these "reference types") that are generic across all clusters. Inference in our framework, which requires integrating over infinitely many parameters, is solved using Markov chain Monte Carlo techniques. We present algorithms for both conjugate and non-conjugate priors. We present a simple--but general--parameterization of our model based on a Gaussian assumption. We evaluate this model on one artificial task and three real-world tasks, comparing it against both unsupervised and state-of-the-art supervised algorithms. Our results show that our model is able to outperform other models across a variety of tasks and performance metrics.
Submitted 4 July, 2009;
originally announced July 2009.
-
A Large-Scale Exploration of Effective Global Features for a Joint Entity Detection and Tracking Model
Authors:
Hal Daumé III,
Daniel Marcu
Abstract:
Entity detection and tracking (EDT) is the task of identifying textual mentions of real-world entities in documents, extending the named entity detection and coreference resolution task by considering mentions other than names (pronouns, definite descriptions, etc.). Like NE tagging and coreference resolution, most solutions to the EDT task separate out the mention detection aspect from the coreference aspect. By doing so, these solutions are limited to using only local features for learning. In contrast, by modeling both aspects of the EDT task simultaneously, we are able to learn using highly complex, non-local features. We develop a new joint EDT model and explore the utility of many features, demonstrating their effectiveness on this task.
Submitted 4 July, 2009;
originally announced July 2009.
-
A Noisy-Channel Model for Document Compression
Authors:
Hal Daumé III,
Daniel Marcu
Abstract:
We present a document compression system that uses a hierarchical noisy-channel model of text production. Our compression system first automatically derives the syntactic structure of each sentence and the overall discourse structure of the text given as input. The system then uses a statistical hierarchical model of text production in order to drop non-important syntactic and discourse constituents so as to generate coherent, grammatical document compressions of arbitrary length. The system outperforms both a baseline and a sentence-based compression system that operates by simplifying sequentially all sentences in a text. Our results support the claim that discourse knowledge plays an important role in document summarization.
Submitted 4 July, 2009;
originally announced July 2009.
-
Induction of Word and Phrase Alignments for Automatic Document Summarization
Authors:
Hal Daumé III,
Daniel Marcu
Abstract:
Current research in automatic single document summarization is dominated by two effective, yet naive approaches: summarization by sentence extraction, and headline generation via bag-of-words models. While successful in some tasks, neither of these models is able to adequately capture the large set of linguistic devices utilized by humans when they produce summaries. One possible explanation for the widespread use of these models is that good techniques have been developed to extract appropriate training data for them from existing document/abstract and document/headline corpora. We believe that future progress in automatic summarization will be driven both by the development of more sophisticated, linguistically informed models and by more effective leveraging of document/abstract corpora. To make it possible to achieve both of these goals simultaneously, we have developed techniques for automatically producing word-to-word and phrase-to-phrase alignments between documents and their human-written abstracts. These alignments make explicit the correspondences that exist in such document/abstract pairs, and create a potentially rich data source from which complex summarization algorithms may learn. This paper describes experiments we have carried out to analyze the ability of humans to perform such alignments, and based on these analyses, we describe experiments for creating them automatically. Our model for the alignment task is based on an extension of the standard hidden Markov model, and learns to create alignments in a completely unsupervised fashion. We describe our model in detail and present experimental results that show that our model is able to learn to reliably identify word- and phrase-level alignments in a corpus of <document,abstract> pairs.
Submitted 4 July, 2009;
originally announced July 2009.
-
Search-based Structured Prediction
Authors:
Hal Daumé III,
John Langford,
Daniel Marcu
Abstract:
We present Searn, an algorithm for integrating search and learning to solve complex structured prediction problems such as those that occur in natural language, speech, computational biology, and vision. Searn is a meta-algorithm that transforms these complex problems into simple classification problems to which any binary classifier may be applied. Unlike current algorithms for structured learning that require decomposition of both the loss function and the feature functions over the predicted structure, Searn is able to learn prediction functions for any loss function and any class of features. Moreover, Searn comes with a strong, natural theoretical guarantee: good performance on the derived classification problems implies good performance on the structured prediction problem.
Submitted 4 July, 2009;
originally announced July 2009.
-
A Spitzer Spectroscopic Survey of Low Ionization Nuclear Emission-line Regions: Characterization of the Central Source
Authors:
R. P. Dudik,
S. Satyapal,
D. Marcu
Abstract:
We have conducted a comprehensive mid-IR spectroscopic investigation of 67 Low Ionization Nuclear Emission Line Regions (LINERs) using archival observations from the high resolution modules of the Infrared Spectrograph on board the Spitzer Space Telescope. Using the [NeV] 14 and 24um lines as active galactic nuclei (AGN) diagnostics, we detect active black holes in 39% of the galaxies in our sample, many of which show no signs of activity in either the optical or X-ray bands. In particular, a detailed comparison of multi-wavelength diagnostics shows that optical studies fail to detect AGN in galaxies with large far-IR luminosities. These observations emphasize that the nuclear power source in a large percentage of LINERs is obscured in the optical. Indeed, the majority of LINERs show mid-IR [NeV]14/[NeV]24um flux ratios well below the theoretical low-density limit, suggesting that there is substantial extinction toward even the [NeV]-emitting region. Combining optical, X-ray, and mid-IR diagnostics, we find an AGN detection rate in LINERs of 74%, higher than previously reported statistics of the fraction of LINERs hosting AGN. The [NeV]24um/[OIV]26um mid-IR line flux ratio in "AGN-LINERs" is similar to that of standard AGN, suggesting that the spectral energy distribution (SED) of the intrinsic optical/UV continuum is similar in the two. This result is in contrast to previous suggestions of a UV deficit in the intrinsic broadband continuum emission in AGN-LINERs. Consistent with our finding of extinction to the [NeV]-emitting region, we propose that extinction may also be responsible for the observed optical/UV deficit seen in at least some AGN-LINERs.
Submitted 8 November, 2008;
originally announced November 2008.
-
A Formalism and an Algorithm for Computing Pragmatic Inferences and Detecting Infelicities
Authors:
Daniel Marcu
Abstract:
Since Austin introduced the term "infelicity", the linguistic literature has been flooded with its use, but no formal or computational explanation has been given for it. This thesis provides one for those infelicities that occur when a pragmatic inference is cancelled.
Our contribution assumes the existence of a finer-grained taxonomy of pragmatic inferences. It is shown that if one wants to account for the expressiveness of natural language, one should distinguish between pragmatic inferences that are felicitous to defeat and pragmatic inferences that are infelicitously defeasible. Thus, it is shown that one should consider at least three types of information: indefeasible, felicitously defeasible, and infelicitously defeasible. The cancellation of the last of these determines the pragmatic infelicities.
A new formalism, called "stratified logic", has been devised to accommodate the three levels of information. Within it, we are able to express formally notions such as "utterance U presupposes P" or "utterance U is infelicitous". Special attention is paid to the implications that our work has in solving some well-known existential philosophical puzzles. The formalism yields an algorithm, implemented in Common Lisp, for computing interpretations for utterances, for determining their associated presuppositions, and for signalling infelicitous utterances. The algorithm applies equally to simple and complex utterances and sequences of utterances.
Submitted 26 April, 1995; v1 submitted 25 April, 1995;
originally announced April 1995.
-
An Implemented Formalism for Computing Linguistic Presuppositions and Existential Commitments
Authors:
Daniel Marcu,
Graeme Hirst
Abstract:
We rely on the strength of linguistic and philosophical perspectives in constructing a framework that offers a unified explanation for presuppositions and existential commitment. We use a rich ontology and a set of methodological principles that embed the essence of Meinong's philosophy and Grice's conversational principles into a stratified logic, under an unrestricted interpretation of the quantifiers. The result is a logical formalism that yields a tractable computational method that uniformly calculates all the presuppositions of a given utterance, including the existential ones.
Submitted 25 April, 1995;
originally announced April 1995.
-
A Uniform Treatment of Pragmatic Inferences in Simple and Complex Utterances and Sequences of Utterances
Authors:
Daniel Marcu,
Graeme Hirst
Abstract:
Drawing appropriate defeasible inferences has proven to be one of the most pervasive puzzles of natural language processing and a recurrent problem in pragmatics. This paper provides a theoretical framework, called "stratified logic", that can accommodate defeasible pragmatic inferences. The framework yields an algorithm that computes the conversational, conventional, scalar, clausal, and normal state implicatures, as well as the presuppositions that are associated with utterances. The algorithm applies equally to simple and complex utterances and sequences of utterances.
Submitted 25 April, 1995;
originally announced April 1995.