Search | arXiv e-print repository

From RAG to RICHES: Retrieval Interlaced with Sequence Generation

Authors: Palak Jain, Livio Baldini Soares, Tom Kwiatkowski

Abstract: We present RICHES, a novel approach that interleaves retrieval with sequence generation tasks. RICHES offers an alternative to conventional RAG systems by eliminating the need for separate retriever and generator. It retrieves documents by directly decoding their contents, constrained on the corpus. Unifying retrieval with generation allows us to adapt to diverse new tasks via prompting alone. RIC… ▽ More We present RICHES, a novel approach that interleaves retrieval with sequence generation tasks. RICHES offers an alternative to conventional RAG systems by eliminating the need for separate retriever and generator. It retrieves documents by directly decoding their contents, constrained on the corpus. Unifying retrieval with generation allows us to adapt to diverse new tasks via prompting alone. RICHES can work with any Instruction-tuned model, without additional training. It provides attributed evidence, supports multi-hop retrievals and interleaves thoughts to plan on what to retrieve next, all within a single decoding pass of the LLM. We demonstrate the strong performance of RICHES across ODQA tasks including attributed and multi-hop QA. △ Less

Submitted 29 June, 2024; originally announced July 2024.

Comments: 18 pages, 3 figures, Preprint

arXiv:2403.05530 [pdf, other]

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content. △ Less

Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

arXiv:2312.11805 [pdf, other]

Gemini: A Family of Highly Capable Multimodal Models

Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI. △ Less

Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2310.16568 [pdf, other]

1-PAGER: One Pass Answer Generation and Evidence Retrieval

Authors: Palak Jain, Livio Baldini Soares, Tom Kwiatkowski

Abstract: We present 1-Pager the first system that answers a question and retrieves evidence using a single Transformer-based model and decoding process. 1-Pager incrementally partitions the retrieval corpus using constrained decoding to select a document and answer string, and we show that this is competitive with comparable retrieve-and-read alternatives according to both retrieval and answer accuracy met… ▽ More We present 1-Pager the first system that answers a question and retrieves evidence using a single Transformer-based model and decoding process. 1-Pager incrementally partitions the retrieval corpus using constrained decoding to select a document and answer string, and we show that this is competitive with comparable retrieve-and-read alternatives according to both retrieval and answer accuracy metrics. 1-Pager also outperforms the equivalent closed-book question answering model, by grounding predictions in an evidence corpus. While 1-Pager is not yet on-par with more expensive systems that read many more documents before generating an answer, we argue that it provides an important step toward attributed generation by folding retrieval into the sequence-to-sequence paradigm that is currently dominant in NLP. We also show that the search paths used to partition the corpus are easy to read and understand, paving a way forward for interpretable neural retrieval. △ Less

Submitted 25 October, 2023; originally announced October 2023.

Comments: Accepted at EMNLP 2023 (Findings)

arXiv:2305.14499 [pdf, other]

NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders

Authors: Livio Baldini Soares, Daniel Gillick, Jeremy R. Cole, Tom Kwiatkowski

Abstract: Neural document rerankers are extremely effective in terms of accuracy. However, the best models require dedicated hardware for serving, which is costly and often not feasible. To avoid this serving-time requirement, we present a method of capturing up to 86% of the gains of a Transformer cross-attention model with a lexicalized scoring function that only requires 10-6% of the Transformer's FLOPs… ▽ More Neural document rerankers are extremely effective in terms of accuracy. However, the best models require dedicated hardware for serving, which is costly and often not feasible. To avoid this serving-time requirement, we present a method of capturing up to 86% of the gains of a Transformer cross-attention model with a lexicalized scoring function that only requires 10-6% of the Transformer's FLOPs per document and can be served using commodity CPUs. When combined with a BM25 retriever, this approach matches the quality of a state-of-the art dual encoder retriever, that still requires an accelerator for query encoding. We introduce NAIL (Non-Autoregressive Indexing with Language models) as a model architecture that is compatible with recent encoder-decoder and decoder-only large language models, such as T5, GPT-3 and PaLM. This model architecture can leverage existing pre-trained checkpoints and can be fine-tuned for efficiently constructing document representations that do not require neural processing of queries. △ Less

Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: To appear at EMNLP 2023

arXiv:2305.14332 [pdf, other]

Evaluating and Modeling Attribution for Cross-Lingual Question Answering

Authors: Benjamin Muller, John Wieting, Jonathan H. Clark, Tom Kwiatkowski, Sebastian Ruder, Livio Baldini Soares, Roee Aharoni, Jonathan Herzig, Xinyi Wang

Abstract: Trustworthy answer content is abundant in many high-resource languages and is instantly accessible through question answering systems, yet this content can be hard to access for those that do not speak these languages. The leap forward in cross-lingual modeling quality offered by generative language models offers much promise, yet their raw generations often fall short in factuality. To improve tr… ▽ More Trustworthy answer content is abundant in many high-resource languages and is instantly accessible through question answering systems, yet this content can be hard to access for those that do not speak these languages. The leap forward in cross-lingual modeling quality offered by generative language models offers much promise, yet their raw generations often fall short in factuality. To improve trustworthiness in these systems, a promising direction is to attribute the answer to a retrieved source, possibly in a content-rich language different from the query. Our work is the first to study attribution for cross-lingual question answering. First, we collect data in 5 languages to assess the attribution level of a state-of-the-art cross-lingual QA system. To our surprise, we find that a substantial portion of the answers is not attributable to any retrieved passages (up to 50% of answers exactly matching a gold reference) despite the system being able to attend directly to the retrieved text. Second, to address this poor attribution level, we experiment with a wide range of attribution detection techniques. We find that Natural Language Inference models and PaLM 2 fine-tuned on a very small amount of attribution data can accurately detect attribution. Based on these models, we improve the attribution level of a cross-lingual question-answering system. Overall, we show that current academic generative cross-lingual QA systems have substantial shortcomings in attribution and we build tooling to mitigate these issues. △ Less

Submitted 15 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: Published as a long paper at EMNLP 2023

arXiv:2212.08037 [pdf, other]

Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

Authors: Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor, Livio Baldini Soares, Massimiliano Ciaramita, Jacob Eisenstein, Kuzman Ganchev, Jonathan Herzig, Kai Hui, Tom Kwiatkowski, Ji Ma, Jianmo Ni, Lierni Sestorain Saralegui, Tal Schuster, William W. Cohen, Michael Collins, Dipanjan Das, Donald Metzler, Slav Petrov, Kellie Webster

Abstract: Large language models (LLMs) have shown impressive results while requiring little or no direct supervision. Further, there is mounting evidence that LLMs may have potential in information-seeking scenarios. We believe the ability of an LLM to attribute the text that it generates is likely to be crucial in this setting. We formulate and study Attributed QA as a key first step in the development of… ▽ More Large language models (LLMs) have shown impressive results while requiring little or no direct supervision. Further, there is mounting evidence that LLMs may have potential in information-seeking scenarios. We believe the ability of an LLM to attribute the text that it generates is likely to be crucial in this setting. We formulate and study Attributed QA as a key first step in the development of attributed LLMs. We propose a reproducible evaluation framework for the task and benchmark a broad set of architectures. We take human annotations as a gold standard and show that a correlated automatic metric is suitable for development. Our experimental work gives concrete answers to two key questions (How to measure attribution?, and How well do current state-of-the-art methods perform on attribution?), and give some hints as to how to address a third (How to build LLMs with attribution?). △ Less

Submitted 10 February, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

arXiv:2205.11337 [pdf]

doi 10.1093/mnras/stac1008

Asteroid phase curves using sparse Gaia DR2 data and differential dense light curves

Authors: E. Wilawer, D. Oszkiewicz, A. Kryszczyńska, A. Marciniak, V. Shevchenko, I. Belskaya, T. Kwiatkowski, P. Kankiewicz, J. Horbowicz, V. Kudak, P. Kulczak, V. Perig, K. Sobkowiak

Abstract: The amount of sparse asteroid photometry being gathered by both space- and ground-based surveys is growing exponentially. This large volume of data poses a computational challenge owing to both the large amount of information to be processed and the new methods needed to combine data from different sources (e.g. obtained by different techniques, in different bands, and having different random and… ▽ More The amount of sparse asteroid photometry being gathered by both space- and ground-based surveys is growing exponentially. This large volume of data poses a computational challenge owing to both the large amount of information to be processed and the new methods needed to combine data from different sources (e.g. obtained by different techniques, in different bands, and having different random and systematic errors). The main goal of this work is to develop an algorithm capable of merging sparse and dense data sets, both relative and differential, in preparation for asteroid observations originating from, for example, Gaia, TESS, ATLAS, LSST, K2, VISTA, and many other sources. We present a novel method to obtain asteroid phase curves by combining sparse photometry and differential ground-based photometry. In the traditional approach, the latter cannot be used for phase curves. Merging those two data types allows for the extraction of phase-curve information for a growing number of objects. Our method is validated for 26 sample asteroids observed by the Gaia mission. △ Less

Submitted 23 May, 2022; originally announced May 2022.

Comments: 10 pages, 7 figures (supplementary material: 20 pages, 58 figures)

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 513, Issue 3, July 2022, Pages 3242-3251

arXiv:2109.11689 [pdf, other]

doi 10.1051/0004-6361/202142013

Photometry and model of near-Earth asteroid 2021 DW1 from one apparition

Authors: T. Kwiatkowski, P. Koleńczuk, A. Kryszczyńska, D. Oszkiewicz, K. Kamiński, M. K. Kamińska, V. Troianskyi, B. Skiff, N. Moskowitz, V. Kashuba, M. -J. Kim, T. Kim, S. Mottola, T. Santana-Ros, T. Kluwak, L. Buzzi, P. Bacci, P. Birtwhistle, R. Miles, J. Chatelain

Abstract: On 4 March 2021 at 9 UTC a 30-m in diameter near-Earth asteroid 2021 DW1 passed the Earth at a distance of 570000 km, reaching the maximum brightness of V=14.6 mag. We observed it photometrically from 2 March, when it was visible at V=16.5 mag, until 7 March (V=18.2 mag). During that time 2021 DW1 swept a 170 degrees long arc in the northern sky, spanning solar phase angles in the range from 36 to… ▽ More On 4 March 2021 at 9 UTC a 30-m in diameter near-Earth asteroid 2021 DW1 passed the Earth at a distance of 570000 km, reaching the maximum brightness of V=14.6 mag. We observed it photometrically from 2 March, when it was visible at V=16.5 mag, until 7 March (V=18.2 mag). During that time 2021 DW1 swept a 170 degrees long arc in the northern sky, spanning solar phase angles in the range from 36 to 86 degrees. This made it an excellent target for physical characterisation, including spin axis and shape derivation. Convex inversion of the asteroid lightcurves gives a sidereal period of rotation P=0.013760 +/- 0.000001 h, and two solutions for the spin axis ecliptic coordinates: (A) lambda_1=57 +/- 10, beta_1=29 +/- 10, and (B) lambda_2=67 +/- 10, beta_2=-40 +/- 10. The magnitude-phase curve can be fitted with a standard H, G function with H=24.8 +/- 0.5 mag and an assumed G=0.24. The asteroid colour indices are g-i=0.79 +/- 0.01 mag, and i-z=0.01 +/- 0.02 mag which indicates an S taxonomic class, with an average geometric albedo p_V=0.23 +/- 0.02. The asteroid effective diameter, derived from H and p_V, is D=30 +/- 10 m. It was found that the inclination of the spin axis of 2021 DW1 is not perpendicular to the orbital plane (obliquity epsilon=54 +/- 10 or epsilon=123 +/- 10). More spin axes of VSAs should be determined to check, if 2021 DW1 is an exception or a typical case. △ Less

Submitted 23 September, 2021; originally announced September 2021.

Comments: 9 pages, 9 figures, submitted to A&A (version after revision)

Journal ref: A&A 656, A126 (2021)

arXiv:2106.07352 [pdf, other]

MOLEMAN: Mention-Only Linking of Entities with a Mention Annotation Network

Authors: Nicholas FitzGerald, Jan A. Botha, Daniel Gillick, Daniel M. Bikel, Tom Kwiatkowski, Andrew McCallum

Abstract: We present an instance-based nearest neighbor approach to entity linking. In contrast to most prior entity retrieval systems which represent each entity with a single vector, we build a contextualized mention-encoder that learns to place similar mentions of the same entity closer in vector space than mentions of different entities. This approach allows all mentions of an entity to serve as "class… ▽ More We present an instance-based nearest neighbor approach to entity linking. In contrast to most prior entity retrieval systems which represent each entity with a single vector, we build a contextualized mention-encoder that learns to place similar mentions of the same entity closer in vector space than mentions of different entities. This approach allows all mentions of an entity to serve as "class prototypes" as inference involves retrieving from the full set of labeled entity mentions in the training set and applying the nearest mention neighbor's entity label. Our model is trained on a large multilingual corpus of mention pairs derived from Wikipedia hyperlinks, and performs nearest neighbor inference on an index of 700 million mentions. It is simpler to train, gives more interpretable predictions, and outperforms all other systems on two multilingual entity linking benchmarks. △ Less

Submitted 22 July, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

Comments: Accepted to ACL 2021, edit to add missing Turkish results in Tables 2 and 7

arXiv:2105.09763 [pdf]

doi 10.1016/j.pss.2021.105248

Photometry of selected outer main belt asteroids

Authors: V. Shevchenko, O. Mikhalchenko, I. Belskaya, I. Slyusarev, V. Chiorny, Yu. Krugly, T. Hromakina, A. Dovgopol, N. Kiselev, A. Rublevsky, K. Antonyuk, A. Novichonok, A. Kusakin, I. Reva, R. Inasaridze, V. Ayvazian, G. Kapanadze, I. Molotov, D. Oszkiewicz, T. Kwiatkowski

Abstract: We present new photometric observations for twelve asteroids ((122) Gerda, (152) Atala, (260) Huberta, (665) Sabine, (692) Hippodamia, (723) Hammonia, (745) Mauritia, (768) Struveana, (863) Benkoela, (1113) Katja, (1175) Margo, (2057) Rosemary) from the outer part of the main belt aimed to obtain the magnitude-phase curves and to verify geometric albedo and taxonomic class based on their magnitude… ▽ More We present new photometric observations for twelve asteroids ((122) Gerda, (152) Atala, (260) Huberta, (665) Sabine, (692) Hippodamia, (723) Hammonia, (745) Mauritia, (768) Struveana, (863) Benkoela, (1113) Katja, (1175) Margo, (2057) Rosemary) from the outer part of the main belt aimed to obtain the magnitude-phase curves and to verify geometric albedo and taxonomic class based on their magnitude-phase behaviors. The measured magnitude-phase relations confirm previously determined composition types of (260) Huberta (C-type), (692) Hippodamia (S-type) and (1175) Margo (S-type). Asteroids (665) Sabine and (768) Struveana previously classified as X-type show phase-curve behavior typical for moderate-albedo asteroids and may belong to the M-type. The phase-curve of (723) Hammonia is typical for the S-type which contradicts the previously determined C-type. We confirmed the moderate-albedo of asteroids (122) Gerda and (152) Atala, but their phase-curves are different from typical for the S-type and may indicate more rare compositional types. Based on magnitude-phase behaviors and V-R colors, (2057) Rosemary most probably belongs to M-type, while asteroids (745) Mauritia and (1113) Katja belong to S-complex. The phase curve of the A-type asteroid (863) Benkoela does not cover the opposition effect range and further observations are needed to understand typical features of the phase-curves of A-type asteroids in comparison with other types. We have also determined lightcurve amplitudes of the observed asteroids and obtained new or improved values of the rotation periods for most of them. △ Less

Submitted 20 May, 2021; originally announced May 2021.

Comments: 16 pages

arXiv:2102.05169 [pdf, other]

Decontextualization: Making Sentences Stand-Alone

Authors: Eunsol Choi, Jennimaria Palomaki, Matthew Lamm, Tom Kwiatkowski, Dipanjan Das, Michael Collins

Abstract: Models for question answering, dialogue agents, and summarization often interpret the meaning of a sentence in a rich context and use that meaning in a new context. Taking excerpts of text can be problematic, as key pieces may not be explicit in a local window. We isolate and define the problem of sentence decontextualization: taking a sentence together with its context and rewriting it to be inte… ▽ More Models for question answering, dialogue agents, and summarization often interpret the meaning of a sentence in a rich context and use that meaning in a new context. Taking excerpts of text can be problematic, as key pieces may not be explicit in a local window. We isolate and define the problem of sentence decontextualization: taking a sentence together with its context and rewriting it to be interpretable out of context, while preserving its meaning. We describe an annotation procedure, collect data on the Wikipedia corpus, and use the data to train models to automatically decontextualize sentences. We present preliminary studies that show the value of sentence decontextualization in a user facing task, and as preprocessing for systems that perform document understanding. We argue that decontextualization is an important subtask in many downstream applications, and that the definitions and resources provided can benefit tasks that operate on sentences that occur in a richer context. △ Less

Submitted 9 February, 2021; originally announced February 2021.

Comments: To appear in Transactions of the Association for Computational Linguistics (TACL)

arXiv:2101.00133 [pdf, other]

NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

Authors: Sewon Min, Jordan Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini , et al. (28 additional authors not shown)

Abstract: We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain question answering (QA), where systems take natural language questions as input and return natural language answers. The aim of the competition was to build systems that can predict correct answers while also satisfying strict on-disk memory budgets. These memory budgets were designed to encourage conte… ▽ More We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain question answering (QA), where systems take natural language questions as input and return natural language answers. The aim of the competition was to build systems that can predict correct answers while also satisfying strict on-disk memory budgets. These memory budgets were designed to encourage contestants to explore the trade-off between storing retrieval corpora or the parameters of learned models. In this report, we describe the motivation and organization of the competition, review the best submissions, and analyze system predictions to inform a discussion of evaluation for open-domain QA. △ Less

Submitted 19 September, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

Comments: 26 pages; Published in Proceedings of Machine Learning Research (PMLR), NeurIPS 2020 Competition and Demonstration Track

arXiv:2005.14253 [pdf, ps, other]

Empirical Evaluation of Pretraining Strategies for Supervised Entity Linking

Authors: Thibault Févry, Nicholas FitzGerald, Livio Baldini Soares, Tom Kwiatkowski

Abstract: In this work, we present an entity linking model which combines a Transformer architecture with large scale pretraining from Wikipedia links. Our model achieves the state-of-the-art on two commonly used entity linking datasets: 96.7% on CoNLL and 94.9% on TAC-KBP. We present detailed analyses to understand what design choices are important for entity linking, including choices of negative entity c… ▽ More In this work, we present an entity linking model which combines a Transformer architecture with large scale pretraining from Wikipedia links. Our model achieves the state-of-the-art on two commonly used entity linking datasets: 96.7% on CoNLL and 94.9% on TAC-KBP. We present detailed analyses to understand what design choices are important for entity linking, including choices of negative entity candidates, Transformer architecture, and input perturbations. Lastly, we present promising results on more challenging settings such as end-to-end entity linking and entity linking without in-domain training data. △ Less

Submitted 28 May, 2020; originally announced May 2020.

Comments: 11 pages, 8 figures, appearing at AKBC 2020

arXiv:2004.07202 [pdf, other]

Entities as Experts: Sparse Memory Access with Entity Supervision

Authors: Thibault Févry, Livio Baldini Soares, Nicholas FitzGerald, Eunsol Choi, Tom Kwiatkowski

Abstract: We focus on the problem of capturing declarative knowledge about entities in the learned parameters of a language model. We introduce a new model - Entities as Experts (EAE) - that can access distinct memories of the entities mentioned in a piece of text. Unlike previous efforts to integrate entity knowledge into sequence models, EAE's entity representations are learned directly from text. We show… ▽ More We focus on the problem of capturing declarative knowledge about entities in the learned parameters of a language model. We introduce a new model - Entities as Experts (EAE) - that can access distinct memories of the entities mentioned in a piece of text. Unlike previous efforts to integrate entity knowledge into sequence models, EAE's entity representations are learned directly from text. We show that EAE's learned representations capture sufficient knowledge to answer TriviaQA questions such as "Which Dr. Who villain has been played by Roger Delgado, Anthony Ainley, Eric Roberts?", outperforming an encoder-generator Transformer model with 10x the parameters. According to the LAMA knowledge probes, EAE contains more factual knowledge than a similarly sized BERT, as well as previous approaches that integrate external sources of entity knowledge. Because EAE associates parameters with specific entities, it only needs to access a fraction of its parameters at inference time, and we show that the correct identification and representation of entities is essential to EAE's performance. △ Less

Submitted 6 October, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

arXiv:2003.05002 [pdf]

TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages

Authors: Jonathan H. Clark, Eunsol Choi, Michael Collins, Dan Garrette, Tom Kwiatkowski, Vitaly Nikolaev, Jennimaria Palomaki

Abstract: Confidently making progress on multilingual modeling requires challenging, trustworthy evaluations. We present TyDi QA---a question answering dataset covering 11 typologically diverse languages with 204K question-answer pairs. The languages of TyDi QA are diverse with regard to their typology---the set of linguistic features each language expresses---such that we expect models performing well on t… ▽ More Confidently making progress on multilingual modeling requires challenging, trustworthy evaluations. We present TyDi QA---a question answering dataset covering 11 typologically diverse languages with 204K question-answer pairs. The languages of TyDi QA are diverse with regard to their typology---the set of linguistic features each language expresses---such that we expect models performing well on this set to generalize across a large number of the world's languages. We present a quantitative analysis of the data quality and example-level qualitative linguistic analyses of observed language phenomena that would not be found in English-only corpora. To provide a realistic information-seeking task and avoid priming effects, questions are written by people who want to know the answer, but don't know the answer yet, and the data is collected directly in each language without the use of translation. △ Less

Submitted 10 March, 2020; originally announced March 2020.

Comments: To appear in Transactions of the Association for Computational Linguistics (TACL) 2020. Please use this as the citation

arXiv:2001.03765 [pdf, other]

Learning Cross-Context Entity Representations from Text

Authors: Jeffrey Ling, Nicholas FitzGerald, Zifei Shan, Livio Baldini Soares, Thibault Févry, David Weiss, Tom Kwiatkowski

Abstract: Language modeling tasks, in which words, or word-pieces, are predicted on the basis of a local context, have been very effective for learning word embeddings and context dependent representations of phrases. Motivated by the observation that efforts to code world knowledge into machine readable knowledge bases or human readable encyclopedias tend to be entity-centric, we investigate the use of a f… ▽ More Language modeling tasks, in which words, or word-pieces, are predicted on the basis of a local context, have been very effective for learning word embeddings and context dependent representations of phrases. Motivated by the observation that efforts to code world knowledge into machine readable knowledge bases or human readable encyclopedias tend to be entity-centric, we investigate the use of a fill-in-the-blank task to learn context independent representations of entities from the text contexts in which those entities were mentioned. We show that large scale training of neural models allows us to learn high quality entity representations, and we demonstrate successful results on four domains: (1) existing entity-level ty** benchmarks, including a 64% error reduction over previous work on TypeNet (Murty et al., 2018); (2) a novel few-shot category reconstruction task; (3) existing entity linking benchmarks, where we match the state-of-the-art on CoNLL-Aida without linking-specific features and obtain a score of 89.8% on TAC-KBP 2010 without using any alias table, external knowledge base or in domain training data and (4) answering trivia questions, which uniquely identify entities. Our global entity representations encode fine-grained type categories, such as Scottish footballers, and can answer trivia questions such as: Who was the last inmate of Spandau jail in Berlin? △ Less

Submitted 11 January, 2020; originally announced January 2020.

arXiv:1906.05807 [pdf, other]

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index

Authors: Minjoon Seo, **hyuk Lee, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, Hannaneh Hajishirzi

Abstract: Existing open-domain question answering (QA) models are not suitable for real-time usage because they need to process several long documents on-demand for every input query. In this paper, we introduce the query-agnostic indexable representation of document phrases that can drastically speed up open-domain QA and also allows us to reach long-tail targets. In particular, our dense-sparse phrase enc… ▽ More Existing open-domain question answering (QA) models are not suitable for real-time usage because they need to process several long documents on-demand for every input query. In this paper, we introduce the query-agnostic indexable representation of document phrases that can drastically speed up open-domain QA and also allows us to reach long-tail targets. In particular, our dense-sparse phrase encoding effectively captures syntactic, semantic, and lexical information of the phrases and eliminates the pipeline filtering of context documents. Leveraging optimization strategies, our model can be trained in a single 4-GPU server and serve entire Wikipedia (up to 60 billion phrases) under 2TB with CPUs only. Our experiments on SQuAD-Open show that our model is more accurate than DrQA (Chen et al., 2017) with 6000x reduced computational cost, which translates into at least 58x faster end-to-end inference benchmark on CPUs. △ Less

Submitted 14 June, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

Comments: ACL 2019; Code & demo available at https://nlp.cs.washington.edu/denspi/ ; Added comparison to Weaver (Raison et al., 2018)

arXiv:1906.03158 [pdf, other]

Matching the Blanks: Distributional Similarity for Relation Learning

Authors: Livio Baldini Soares, Nicholas FitzGerald, Jeffrey Ling, Tom Kwiatkowski

Abstract: General purpose relation extractors, which can model arbitrary relations, are a core aspiration in information extraction. Efforts have been made to build general purpose extractors that represent relations with their surface forms, or which jointly embed surface forms with relations from an existing knowledge graph. However, both of these approaches are limited in their ability to generalize. In… ▽ More General purpose relation extractors, which can model arbitrary relations, are a core aspiration in information extraction. Efforts have been made to build general purpose extractors that represent relations with their surface forms, or which jointly embed surface forms with relations from an existing knowledge graph. However, both of these approaches are limited in their ability to generalize. In this paper, we build on extensions of Harris' distributional hypothesis to relations, as well as recent advances in learning text representations (specifically, BERT), to build task agnostic relation representations solely from entity-linked text. We show that these representations significantly outperform previous work on exemplar based relation extraction (FewRel) even without using any of that task's training data. We also show that models initialized with our task agnostic representations, and then tuned on supervised relation extraction datasets, significantly outperform the previous methods on SemEval 2010 Task 8, KBP37, and TACRED. △ Less

Submitted 7 June, 2019; originally announced June 2019.

Comments: To appear at ACL 2019

arXiv:1905.10044 [pdf, ps, other]

BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions

Authors: Christopher Clark, Kenton Lee, Ming-Wei Chang, Tom Kwiatkowski, Michael Collins, Kristina Toutanova

Abstract: In this paper we study yes/no questions that are naturally occurring --- meaning that they are generated in unprompted and unconstrained settings. We build a reading comprehension dataset, BoolQ, of such questions, and show that they are unexpectedly challenging. They often query for complex, non-factoid information, and require difficult entailment-like inference to solve. We also explore the eff… ▽ More In this paper we study yes/no questions that are naturally occurring --- meaning that they are generated in unprompted and unconstrained settings. We build a reading comprehension dataset, BoolQ, of such questions, and show that they are unexpectedly challenging. They often query for complex, non-factoid information, and require difficult entailment-like inference to solve. We also explore the effectiveness of a range of transfer learning baselines. We find that transferring from entailment data is more effective than transferring from paraphrase or extractive QA data, and that it, surprisingly, continues to be very beneficial even when starting from massive pre-trained language models such as BERT. Our best method trains BERT on MultiNLI and then re-trains it on our train set. It achieves 80.4% accuracy compared to 90% accuracy of human annotators (and 62% majority-baseline), leaving a significant gap for future work. △ Less

Submitted 24 May, 2019; originally announced May 2019.

Comments: In NAACL 2019

arXiv:1901.05500 [pdf, other]

doi 10.1038/s41586-018-0826-3

Signatures of a jet cocoon in early spectra of a supernova associated with a $γ$-ray burst

Authors: L. Izzo, A. de Ugarte Postigo, K. Maeda, C. C. Thöne, D. A. Kann, M. Della Valle, A. Sagues Carracedo, M. J. Michałowski, P. Schady, S. Schmidl, J. Selsing, R. L. C. Starling, A. Suzuki, K. Bensch, J. Bolmer, S. Campana, Z. Cano, S. Covino, J. P. U. Fynbo, D. H. Hartmann, K. E. Heintz, J. Hjorth, J. Japelj, K. Kamiński, L. Kaper , et al. (17 additional authors not shown)

Abstract: Long gamma-ray bursts mark the death of massive stars, as revealed by their association with energetic broad-lined stripped-envelope supernovae. The scarcity of nearby events and the brightness of the GRB afterglow, dominating the first days of emission, have so far prevented the study of the very early stages of the GRB-SN evolution. Here we present detailed, multi-epoch spectroscopic observation… ▽ More Long gamma-ray bursts mark the death of massive stars, as revealed by their association with energetic broad-lined stripped-envelope supernovae. The scarcity of nearby events and the brightness of the GRB afterglow, dominating the first days of emission, have so far prevented the study of the very early stages of the GRB-SN evolution. Here we present detailed, multi-epoch spectroscopic observations of SN 2017iuk, associated with GRB 171205A which display features at extremely high expansion velocities of $\sim$ 100,000 km s$^{-1}$ within the first day after the burst. These high-velocity components are characterized by chemical abundances different from those observed in the ejecta of SN 2017iuk at later times. Using spectral synthesis models developed for the SN 2017iuk, we explain these early features as originating not from the supernova ejecta, but from a hot cocoon generated by the energy injection of a mildly-relativistic GRB jet expanding into the medium surrounding the progenitor star. This cocoon becomes rapidly transparent and is outshone by the supernova emission which starts dominating three days after the burst. These results proves that the jet plays an important role not only in powering the GRB event but also its associated supernova. △ Less

Submitted 16 January, 2019; originally announced January 2019.

Comments: 30 pages, 11 figures, 4 tables. Original author manuscript version of a Letter published in Nature journal. Full article available at https://goo.gl/7y9ZeM

arXiv:1901.04936 [pdf, other]

Incremental Reading for Question Answering

Authors: Samira Abnar, Tania Bedrax-weiss, Tom Kwiatkowski, William W. Cohen

Abstract: Any system which performs goal-directed continual learning must not only learn incrementally but process and absorb information incrementally. Such a system also has to understand when its goals have been achieved. In this paper, we consider these issues in the context of question answering. Current state-of-the-art question answering models reason over an entire passage, not incrementally. As we… ▽ More Any system which performs goal-directed continual learning must not only learn incrementally but process and absorb information incrementally. Such a system also has to understand when its goals have been achieved. In this paper, we consider these issues in the context of question answering. Current state-of-the-art question answering models reason over an entire passage, not incrementally. As we will show, naive approaches to incremental reading, such as restriction to unidirectional language models in the model, perform poorly. We present extensions to the DocQA [2] model to allow incremental reading without loss of accuracy. The model also jointly learns to provide the best answer given the text that is seen so far and predict whether this best-so-far answer is sufficient. △ Less

Submitted 15 January, 2019; originally announced January 2019.

arXiv:1804.07726 [pdf, other]

Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension

Authors: Minjoon Seo, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, Hannaneh Hajishirzi

Abstract: We formalize a new modular variant of current question answering tasks by enforcing complete independence of the document encoder from the question encoder. This formulation addresses a key challenge in machine comprehension by requiring a standalone representation of the document discourse. It additionally leads to a significant scalability advantage since the encoding of the answer candidate phr… ▽ More We formalize a new modular variant of current question answering tasks by enforcing complete independence of the document encoder from the question encoder. This formulation addresses a key challenge in machine comprehension by requiring a standalone representation of the document discourse. It additionally leads to a significant scalability advantage since the encoding of the answer candidate phrases in the document can be pre-computed and indexed offline for efficient retrieval. We experiment with baseline models for the new task, which achieve a reasonable accuracy but significantly underperform unconstrained QA models. We invite the QA research community to engage in Phrase-Indexed Question Answering (PIQA, pika) for closing the gap. The leaderboard is at: nlp.cs.washington.edu/piqa △ Less

Submitted 26 September, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

Comments: EMNLP 2018 short; 6 pages

arXiv:1711.00894 [pdf, other]

Multi-Mention Learning for Reading Comprehension with Neural Cascades

Authors: Swabha Swayamdipta, Ankur P. Parikh, Tom Kwiatkowski

Abstract: Reading comprehension is a challenging task, especially when executed across longer or across multiple evidence documents, where the answer is likely to reoccur. Existing neural architectures typically do not scale to the entire evidence, and hence, resort to selecting a single passage in the document (either via truncation or other means), and carefully searching for the answer within that passag… ▽ More Reading comprehension is a challenging task, especially when executed across longer or across multiple evidence documents, where the answer is likely to reoccur. Existing neural architectures typically do not scale to the entire evidence, and hence, resort to selecting a single passage in the document (either via truncation or other means), and carefully searching for the answer within that passage. However, in some cases, this strategy can be suboptimal, since by focusing on a specific passage, it becomes difficult to leverage multiple mentions of the same answer throughout the document. In this work, we take a different approach by constructing lightweight models that are combined in a cascade to find the answer. Each submodel consists only of feed-forward networks equipped with an attention mechanism, making it trivially parallelizable. We show that our approach can scale to approximately an order of magnitude larger evidence documents and can aggregate information at the representation level from multiple mentions of each answer candidate across the document. Empirically, our approach achieves state-of-the-art performance on both the Wikipedia and web domains of the TriviaQA dataset, outperforming more complex, recurrent architectures. △ Less

Submitted 30 May, 2018; v1 submitted 2 November, 2017; originally announced November 2017.

Comments: Proceedings of ICLR 2018

arXiv:1707.07503 [pdf, ps, other]

doi 10.1051/0004-6361/201630104

Shape and spin determination of Barbarian asteroids

Authors: M. Devogèle, P. Tanga, P. Bendjoya, J. P. Rivet, J. Surdej, J. Hanus, L. Abe, P. Antonini, R. A. Artola, M. Audejean, R. Behrend, F. Berski, J. G. Bosch, M. Bronikowska, A. Carbognani, F. Char, M. -J. Kim, Y. -J. Choi, C. A. Colazo, J. Coloma, D. Coward, R. Durkee, O. Erece, E. Forne, P. Hickson , et al. (29 additional authors not shown)

Abstract: Context. The so-called Barbarian asteroids share peculiar, but common polarimetric properties, probably related to both their shape and composition. They are named after (234) Barbara, the first on which such properties were identified. As has been suggested, large scale topographic features could play a role in the polarimetric response, if the shapes of Barbarians are particularly irregular and… ▽ More Context. The so-called Barbarian asteroids share peculiar, but common polarimetric properties, probably related to both their shape and composition. They are named after (234) Barbara, the first on which such properties were identified. As has been suggested, large scale topographic features could play a role in the polarimetric response, if the shapes of Barbarians are particularly irregular and present a variety of scattering/incidence angles. This idea is supported by the shape of (234) Barbara, that appears to be deeply excavated by wide concave areas revealed by photometry and stellar occultations. Aims. With these motivations, we started an observation campaign to characterise the shape and rotation properties of Small Main- Belt Asteroid Spectroscopic Survey (SMASS) type L and Ld asteroids. As many of them show long rotation periods, we activated a worldwide network of observers to obtain a dense temporal coverage. Methods. We used light-curve inversion technique in order to determine the sidereal rotation periods of 15 asteroids and the con- vergence to a stable shape and pole coordinates for 8 of them. By using available data from occultations, we are able to scale some shapes to an absolute size. We also study the rotation periods of our sample looking for confirmation of the suspected abundance of asteroids with long rotation periods. Results. Our results show that the shape models of our sample do not seem to have peculiar properties with respect to asteroids with similar size, while an excess of slow rotators is most probably confirmed. △ Less

Submitted 24 July, 2017; originally announced July 2017.

Journal ref: A&A 607, A119 (2017)

arXiv:1707.04427 [pdf, other]

doi 10.1093/mnras/stx1343

Statistical analysis of the ambiguities in the asteroid period determinations

Authors: M. Butkiewicz-Bąk, T. Kwiatkowski, P. Bartczak, G. Dudziński, A. Marciniak

Abstract: Among asteroids there exist ambiguities in their rotation period determinations. They are due to incomplete coverage of the rotation, noise and/or aliases resulting from gaps between separate lightcurves. To help to remove such uncertainties, basic characteristic of the lightcurves resulting from constraints imposed by the asteroid shapes and geometries of observations should be identified. We sim… ▽ More Among asteroids there exist ambiguities in their rotation period determinations. They are due to incomplete coverage of the rotation, noise and/or aliases resulting from gaps between separate lightcurves. To help to remove such uncertainties, basic characteristic of the lightcurves resulting from constraints imposed by the asteroid shapes and geometries of observations should be identified. We simulated light variations of asteroids which shapes were modelled as Gaussian random spheres, with random orientations of spin vectors and phase angles changed every $5^\circ$ from $0^\circ$ to $65^\circ$. This produced 1.4 mln lightcurves. For each simulated lightcurve Fourier analysis has been made and the harmonic of the highest amplitude was recorded. From the statistical point of view, all lightcurves observed at phase angles $α< 30^\circ$, with peak-to-peak amplitudes $A>0.2$ mag are bimodal. Second most frequently dominating harmonic is the first one, with the 3rd harmonic following right after. For 1% of lightcurves with amplitudes $A < 0.1$ mag and phase angles $α< 40^\circ$ 4th harmonic dominates. △ Less

Submitted 14 July, 2017; originally announced July 2017.

Comments: 9 pages, 3 figures, 6 tables, accepted for publication in Monthly Notices of the Royal Astronomical Society

Journal ref: Monthly Notices of the Royal Astronomical Society Volume 470, Issue 2, p.1314-1320 (2017)

arXiv:1701.07725 [pdf, ps, other]

doi 10.1093/mnras/stw3075

The olivine-dominated composition of the Eureka family of Mars Trojan asteroids

Authors: G. Borisov, A. Christou, S. Bagnulo, A. Cellino, T. Kwiatkowski, A. Dell'Oro

Abstract: We have used the XSHOOTER echelle spectrograph on the European Southern Obseratory (ESO) Very Large Telescope (VLT) to obtain UVB-VIS-NIR (ultraviolet-blue (UVB), visible (VIS) and near-infrared (NIR)) reflectance spectra of two members of the Eureka family of L5 Mars Trojans, in order to test a genetic relationship to Eureka. In addition to obtaining spectra, we also carried out VRI photometry of… ▽ More We have used the XSHOOTER echelle spectrograph on the European Southern Obseratory (ESO) Very Large Telescope (VLT) to obtain UVB-VIS-NIR (ultraviolet-blue (UVB), visible (VIS) and near-infrared (NIR)) reflectance spectra of two members of the Eureka family of L5 Mars Trojans, in order to test a genetic relationship to Eureka. In addition to obtaining spectra, we also carried out VRI photometry of one of the VLT targets using the 2-m telescope at the Bulgarian National Astronomical Observatory - Rozhen and the two-channel focal reducer. We found that these asteroids belong to the olivine-dominated A, or Sa, taxonomic class. As Eureka itself is also an olivine-dominated asteroid, it is likely that all family asteroids share a common origin and composition. We discuss the significance of these results in terms of the origin of the martian Trojan population. △ Less

Submitted 26 January, 2017; originally announced January 2017.

Comments: 7 pages, 6 figures, 3 tables

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 466, Issue 1, p.489-495 (2017)

arXiv:1611.01436 [pdf, other]

Learning Recurrent Span Representations for Extractive Question Answering

Authors: Kenton Lee, Shimi Salant, Tom Kwiatkowski, Ankur Parikh, Dipanjan Das, Jonathan Berant

Abstract: The reading comprehension task, that asks questions about a given evidence document, is a central problem in natural language understanding. Recent formulations of this task have typically focused on answer selection from a set of candidates pre-defined manually or through the use of an external NLP pipeline. However, Rajpurkar et al. (2016) recently released the SQuAD dataset in which the answers… ▽ More The reading comprehension task, that asks questions about a given evidence document, is a central problem in natural language understanding. Recent formulations of this task have typically focused on answer selection from a set of candidates pre-defined manually or through the use of an external NLP pipeline. However, Rajpurkar et al. (2016) recently released the SQuAD dataset in which the answers can be arbitrary strings from the supplied text. In this paper, we focus on this answer extraction task, presenting a novel model architecture that efficiently builds fixed length representations of all spans in the evidence document with a recurrent network. We show that scoring explicit span representations significantly improves performance over other approaches that factor the prediction into separate predictions about words or start and end markers. Our approach improves upon the best published results of Wang & Jiang (2016) by 5% and decreases the error of Rajpurkar et al.'s baseline by > 50%. △ Less

Submitted 17 March, 2017; v1 submitted 4 November, 2016; originally announced November 2016.

ACM Class: I.2.7

arXiv:1408.4288 [pdf, ps, other]

doi 10.1051/0004-6361/201323250

Selecting asteroids for a targeted spectroscopic survey

Authors: D. A. Oszkiewicz, T. Kwiatkowski, T. Tomov, M. Birlan, S. Geier, A. Penttilä, M. Polińska

Abstract: Asteroid spectroscopy reflects surface mineralogy. There are few thousand asteroids whose surfaces have been observed spectrally. Determining the surface properties of those objects is important for many practical and scientific applications, such as for example develo** impact deflection strategies or studying history and evolution of the Solar System and planet formation. The aim of this stu… ▽ More Asteroid spectroscopy reflects surface mineralogy. There are few thousand asteroids whose surfaces have been observed spectrally. Determining the surface properties of those objects is important for many practical and scientific applications, such as for example develo** impact deflection strategies or studying history and evolution of the Solar System and planet formation. The aim of this study is to develop a pre-selection method that can be utilized in searching for asteroids of any taxonomic complex. The method could then be utilized im multiple applications such as searching for the missing V-types or looking for primitive asteroids. We used the Bayes Naive Classifier combined with observations obtained in the course of the Sloan Digital Sky Survey and the Wide-field Infrared Survey Explorer surveys as well as a database of asteroid phase curves for asteroids with known taxonomic type. Using the new classification method we have selected a number of possible V-type candidates. Some of the candidates were than spectrally observed at the Nordic Optical Telescope and South African Large Telescope. We have developed and tested the new pre-selection method. We found three asteroids in the mid/outer Main Belt that are likely of differentiated type. Near-Infrared are still required to confirm this discovery. Similarly to other studies we found that V-type candidates cluster around the Vesta family and are rare in the mid/oter Main Belt. The new method shows that even largely explored large databases combined together could still be further exploited in for example solving the missing dunite problem. △ Less

Submitted 19 August, 2014; originally announced August 2014.

Comments: accepted to AA

Journal ref: A&A 572, A29 (2014)

arXiv:1301.6943 [pdf, ps, other]

doi 10.1051/0004-6361/201220701

Asteroids' physical models from combined dense and sparse photometry and scaling of the YORP effect by the observed obliquity distribution

Authors: J. Hanuš, J. Ďurech, M. Brož, A. Marciniak, B. D. Warner, F. Pilcher, R. Stephens, R. Behrend, B. Carry, D. Čapek, P. Antonini, M. Audejean, K. Augustesen, E. Barbotin, P. Baudouin, A. Bayol, L. Bernasconi, W. Borczyk, J. -G. Bosch, E. Brochard, L. Brunetto, S. Casulli, A. Cazenave, S. Charbonnel, B. Christophe , et al. (95 additional authors not shown)

Abstract: The larger number of models of asteroid shapes and their rotational states derived by the lightcurve inversion give us better insight into both the nature of individual objects and the whole asteroid population. With a larger statistical sample we can study the physical properties of asteroid populations, such as main-belt asteroids or individual asteroid families, in more detail. Shape models can… ▽ More The larger number of models of asteroid shapes and their rotational states derived by the lightcurve inversion give us better insight into both the nature of individual objects and the whole asteroid population. With a larger statistical sample we can study the physical properties of asteroid populations, such as main-belt asteroids or individual asteroid families, in more detail. Shape models can also be used in combination with other types of observational data (IR, adaptive optics images, stellar occultations), e.g., to determine sizes and thermal properties. We use all available photometric data of asteroids to derive their physical models by the lightcurve inversion method and compare the observed pole latitude distributions of all asteroids with known convex shape models with the simulated pole latitude distributions. We used classical dense photometric lightcurves from several sources and sparse-in-time photometry from the U.S. Naval Observatory in Flagstaff, Catalina Sky Survey, and La Palma surveys (IAU codes 689, 703, 950) in the lightcurve inversion method to determine asteroid convex models and their rotational states. We also extended a simple dynamical model for the spin evolution of asteroids used in our previous paper. We present 119 new asteroid models derived from combined dense and sparse-in-time photometry. We discuss the reliability of asteroid shape models derived only from Catalina Sky Survey data (IAU code 703) and present 20 such models. By using different values for a scaling parameter cYORP (corresponds to the magnitude of the YORP momentum) in the dynamical model for the spin evolution and by comparing synthetics and observed pole-latitude distributions, we were able to constrain the typical values of the cYORP parameter as between 0.05 and 0.6. △ Less

Submitted 29 January, 2013; originally announced January 2013.

Comments: Accepted for publication in A&A, January 15, 2013

arXiv:1210.3486 [pdf, ps, other]

doi 10.1051/0004-6361/201220156

Physical and dynamical characterisation of low Delta-V NEA (190491) 2000 FJ10

Authors: A. A. Christou, T. Kwiatkowski, M. Butkiewicz, A. Gulbis, C. W. Hergenrother, S. Duddy, A. Fitzsimmons

Abstract: We investigated the physical properties and dynamical evolution of Near Earth Asteroid (NEA) (190491) 2000 FJ10 in order to assess the suitability of this accessible NEA as a space mission target. Photometry and colour determination were carried out with the 1.54 m Kuiper Telescope and the 10 m Southern African Large Telescope during the object's recent favourable apparition in 2011-12. During the… ▽ More We investigated the physical properties and dynamical evolution of Near Earth Asteroid (NEA) (190491) 2000 FJ10 in order to assess the suitability of this accessible NEA as a space mission target. Photometry and colour determination were carried out with the 1.54 m Kuiper Telescope and the 10 m Southern African Large Telescope during the object's recent favourable apparition in 2011-12. During the earlier 2008 apparition, a spectrum of the object in the 6000-9000 Angstrom region was obtained with the 4.2 m William Herschel Telescope. Interpretation of the observational results was aided by numerical simulations of 1000 dynamical clones of 2000 FJ10 up to 10^6 yr in the past and in the future. The asteroid's spectrum and colours determined by our observations suggest a taxonomic classification within the S-complex although other classifications (V, D, E, M, P) cannot be ruled out. On this evidence, it is unlikely to be a primitive, relatively unaltered remnant from the early history of the solar system and thus a low priority target for robotic sample return. Our photometry placed a lower bound of 2 hrs to the asteroid's rotation period. Its absolute magnitude was estimated to be 21.54+-0.1 which, for a typical S-complex albedo, translates into a diameter of 130+-20 m. Our dynamical simulations show that it has likely been an Amor for the past 10^5 yr. Although currently not Earth-crossing, it will likely become so during the period 50 - 100 kyr in the future. It may have arrived from the inner or central Main Belt > 1 Myr ago as a former member of a low-inclination S-class asteroid family. Its relatively slow rotation and large size make it a suitable destination for a human mission. We show that ballistic Earth-190491-Earth transfer trajectories with Delta-V < 2 km s^-1 at the asteroid exist between 2052 and 2061. △ Less

Submitted 12 October, 2012; originally announced October 2012.

Comments: 2 Tables, 11 Figures, accepted for publication in Astronomy & Astrophysics

arXiv:1004.0420 [pdf, ps, other]

Four unusual novae observed in Torun: V2362 Cyg, V2467 Cyg, V458 Vul, V2491 Cyg

Authors: E. Ragan, M. Mikolajewski, T. Tomov, W. Dimitrow, M. Fagas, T. Kwiatkowski, A. Schwarzenberg-Czerny, Ch. Buil, E. Swierczynski, T. Brozek, M. Cikala, K. Czart, A. Fidos, S. Frackowiak, C. Galan, A. Karska, M. Klosinska, M. Lewandowski, T. Radomski, P. Rozanski, M. Wiecek, P. Wychudzki, A. Zajczyk, M. Zielinska

Abstract: We present photometric and spectral observation for four novae: V2362 Cyg, V2467 Cyg, V458 Vul, V2491 Cyg. All objects belongs to the "fast novae" class. For these stars we observed different departures from a typical behavior in the light curve and spectrum. We present photometric and spectral observation for four novae: V2362 Cyg, V2467 Cyg, V458 Vul, V2491 Cyg. All objects belongs to the "fast novae" class. For these stars we observed different departures from a typical behavior in the light curve and spectrum. △ Less

Submitted 1 June, 2010; v1 submitted 3 April, 2010; originally announced April 2010.

Comments: 4 pages, 4 figures

arXiv:0911.0554 [pdf, ps, other]

doi 10.1111/j.1365-2966.2009.15971.x

Absolute properties of the main-sequence eclipsing binary FM Leo

Authors: M. Ratajczak, T. Kwiatkowski, A. Schwarzenberg-Czerny, W. Dimitrov, M. Konacki, K. G. Helminiak, P. Bartczak, M. Fagas, K. Kaminski, P. Kankiewicz, W. Borczyk, A. Rozek

Abstract: First spectroscopic and new photometric observations of the eclipsing binary FM Leo are presented. The main aims were to determine orbital and stellar parameters of two components and their evolutionary stage. First spectroscopic observations of the system were obtained with DDO and PST spectrographs. The results of the orbital solution from radial velocity curves are combined with those derived… ▽ More First spectroscopic and new photometric observations of the eclipsing binary FM Leo are presented. The main aims were to determine orbital and stellar parameters of two components and their evolutionary stage. First spectroscopic observations of the system were obtained with DDO and PST spectrographs. The results of the orbital solution from radial velocity curves are combined with those derived from the light-curve analysis (ASAS-3 photometry and supplementary observations of eclipses with 1 m and 0.35 m telescopes) to derive orbital and stellar parameters. JKTEBOP, Wilson-Devinney binary modelling codes and a two-dimensional cross-correlation (TODCOR) method were applied for the analysis. We find the masses to be M_1 = 1.318 $\pm$ 0.007 and M_2 = 1.287 $\pm$ 0.007 M_sun, the radii to be R_1 = 1.648 $\pm$ 0.043 and R_2 = 1.511 $\pm$ 0.049 R_sun for primary and secondary stars, respectively. The evolutionary stage of the system is briefly discussed by comparing physical parameters with current stellar evolution models. We find the components are located at the main sequence, with an age of about 3 Gyr. △ Less

Submitted 3 November, 2009; originally announced November 2009.

Comments: 5 pages, 4 figures, to appear in MNRAS

Report number: 0911.0550

arXiv:0904.0600 [pdf, ps, other]

doi 10.1111/j.1365-2966.2009.14865.x

V440 Per: the longest period overtone Cepheid

Authors: R. Baranowski, R. Smolec, W. Dimitrov, T. Kwiatkowski, A. Schwarzenberg-Czerny, P. Bartczak, M. Fagas, W. Borczyk, K. Kaminski, P. Moskalik, R. Ratajczak, A. Rozek

Abstract: V440 Per is a Population I Cepheid with the period of 7.57 day and low amplitude, almost sinusoidal light and radial velocity curves. With no reliable data on the 1st harmonic, its pulsation mode identification remained controversial. We obtained a radial velocity curve of V440 Per with our new high precision and high throughput Poznan Spectroscopic Telescope. Our data reach the accuracy of 130… ▽ More V440 Per is a Population I Cepheid with the period of 7.57 day and low amplitude, almost sinusoidal light and radial velocity curves. With no reliable data on the 1st harmonic, its pulsation mode identification remained controversial. We obtained a radial velocity curve of V440 Per with our new high precision and high throughput Poznan Spectroscopic Telescope. Our data reach the accuracy of 130 m/s per individual measurement and yield a secure detection of the 1st harmonic with the amplitude of A_2= 140+/- 15 m/s. The velocity Fourier phase φ_21 of V440 Per is inconsistent at the 7.25 σlevel with those of the fundamental mode Cepheids, implying that the star must be an overtone Cepheid, as originally proposed by Kienzle et al.(1999). Thus, V440 Per becomes the longest period Cepheid with the securely established overtone pulsations. We show, that the convective nonlinear pulsation hydrocode can reproduce the Fourier parameters of V440 Per very well. Requirement to match the observed properties of V440 Per constrains free parameters of the dynamical convection model used in the pulsation calculations, in particular the radiative losses parameter. △ Less

Submitted 3 April, 2009; originally announced April 2009.

Comments: Submitted to MNRAS

Journal ref: Mon.Not.Roy.Astron.Soc.396:2194-2200, 2009

arXiv:astro-ph/0307019 [pdf, ps, other]

Radial velocities of "slow movers" - call for observations

Authors: Piotr A. Dybczynski, Tomasz Kwiatkowski

Abstract: This paper presents a list of suggested stars for radial velocity measurements. We explain here in brief the research project for which the radial velocity of the "slow movers" i.e. small proper motion stars are necessary. Basing on this study we prepared a list of 1100 stellar targets with very accurate positions, proper motions and trigonometric parallaxes but without radial velocity measureme… ▽ More This paper presents a list of suggested stars for radial velocity measurements. We explain here in brief the research project for which the radial velocity of the "slow movers" i.e. small proper motion stars are necessary. Basing on this study we prepared a list of 1100 stellar targets with very accurate positions, proper motions and trigonometric parallaxes but without radial velocity measurements. Distributions of stellar brightnesses and spectral types among these stars are presented as well as its "most wanted" subset. We announce the begin of the radial velocity measurements to be conducted with our new echelle spectrograph just put into operation and offer some coordination for observations of targets that cannot be reached from our location. △ Less

Submitted 3 October, 2003; v1 submitted 1 July, 2003; originally announced July 2003.

Comments: 6 pages, 2 figures; changed LaTeX style; minor changes in the text

Showing 1–35 of 35 results for author: Kwiatkowski, T