Showing 1–48 of 48 results for author: Accomazzi, A

Searching in archive cs.
  1. arXiv:2405.10725

    cs.CL cs.IR

    INDUS: Effective and Efficient Language Models for Scientific Applications

    Authors: Bishwaranjan Bhattacharjee, Aashka Trivedi, Masayasu Muraoka, Muthukumaran Ramasubramanian, Takuma Udagawa, Iksha Gurung, Rong Zhang, Bharath Dandala, Rahul Ramachandran, Manil Maskey, Kaylin Bugbee, Mike Little, Elizabeth Fancher, Lauren Sanders, Sylvain Costes, Sergi Blanco-Cuaresma, Kelly Lockhart, Thomas Allen, Felix Grezes, Megan Ansdell, Alberto Accomazzi, Yousef El-Kurdi, Davis Wertheimer, Birgit Pfitzmann, Cesar Berrospi Ramis, et al. (9 additional authors not shown)

    Abstract: Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this pivotal insight, we developed INDUS, a comprehensive suite of LLMs tailored for the Earth science, biology, physics, heliophysics,…

    Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  2. arXiv:2401.09685

    astro-ph.IM cs.DL

    Decades of Transformation: Evolution of the NASA Astrophysics Data System's Infrastructure

    Authors: Alberto Accomazzi

    Abstract: The NASA Astrophysics Data System (ADS) is the primary Digital Library portal for researchers in astronomy and astrophysics. Over the past 30 years, the ADS has gone from being an astronomy-focused bibliographic database to an open digital library system supporting research in space and (soon) earth sciences. This paper describes the evolution of the ADS system, its capabilities, and the technolog…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 10 pages, 3 figures, submitted to the ADASS 2023 proceedings

  3. arXiv:2312.14211

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33rd annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  4. arXiv:2312.08579

    cs.CL astro-ph.IM cs.LG

    Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach

    Authors: Golnaz Shapurian, Michael J Kurtz, Alberto Accomazzi

    Abstract: The automatic identification of planetary feature names in astronomy publications presents numerous challenges. These features include craters, defined as roughly circular depressions resulting from impact or volcanic activity; dorsas, which are elongate raised structures or wrinkle ridges; and lacus, small irregular patches of dark, smooth material on the Moon, referred to as "lake" (Planetary Na…

    Submitted 17 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

  5. arXiv:2309.06126

    astro-ph.IM astro-ph.CO astro-ph.GA astro-ph.HE cs.CL cs.LG

    AstroLLaMA: Towards Specialized Foundation Models in Astronomy

    Authors: Tuan Dung Nguyen, Yuan-Sen Ting, Ioana Ciucă, Charlie O'Neill, Ze-Chang Sun, Maja Jabłońska, Sandor Kruk, Ernest Perkowski, Jack Miller, Jason Li, Josh Peek, Kartheik Iyer, Tomasz Różański, Pranav Khetarpal, Sharaf Zaman, David Brodrick, Sergio J. Rodríguez Méndez, Thang Bui, Alyssa Goodman, Alberto Accomazzi, Jill Naiman, Jesse Cranney, Kevin Schawinski, UniverseTBD

    Abstract: Large language models excel in many human-language tasks but often falter in highly specialized domains like scholarly astronomy. To bridge this gap, we introduce AstroLLaMA, a 7-billion-parameter model fine-tuned from LLaMA-2 using over 300,000 astronomy abstracts from arXiv. Optimized for traditional causal language modeling, AstroLLaMA achieves a 30% lower perplexity than Llama-2, showing marke…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures, submitted to IJCNLP-AACL 2023. Comments are welcome. The model can be found on Hugging Face - https://huggingface.co/universeTBD/astrollama

  6. arXiv:2212.00744

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first…

    Submitted 29 November, 2022; originally announced December 2022.

  7. arXiv:2202.00777

    cs.HC astro-ph.IM

    Web accessibility trends and implementation in dynamic web applications

    Authors: Timothy W. Hostetler, Shinyi Chen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Donna M. Thompson, Roman Chyla, Golnaz Shapurian, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Stephen McDonald, Felix Grezes

    Abstract: The NASA Astrophysics Data System (ADS), a critical research service for the astrophysics community, strives to provide the most accessible and inclusive environment for the discovery and exploration of the astronomical literature. Part of this goal involves creating a digital platform that can accommodate everybody, including those with disabilities that would benefit from alternative ways to pre…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted to ADASS XXXI (2021)

  8. arXiv:2112.00590

    cs.CL astro-ph.IM

    Building astroBERT, a language model for Astronomy & Astrophysics

    Authors: Felix Grezes, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Shinyi Chen, Chris Tanner, Pavlos Protopapas

    Abstract: The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and…

    Submitted 1 December, 2021; originally announced December 2021.

  9. arXiv:2009.14323

    astro-ph.IM cs.DL

    Enabling Synergy: Improving the Information Infrastructure for Planetary Science

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin A. Henneken

    Abstract: In this whitepaper we advocate that the Planetary Science (PS) community build a discipline-specific digital library, in collaboration with the existing astronomy digital library, ADS. We suggest that the PS data archives increase their level of curation to allow for direct linking between the archival data and the derived journal articles. And we suggest that a new component of the PS information…

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: 8 pages, submitted to the Planetary Science and Astrobiology Decadal Survey 2023-2032

  10. arXiv:2009.05048

    cs.SE astro-ph.IM

    Agile methodologies in teams with highly creative and autonomous members

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi

    Abstract: The Agile manifesto encourages us to value individuals and interactions over processes and tools, while Scrum, the most adopted Agile development methodology, is essentially based on roles, events, artifacts, and the rules that bind them together (i.e., processes). Moreover, it is generally proclaimed that whenever a Scrum project does not succeed, the reason is because Scrum was not implemented c…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: To appear in the proceedings of the 29th annual international Astronomical Data Analysis Software & Systems (ADASS XXIX)

  11. arXiv:1911.00295

    cs.DL

    Practice meets Principle: Tracking Software and Data Citations to Zenodo DOIs

    Authors: Stephanie van de Sandt, Lars Holm Nielsen, Alexandros Ioannidis, August Muench, Edwin Henneken, Alberto Accomazzi, Chiara Bigarella, Jose Benito Gonzalez Lopez, Sünje Dallmeier-Tiessen

    Abstract: Data and software citations are crucial for the transparency of research results and for the transmission of credit. But they are hard to track, because of the absence of a common citation standard. As a consequence, the FORCE11 recently proposed data and software citation principles as guidance for authors. Zenodo is recognized for the implementation of DOIs for software on a large scale. The min…

    Submitted 1 November, 2019; originally announced November 2019.

  12. arXiv:1901.05463

    astro-ph.IM cs.DL

    Fundamentals of effective cloud management for the new NASA Astrophysics Data System

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi, Nathan Rapport

    Abstract: The new NASA Astrophysics Data System (ADS) is designed with a service-oriented architecture (SOA) that consists of multiple customized Apache Solr search engine instances plus a collection of microservices, containerized using Docker, and deployed in Amazon Web Services (AWS). For complex systems, like the ADS, this loosely coupled architecture can lead to a more scalable, reliable and resilient s…

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: To appear in the proceedings of the 28th annual international Astronomical Data Analysis Software & Systems (ADASS XXVIII)

  13. arXiv:1803.03598

    astro-ph.IM cs.DL physics.soc-ph

    Merging the Astrophysics and Planetary Science Information Systems

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin A. Henneken

    Abstract: Conceptually exoplanet research has one foot in the discipline of Astrophysics and the other foot in Planetary Science. Research strategies for exoplanets will require efficient access to data and information from both realms. Astrophysics has a sophisticated, well integrated, distributed information system with archives and data centers which are interlinked with the technical literature via the…

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: Whitepaper submitted to the Committee on an Exoplanet Science Strategy

  14. arXiv:1801.01021

    astro-ph.IM cs.DL

    The Unified Astronomy Thesaurus: Semantic Metadata for Astronomy and Astrophysics

    Authors: Katie Frey, Alberto Accomazzi

    Abstract: Several different controlled vocabularies have been developed and used by the astronomical community, each designed to serve a specific need and a specific group. The Unified Astronomy Thesaurus (UAT) attempts to provide a highly structured controlled vocabulary that will be relevant and useful across the entire discipline, regardless of content or platform. As two major use cases for the UAT incl…

    Submitted 3 January, 2018; originally announced January 2018.

    Comments: Submitted to the Astrophysical Journal Supplements, 10 pages, 3 tables

  15. arXiv:1712.06704

    stat.ML cs.CL cs.IR

    Multilingual Topic Models

    Authors: Kriste Krstovski, Michael J. Kurtz, David A. Smith, Alberto Accomazzi

    Abstract: Scientific publications have evolved several features for mitigating vocabulary mismatch when indexing, retrieving, and computing similarity between articles. These mitigation strategies range from simply focusing on high-value article sections, such as titles and abstracts, to assigning keywords, often from controlled vocabularies, either manually or through automatic annotation. Various document…

    Submitted 18 December, 2017; originally announced December 2017.

    Comments: 18 pages, 9 figures

  16. New ADS Functionality for the Curator

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Steven McDonald, Taylor J. Shaulis, Sergi Blanco-Cuaresma, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton

    Abstract: In this paper we provide an update concerning the operations of the NASA Astrophysics Data System (ADS), its services and user interface, and the content currently indexed in its database. As the primary information system used by researchers in Astronomy, the ADS aims to provide a comprehensive index of all scholarly resources appearing in the literature. With the current effort in our community…

    Submitted 23 October, 2017; originally announced October 2017.

    Comments: Submitted to the Proceedings of Library and Information Services in Astronomy VIII, Strasbourg, France

  17. arXiv:1601.07858

    astro-ph.IM cs.DL

    Aggregation and Linking of Observational Metadata in the ADS

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Alexandra Holachek, Jonathan Elliott

    Abstract: We discuss current efforts behind the curation of observing proposals, archive bibliographies, and data links in the NASA Astrophysics Data System (ADS). The primary data in the ADS is the bibliographic content from scholarly articles in Astronomy and Physics, which ADS aggregates from publishers, arXiv and conference proceeding sites. This core bibliographic information is then further enriched b…

    Submitted 28 January, 2016; originally announced January 2016.

    Comments: 4 pages, Proceedings of the ADASS XXV conference

  18. arXiv:1503.05881

    cs.DL

    ADS 2.0: new architecture, API and services

    Authors: Roman Chyla, Alberto Accomazzi, Alexandra Holachek, Carolyn S. Grant, Jonathan Elliott, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray, Vladimir Sudilovsky

    Abstract: The ADS platform is undergoing the biggest rewrite of its 20-year history. While several components have been added to its architecture over the past couple of years, this talk will concentrate on the underpinnings of ADS's search layer and its API. To illustrate the design of the components in the new system, we will show how the new ADS user interface is built exclusively on top of the API using…

    Submitted 19 March, 2015; originally announced March 2015.

    Comments: ADASS Conference 2014

  19. arXiv:1503.04194

    astro-ph.IM cs.DL

    ADS: The Next Generation Search Platform

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Roman Chyla, James Luker, Carolyn S. Grant, Donna M. Thompson, Alexandra Holachek, Rahul Dave, Stephen S. Murray

    Abstract: Four years after the last LISA meeting, the NASA Astrophysics Data System (ADS) finds itself in the middle of major changes to the infrastructure and contents of its database. In this paper we highlight a number of features of great importance to librarians and discuss the additional functionality that we are currently developing. Starting in 2011, the ADS started to systematically collect, parse…

    Submitted 13 March, 2015; originally announced March 2015.

    Comments: Submitted to Library and Information Services in Astronomy VII, Naples, Italy

  20. arXiv:1406.4542

    cs.DL astro-ph.IM

    Computing and Using Metrics in the ADS

    Authors: Edwin A. Henneken, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Donna Thompson, Jay Luker, Roman Chyla, Alexandra Holachek, Stephen S. Murray

    Abstract: Finding measures for research impact, be it for individuals, institutions, instruments or projects, has gained a lot of popularity. More papers than ever are being written on new impact measures, and problems with existing measures are being pointed out on a regular basis. Funding agencies require impact statistics in their reports, job candidates incorporate them in their resumes, and publication…

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: to appear in proceedings of LISA VII conference, Naples, Italy

  21. arXiv:1403.6656

    astro-ph.IM cs.DL

    The Unified Astronomy Thesaurus

    Authors: Alberto Accomazzi, Norman Gray, Chris Erdmann, Chris Biemesderfer, Katie Frey, Justin Soles

    Abstract: The Unified Astronomy Thesaurus (UAT) is an open, interoperable and community-supported thesaurus which unifies the existing divergent and isolated Astronomy & Astrophysics vocabularies into a single high-quality, freely-available open thesaurus formalizing astronomical concepts and their inter-relationships. The UAT builds upon the existing IAU Thesaurus with major contributions from the astronom…

    Submitted 26 March, 2014; originally announced March 2014.

    Comments: 4 pages, 1 figure, to appear in Proceedings of Astronomical Data Analysis Software and Systems XXIII, which took place September 29-October 3, 2013

  22. arXiv:1210.8030

    astro-ph.IM cs.DL

    Astronomy and Computing: a New Journal for the Astronomical Computing Community

    Authors: Alberto Accomazzi, Tamás Budavári, Christopher Fluke, Norman Gray, Robert G Mann, William O'Mullane, Andreas Wicenec, Michael Wise

    Abstract: We introduce Astronomy and Computing, a new journal for the growing population of people working in the domain where astronomy overlaps with computer science and information technology. The journal aims to provide a new communication channel within that community, which is not well served by current journals, and to help secure recognition of its true importance within modern astronomy. In…

    Submitted 30 October, 2012; originally announced October 2012.

    Comments: 5 pages, no figures; editorial for first edition of journal

  23. arXiv:1206.6352

    astro-ph.IM cs.DL

    Telescope Bibliographies: an Essential Component of Archival Data Management and Operations

    Authors: Alberto Accomazzi, Edwin Henneken, Christopher Erdmann, Arnold Rots

    Abstract: Assessing the impact of astronomical facilities rests upon an evaluation of the scientific discoveries which their data have enabled. Telescope bibliographies, which link data products with the literature, provide a way to use bibliometrics as an impact measure for the underlying data. In this paper we argue that the creation and maintenance of telescope bibliographies should be considered an inte…

    Submitted 30 July, 2012; v1 submitted 27 June, 2012; originally announced June 2012.

    Comments: 10 pages, 3 figures, to appear in SPIE Astronomical Telescopes and Instrumentation, SPIE Conference Series 8448

  24. arXiv:1112.1688

    astro-ph.IM cs.DL

    Why don't we already have an Integrated Framework for the Publication and Preservation of all Data Products?

    Authors: Alberto Accomazzi, Sebastien Derriere, Chris Biemesderfer, Norman Gray

    Abstract: Astronomy has long had a working network of archives supporting the curation of publications and data. The discipline has already created many of the features which perplex other areas of science: (1) data repositories: (supra)national institutes, dedicated to large projects; a culture of user-contributed data; practical experience of long-term data preservation; (2) dataset identifiers: the commu…

    Submitted 7 December, 2011; originally announced December 2011.

    Comments: 4 pages, submitted to the ADASS XXI proceedings

  25. arXiv:1111.3618

    cs.DL astro-ph.IM

    Linking to Data - Effect on Citation Rates in Astronomy

    Authors: Edwin A. Henneken, Alberto Accomazzi

    Abstract: Is there a difference in citation rates between articles that were published with links to data and articles that were not? Besides being interesting from a purely academic point of view, this question is also highly relevant for the process of furthering science. Data sharing not only helps the process of verification of claims, but also the discovery of new findings in archival data. However, li…

    Submitted 15 November, 2011; originally announced November 2011.

    Comments: 4 pages, 3 figures, will appear in the proceedings of ADASS XXI

  26. arXiv:1106.5644

    astro-ph.IM cs.DL

    The ADS in the Information Age - Impact on Discovery

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi

    Abstract: The SAO/NASA Astrophysics Data System (ADS) grew up with and has been riding the waves of the Information Age, closely monitoring and anticipating the needs of its end-users. By now, all professional astronomers are using the ADS on a daily basis, and a substantial fraction have been using it for their entire professional career. In addition to being an indispensable tool for professional scientis…

    Submitted 28 June, 2011; originally announced June 2011.

    Comments: 10 pages, 5 figures, to appear in "Organizations, People and Strategies in Astronomy (OPSA)", volume 8

  27. arXiv:1103.5958

    astro-ph.IM cs.DL

    Semantic Interlinking of Resources in the Virtual Observatory Era

    Authors: Alberto Accomazzi, Rahul Dave

    Abstract: In the coming era of data-intensive science, it will be increasingly important to be able to seamlessly move between scientific results, the data analyzed in them, and the processes used to produce them. As observations, derived data products, publications, and object metadata are curated by different projects and archived in different locations, establishing the proper linkages between these reso…

    Submitted 30 March, 2011; originally announced March 2011.

    Comments: 10 pages, 3 figures, to appear in: ASPC 442 (2011), Proceedings of Astronomical Data Analysis Software and Systems XX

  28. Linking Literature and Data: Status Report and Future Efforts

    Authors: Alberto Accomazzi

    Abstract: In the current era of data-intensive science, it is increasingly important for researchers to be able to have access to published results, the supporting data, and the processes used to produce them. Six years ago, recognizing this need, the American Astronomical Society and the Astrophysics Data Centers Executive Committee (ADEC) sponsored an effort to facilitate the annotation and linking of dat…

    Submitted 22 March, 2011; originally announced March 2011.

    Comments: 9 pages, 2 figures, to appear in: Future Professional Communication in Astronomy II (FPCA-II)

  29. arXiv:1006.0670

    astro-ph.IM cs.DL

    Astronomy 3.0 Style

    Authors: Alberto Accomazzi

    Abstract: Over the next decade we will witness the development of a new infrastructure in support of data-intensive scientific research, which includes Astronomy. This new networked environment will offer both challenges and opportunities to our community and has the potential to transform the way data are described, curated and preserved. Based on the lessons learned during the development and management o…

    Submitted 3 June, 2010; originally announced June 2010.

    Comments: 9 pages, 2 figures, to appear in Library and Information Services in Astronomy VI, ASP Conference Proceedings

  30. Finding Your Literature Match -- A Recommender System

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi, Carolyn Grant, Donna Thompson, Elizabeth Bohlen, Giovanni Di Milia, Jay Luker, Stephen S. Murray

    Abstract: The universe of potentially interesting, searchable literature is expanding continuously. Besides the normal expansion, there is an additional influx of literature because of interdisciplinary boundaries becoming more and more diffuse. Hence, the need for accurate, efficient and intelligent search tools is bigger than ever. Even with a sophisticated search engine, looking for information can still…

    Submitted 13 May, 2010; originally announced May 2010.

    Comments: Contribution to the proceedings of the colloquium Future Professional Communication in Astronomy II, 13-14 April 2010, Cambridge, Massachusetts. 11 pages, 4 figures.

  31. arXiv:0912.5235

    astro-ph.IM cs.DL cs.IR physics.soc-ph

    Using Multipartite Graphs for Recommendation and Discovery

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin Henneken, Giovanni Di Milia, Carolyn S. Grant

    Abstract: The Smithsonian/NASA Astrophysics Data System exists at the nexus of a dense system of interacting and interlinked information networks. The syntactic and the semantic content of this multipartite graph structure can be combined to provide very specific research recommendations to the scientist/user.

    Submitted 30 December, 2009; originally announced December 2009.

    Comments: To appear in ADASS XIX, ASP Conf Proc

  32. arXiv:0909.4789

    cs.DL physics.soc-ph

    The Bibliometric Properties of Article Readership Information

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Markus Demleitner, Stephen S. Murray, Nathalie Martimbeau, Barbara Elwell

    Abstract: The NASA Astrophysics Data System (ADS), along with astronomy's journals and data centers (a collaboration dubbed URANIA), has developed a distributed on-line digital library which has become the dominant means by which astronomers search, access and read their technical literature. Digital libraries such as the NASA Astrophysics Data System permit the easy accumulation of a new type of bibliome…

    Submitted 25 September, 2009; originally announced September 2009.

    Comments: ADS bibcode: 2005JASIS..56..111K This is the second paper (the first is Worldwide Use and Impact of the NASA Astrophysics Data System Digital Library) from the original article The NASA Astrophysics Data System: Sociology, Bibliometrics, and Impact, which went on-line in the summer of 2003

    Journal ref: The Journal of the American Society for Information Science and Technology, Vol. 56, p. 111 (2005)

  33. arXiv:0909.4786

    cs.DL physics.soc-ph

    Worldwide Use and Impact of the NASA Astrophysics Data System Digital Library

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Markus Demleitner, Stephen S. Murray

    Abstract: By combining data from the text, citation, and reference databases with data from the ADS readership logs we have been able to create Second Order Bibliometric Operators, a customizable class of collaborative filters which permits substantially improved accuracy in literature queries. Using the ADS usage logs along with membership statistics from the International Astronomical Union and data o…

    Submitted 25 September, 2009; originally announced September 2009.

    Comments: ADS bibcode: 2005JASIS..56...36K This is a portion (The bibliometric properties of article readership information is the other part) of the article: The NASA Astrophysics Data System: Sociology, bibliometrics and impact, which went on-line in the summer of 2003

    Journal ref: The Journal of the American Society for Information Science and Technology, Vol. 56, p. 36. (2005)

  34. arXiv:0903.3228

    astro-ph.IM cs.DL

    The Smithsonian/NASA Astrophysics Data System (ADS) Decennial Report

    Authors: Michael J. Kurtz, Alberto Accomazzi, Stephen S. Murray

    Abstract: Eight years after the ADS first appeared the last decadal survey wrote: "NASA's initiative for the Astrophysics Data System has vastly increased the accessibility of the scientific literature for astronomers. NASA deserves credit for this valuable initiative and is urged to continue it." Here we summarize some of the changes concerning the ADS which have occurred in the past ten years, and we de…

    Submitted 18 March, 2009; originally announced March 2009.

    Comments: 6 pages, whitepaper submitted to the National Research Council Astronomy and Astrophysics Decadal Survey

  35. Use of Astronomical Literature - A Report on Usage Patterns

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: In this paper we present a number of metrics for usage of the SAO/NASA Astrophysics Data System (ADS). Since the ADS is used by the entire astronomical community, these are indicative of how the astronomical literature is used. We will show how the use of the ADS has changed both quantitatively and qualitatively. We will also show that different types of users access the system in different ways…

    Submitted 3 October, 2008; v1 submitted 1 August, 2008; originally announced August 2008.

    Comments: 12 pages, 8 figures, 2 tables. Accepted by Journal of Informetrics

  36. arXiv:cs/0701035

    cs.DL astro-ph

    Finding Astronomical Communities Through Co-readership Analysis

    Authors: Edwin A. Henneken, Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Whenever a large group of people are engaged in an activity, communities will form. The nature of these communities depends on the relationship considered. In the group of people who regularly use scholarly literature, a relationship like "person i and person j have cited the same paper" might reveal communities of people working in a particular field. On this poster, we will investigate the r…

    Submitted 5 January, 2007; originally announced January 2007.

    Comments: poster presented at the 209th AAS Meeting, 7 pages, 4 figures

  37. arXiv:cs/0610030

    cs.DL cs.HC

    Paper to Screen: Processing Historical Scans in the ADS

    Authors: Donna M. Thompson, Alberto Accomazzi, Guenther Eichhorn, Carolyn Grant, Edwin Henneken, Michael J. Kurtz, Elizabeth Bohlen, Stephen S. Murray

    Abstract: The NASA Astrophysics Data System in conjunction with the Wolbach Library at the Harvard-Smithsonian Center for Astrophysics is working on a project to microfilm historical observatory publications. The microfilm is then scanned for inclusion in the ADS. The ADS currently contains over 700,000 scanned pages of volumes of historical literature. Many of these volumes lack clear pagination or other…

    Submitted 5 October, 2006; originally announced October 2006.

    Comments: 4 pages; submitted to the proceedings of Library and Information Services in Astronomy; to be published in the ASP Conference Series

  38. arXiv:cs/0610029  [pdf, ps, other]

    cs.DL cs.DB

    Data in the ADS -- Understanding How to Use it Better

    Authors: Carolyn S. Grant, Alberto Accomazzi, Donna Thompson, Edwin Henneken, Guenther Eichhorn, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Smithsonian/NASA ADS Abstract Service contains a wealth of data for astronomers and librarians alike, yet the vast majority of usage consists of rudimentary searches. Hints on how to obtain more focused search results by using more of the various capabilities of the ADS are presented, including searching by affiliation. We also discuss the classification of articles by content and by referee…

    Submitted 5 October, 2006; originally announced October 2006.

    Comments: 4 pages; submitted to the proceedings of the Library and Information Services in Astronomy V; to be published by ASP Conference Proceedings

  39. arXiv:cs/0610011  [pdf, ps, other]

    cs.DL astro-ph cs.DB cs.IR

    Creation and use of Citations in the ADS

    Authors: Alberto Accomazzi, Gunther Eichhorn, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Markus Demleitner, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: With over 20 million records, the ADS citation database is regularly used by researchers and librarians to measure the scientific impact of individuals, groups, and institutions. In addition to the traditional sources of citations, the ADS has recently added references extracted from the arXiv e-prints on a nightly basis. We review the procedures used to harvest and identify the reference data u…

    Submitted 3 October, 2006; originally announced October 2006.

    Comments: 9 pages; to be published in the proceedings of the conference "Library and Information Services V," June 2006, Cambridge, MA, USA

  40. arXiv:cs/0610008  [pdf, ps, other]

    cs.DL astro-ph cs.DB

    Connectivity in the Astronomy Digital Library

    Authors: Günther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Astrophysics Data System (ADS) provides an extensive system of links between the literature and other on-line information. Recently, the journals of the American Astronomical Society (AAS) and a group of NASA data centers have collaborated to provide more links between on-line data obtained by space missions and the on-line journals. Authors can now specify which data sets they have used in…

    Submitted 2 October, 2006; originally announced October 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  41. arXiv:cs/0610007  [pdf, ps, other]

    cs.DL astro-ph cs.DB

    Full Text Searching in the Astrophysics Data System

    Authors: Günther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Smithsonian/NASA Astrophysics Data System (ADS) provides a search system for the astronomy and physics scholarly literature. All major and many smaller astronomy journals that were published on paper have been scanned back to volume 1 and are available through the ADS free of charge. All scanned pages have been converted to text and can be searched through the ADS Full Text Search System. In…

    Submitted 5 October, 2006; v1 submitted 2 October, 2006; originally announced October 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  42. E-prints and Journal Articles in Astronomy: a Productive Co-existence

    Authors: Edwin A. Henneken, Michael J. Kurtz, Simeon Warner, Paul Ginsparg, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Are the e-prints (electronic preprints) from the arXiv repository being used instead of the journal articles? In this paper we show that the e-prints have not undermined the usage of journal papers in the astrophysics community. As soon as the journal article is published, the astronomical community prefers to read the journal article and the use of e-prints through the NASA Astrophysics Data Sy…

    Submitted 22 September, 2006; originally announced September 2006.

    Comments: 8 pages, 4 figures, submitted to Learned Publishing

    Journal ref: Learn.Publ.20:16-22,2007

  43. arXiv:astro-ph/0609794  [pdf, ps, other]

    astro-ph cs.DL

    The Future of Technical Libraries

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Edwin Henneken, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Technical libraries are currently experiencing very rapid change. In the near future their mission will change, their physical nature will change, and the skills of their employees will change. While some will not be able to make these changes, and will fail, others will lead us into a new era.

    Submitted 28 September, 2006; originally announced September 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  44. arXiv:cs/0608027  [pdf, ps, other]

    cs.DL astro-ph

    myADS-arXiv - a Tailor-Made, Open Access, Virtual Journal

    Authors: E. Henneken, M. J. Kurtz, G. Eichhorn, A. Accomazzi, C. S. Grant, D. Thompson, E. Bohlen, S. S. Murray

    Abstract: The myADS-arXiv service provides the scientific community with a one stop shop for staying up-to-date with a researcher's field of interest. The service provides a powerful and unique filter on the enormous amount of bibliographic information added to the ADS on a daily basis. It also provides a complete view with the most relevant papers available in the subscriber's field of interest. With thi…

    Submitted 4 August, 2006; originally announced August 2006.

    Comments: 4 pages, 2 figures, poster paper to appear in the proceedings of the LISA V conference

  45. arXiv:cs/0604061  [pdf]

    cs.DL astro-ph

    Effect of E-printing on Citation Rates in Astronomy and Physics

    Authors: Edwin A. Henneken, Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Donna Thompson, Stephen S. Murray

    Abstract: In this report we examine the change in citation behavior since the introduction of the arXiv e-print repository (Ginsparg, 2001). It has been observed that papers that initially appear as arXiv e-prints get cited more than papers that do not (Lawrence, 2001; Brody et al., 2004; Schwarz & Kennicutt, 2004; Kurtz et al., 2005a; Metcalfe, 2005). Using the citation statistics from the NASA-Smithsoni…

    Submitted 5 June, 2006; v1 submitted 13 April, 2006; originally announced April 2006.

    Comments: Submitted to the Journal of Electronic Publishing. 11 pages with 5 figures

  46. arXiv:cs/0511002  [pdf, ps, other]

    cs.IR cs.DL

    Bibliographic Classification using the ADS Databases

    Authors: Alberto Accomazzi, Michael J. Kurtz, Guenther Eichhorn, Edwin Henneken, Carolyn S. Grant, Markus Demleitner, Stephen S. Murray

    Abstract: We discuss two techniques used to characterize bibliographic records based on their similarity to and relationship with the contents of the NASA Astrophysics Data System (ADS) databases. The first method has been used to classify input text as being relevant to one or more subject areas based on an analysis of the frequency distribution of its individual words. The second method has been used to…

    Submitted 31 October, 2005; originally announced November 2005.

    Comments: Latex, 4 pages, 1 Figure. To be published in the Proceedings of the Conference "Astronomical Data Analysis Software & Systems XV" held October 2-5, 2005, in San Lorenzo de El Escorial, Spain

  47. The Effect of Use and Access on Citations

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Markus Demleitner, Edwin Henneken, Stephen S. Murray

    Abstract: It has been shown (S. Lawrence, 2001, Nature, 411, 521) that journal articles which have been posted without charge on the internet are more heavily cited than those which have not been. Using data from the NASA Astrophysics Data System (ads.harvard.edu) and from the ArXiv e-print archive at Cornell University (arXiv.org) we examine the causes of this effect.

    Submitted 14 March, 2005; originally announced March 2005.

    Comments: Accepted for publication in Information Processing & Management, special issue on scientometrics

    ACM Class: H.3.7

    Journal ref: Inform Process Manag 41:1395-1402 (2005)

  48. arXiv:cs/0401028  [pdf, ps, other]

    cs.DL

    Automated Resolution of Noisy Bibliographic References

    Authors: Markus Demleitner, Michael Kurtz, Alberto Accomazzi, Günther Eichhorn, Carolyn S. Grant, Steven S. Murray

    Abstract: We describe a system used by the NASA Astrophysics Data System to identify bibliographic references obtained from scanned article pages by OCR methods with records in a bibliographic database. We analyze the process generating the noisy references and conclude that the three-step procedure of correcting the OCR results, parsing the corrected string and matching it against the database provides u…

    Submitted 27 January, 2004; originally announced January 2004.

    Comments: 10 pages, 1 figure; accepted for publication in the proceedings of the 2004 Meeting of the International Federation of Classification Societies

    ACM Class: H.3.7; H.3.2