Evaluation of software impact designed for biomedical research: Are we measuring what's meaningful?
Authors:
Awan Afiaz,
Andrey Ivanov,
John Chamberlin,
David Hanauer,
Candace Savonen,
Mary J Goldman,
Martin Morgan,
Michael Reich,
Alexander Getka,
Aaron Holmes,
Sarthak Pati,
Dan Knight,
Paul C. Boutros,
Spyridon Bakas,
J. Gregory Caporaso,
Guilherme Del Fiol,
Harry Hochheiser,
Brian Haas,
Patrick D. Schloss,
James A. Eddy,
Jake Albrecht,
Andrey Fedorov,
Levi Waldron,
Ava M. Hoffman,
Richard L. Bradshaw, et al. (2 additional authors not shown)
Abstract:
Software is vital for the advancement of biology and medicine. Analysis of usage and impact metrics can help developers determine user and community engagement, justify additional funding, encourage additional use, identify unanticipated use cases, and help define improvement areas. However, there are challenges associated with these analyses including distorted or misleading metrics, as well as ethical and security concerns. More attention to the nuances involved in capturing impact across the spectrum of biological software is needed. Furthermore, some tools may be especially beneficial to a small audience, yet may not have compelling typical usage metrics. We propose more general guidelines, as well as strategies for more specific types of software. We highlight outstanding issues regarding how communities measure or evaluate software impact. To get a deeper understanding of current practices for software evaluations, we performed a survey of participants in the Informatics Technology for Cancer Research (ITCR) program funded by the National Cancer Institute (NCI). We also investigated software among this community and others to assess how often infrastructure that supports such evaluations is implemented and how this impacts rates of papers describing usage of the software. We find that developers recognize the utility of analyzing software usage, but struggle to find the time or funding for such analyses. We also find that infrastructure such as social media presence, more in-depth documentation, the presence of software health metrics, and clear information on how to contact developers seem to be associated with increased usage rates. Our findings can help scientific software developers make the most out of evaluations of their software.
Submitted 5 June, 2023;
originally announced June 2023.
Motivation, inclusivity, and realism should drive data science education
Authors:
Candace Savonen,
Carrie Wright,
Ava M. Hoffman,
Elizabeth M. Humphries,
Katherine E. L. Cox,
Frederick J. Tan,
Jeffrey T. Leek
Abstract:
Data science education provides tremendous opportunities but remains inaccessible to many communities. Increasing the accessibility of data science to these communities not only benefits the individuals entering data science, but also increases the field's innovation and potential impact as a whole. Education is the most scalable solution to meet these needs, but many data science educators lack formal training in education. Our group has led education efforts for a variety of audiences: from professional scientists to high school students to lay audiences. These experiences have helped form our teaching philosophy, which we have summarized into three main ideals: 1) motivation, 2) inclusivity, and 3) realism. To put these ideals better into practice, we also aim to iteratively update our teaching approaches and curriculum as we find ways to better reach these ideals. In this manuscript we discuss these ideals as well as practical ideas for how to implement these philosophies in the classroom.
Submitted 9 May, 2023;
originally announced May 2023.
Open-source Tools for Training Resources -- OTTR
Authors:
Candace Savonen,
Carrie Wright,
Ava M. Hoffman,
John Muschelli,
Katherine Cox,
Frederick J. Tan,
Jeffrey T. Leek
Abstract:
Data science and informatics tools are developing at a blistering rate, but their users often lack the educational background or resources to efficiently apply the methods to their research. Training resources often deprecate because their maintenance is not prioritized by funding, giving teams little time to devote to such endeavors. Our group has developed Open-source Tools for Training Resources (OTTR) to offer greater efficiency and flexibility for creating and maintaining online course content. OTTR empowers creators to customize their work and allows for a simple workflow to publish using multiple platforms. OTTR allows content creators to publish material to multiple massive online learner communities using familiar rendering mechanics. OTTR allows the incorporation of pedagogical practices like formative and summative assessments in the form of multiple-choice questions and fill-in-the-blank problems that are automatically graded. No local installation of any software is required to begin creating content with OTTR. Thus far, 15 courses have been created with the OTTR repository template. By using the OTTR system, the maintenance workload for updating these courses across platforms has been drastically reduced.
Submitted 10 March, 2022;
originally announced March 2022.