Options
Assessing the FAIRness of Software Repositories Using RDF and SHACL
Hummel, Tobias; Martin, Leon; Henrich, Andreas (2024): Assessing the FAIRness of Software Repositories Using RDF and SHACL, in: Angelo Salatino, Mehwish Alam, Femke Ongenae, u. a. (Hrsg.), Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI, IOS Press, S. 160–175, doi: 10.3233/ssw240014.
Faculty/Chair:
Author:
Title of the compilation:
Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI
Editors:
Salatino, Angelo
Alam, Mehwish
Ongenae, Femke
Vahdati, Sahar
Gentile, Anna-Lisa
Pellegrini, Tassilo
Jiang, Shufan
Conference:
20th International Conference on Semantic Systems, 17–19 September 2024 ; Amsterdam
Publisher Information:
Year of publication:
2024
Pages:
ISBN:
978-1-64368-537-3
Series ; Volume:
Studies on the Semantic Web ; 60
Language:
English
DOI:
Abstract:
Purpose:
A previous paper proposed the usage of SHACL to assess the FAIRness of software repositories. Following this call to action, this paper introduces and discusses the changes made to QUARE, a SHACL-based tool for validating GitHub repositories against sets of quality criteria, to facilitate this task.
Methodology:
An operationalization of the abstract FAIR best practices from previous work is devised to enable a FAIRness assessment based on concrete quality criteria. Afterwards, a SHACL shapes graph implementing these constraints is introduced, followed by a discussion of the efficient generation of suitable RDF representations for GitHub repositories. Improvements regarding the usability of QUARE are examined, as well. An evaluation on the FAIRness of 223 GitHub repositories and on the runtime performance of the assessment is conducted.
Findings:
On average, trending repositories comply with fewer FAIR best practices than repositories expected to be FAIR. However, the latter still exhibit deficiencies, for example, regarding the correct application of semantic versioning. The low average runtime of the FAIRness assessment of respectively 3.50 and 5.73 seconds per repository permits the integration of QUARE in, e.g., CI/CD pipelines.
Value:
The FAIR principles are often mentioned as a measure to tackle the reproducibility crisis, which continues to have a significant impact on science. To implement these principles in practice, it is crucial to provide tools that facilitate the automated assessment of the FAIRness of software repositories. The enhanced version of QUARE introduced in this paper represents our proposal for this demand.
A previous paper proposed the usage of SHACL to assess the FAIRness of software repositories. Following this call to action, this paper introduces and discusses the changes made to QUARE, a SHACL-based tool for validating GitHub repositories against sets of quality criteria, to facilitate this task.
Methodology:
An operationalization of the abstract FAIR best practices from previous work is devised to enable a FAIRness assessment based on concrete quality criteria. Afterwards, a SHACL shapes graph implementing these constraints is introduced, followed by a discussion of the efficient generation of suitable RDF representations for GitHub repositories. Improvements regarding the usability of QUARE are examined, as well. An evaluation on the FAIRness of 223 GitHub repositories and on the runtime performance of the assessment is conducted.
Findings:
On average, trending repositories comply with fewer FAIR best practices than repositories expected to be FAIR. However, the latter still exhibit deficiencies, for example, regarding the correct application of semantic versioning. The low average runtime of the FAIRness assessment of respectively 3.50 and 5.73 seconds per repository permits the integration of QUARE in, e.g., CI/CD pipelines.
Value:
The FAIR principles are often mentioned as a measure to tackle the reproducibility crisis, which continues to have a significant impact on science. To implement these principles in practice, it is crucial to provide tools that facilitate the automated assessment of the FAIRness of software repositories. The enhanced version of QUARE introduced in this paper represents our proposal for this demand.
Keywords: ;  ; 
FAIR software
GitHub repositories
SHACL
Peer Reviewed:
Yes:
International Distribution:
Yes:
Type:
Conferenceobject
Activation date:
October 29, 2024
Versioning
Question on publication
Permalink
https://fis.uni-bamberg.de/handle/uniba/104187