Lucia Sohmen

TIB – Leibniz-Informationszentrum Technik und Naturwissenschaften

Lucia Sohmen studierte Bibliotheks- und Informationswissenschaft an der Humboldt-Universität zu Berlin. Seit 2017 ist sie an der TIB im Team Open Science Lab. Dort war sie zunächst für das DFG-Projekt „Nachnutzung von Open-Access-Abbildungen“ zuständig. Ende 2020 begann sie ihre Arbeit im Projekt NFDI4Culture, wo sie im Culture Coordination Office ist und den Knowledge Graph bearbeitet.

 

Developing an open source data pipeline for participation in community knowledge bases
December 2, 2021 12:20 - 12:40
Veranstaltungsraum: Tech Corner

 

How can users participate in Wikidata and other community knowledge bases at a large scale? We will showcase OpenRefine, an open source software for manipulating, enriching and uploading a big amount of data at once.

Many communities maintain digital knowledge bases to store and curate their communal knowledge in one place. In the biggest online community knowledge base, Wikidata and the software behind it - Wikibase - that knowledge is structured in semantic, machine-readable way by using linked open data. This presentation will showcase OpenRefine, an open source software for participation in Wikidata and Wikibase with a user-friendly interface that is still able to manipulate, enrich and upload a big amount of data at once. We will discuss a variety of use cases that are based on a data upload pipeline for 3D data enrichment in NFDI4Culture as well as a bring-your-own-data workshop from November 2021. In addition, we will demo the use of OpenRefine to transform data, match it with existing items in public knowledge bases, use the information from those items to enrich our own data, and finally contribute new information back to the community knowledge base. The core principles behind our data.

zurück zur Liste