Data Science & Metadata Research

To be discoverable by today’s online users, traditional library data must be transformed. OCLC Research analyzes bibliographic data to derive new meaning, insights, and services for use by library and information seekers. This work includes special projects, data science research, engagement with metadata communities, publications and presentations, and the creation of illustrative experimental applications.

Presentations

Case Studies of US Research Information Management

Case Studies of US Research Information Management

By Rebecca Bryant

VIVO 2021 International Conference
virtual

This session shares findings from a forthcoming OCLC Research report on Research Information Management Practices in the United States (http://oc.lc/us-rim-project), scheduled for early fall 2021. The report collects evidence from in-depth case studies of RIM practices at five US research universities: Penn State University, Texas A&M University, Virginia Tech, UCLA, and University of Miami. The case studies represent open source, proprietary, and home grown RIM solutions at the five institutions and highlight the proliferation of use cases such as public portals, faculty activity reporting, and strategic reporting.

By synthesizing information from the five case studies, we offer a comprehensive definition of Research Information Management and also document the multiple use cases that proliferate in decentralized US research universities. We will also offer a new RIM System Framework, which describes the required and optional functional and technical elements that comprise the architecture of US RIM systems, regardless of use case. We believe that this framework will help demystify RIM infrastructure and also help practitioners better understand the array of campus stakeholders required for successful RIM implementation.

This research is based upon interviews with 39 participants engaged in RIM activities at the five case study institutions and builds upon the significant body of work on RIM practices already produced by OCLC Research (oc.lc/rim). We believe this research is of considerable utility to the university community, offering a more comprehensive and strategic view of RIM practices, along with recommendations for institutions. We will conclude the presentation by demonstrating the value of the case studies and framework through examples pulled from the report’s case studies.

Topics: Research Information Management

Bringing IIIF Manifests to Life in Wikidata with Mirador 3 - 2021 IIIF Annual Conference

Bringing IIIF Manifests to Life in Wikidata with Mirador 3 - 2021 IIIF Annual Conference

By Jeff Mixter, Gina Solares

IIIF Annual Conference
virtual

Wikidata is an open knowledge base of structured data that describes any type of entity, including people, organizations, concepts, events, places, and works. Some works described in Wikidata now include a IIIF Presentation Manifest URL. In Wikidata’s default user interface, that URL appears as a link to the Manifest JSON. But Wikidata can be customized to alter the user interface and add new features.

In this presentation we will discuss and demonstrate a Wikidata user script that, for items that include a IIIF Presentation Manifest URL, will embed the ProjectMirador viewer and load the Manifest JSON so that the images referenced in the Manifest can be viewed in the context of other Wikidata statements about the work. 

The discussion will cover how the user script embeds the Mirador3 viewer within a Wikidata item page and how it detects that the viewer should be added. We will also illustrate how one library is including IIIF manifests in Wikidata, with a conversation about learnings from that work, and about how the user script has contributed to the library's understanding of IIIF metadata and Wikidata. The demonstration will show how Wikidata user scripts are created and shared and look at ways in which Wikidata queries can uncover IIIF manifests.

Topics: Wikimedia, IIIF

Open for All, Reusable for Whom?: A Review of What Data Reusers Want and How Data Repositories Can Deliver.

Open for All, Reusable for Whom?: A Review of What Data Reusers Want and How Data Repositories Can Deliver

By Ixchel M. Faniel, Lisa Johnston, Katie Wissel

Open Repositories 2021
virtual

Understanding how data reusers seek and evaluate potential data for reuse will aid data curators, data managers, and developers in the open repository field. We will review past studies of data reusers, specifically a qualitative study of 105 researchers from three disciplinary communities: quantitative social science, archaeology, and zoology. The study identified 12 types of context information that data reusers mention needing when deciding whether to reuse data. Next, we will use the context types to create a feature set and assess how data repositories provide the needed context information to users. Finally, using findings from our assessment, we will showcase desirable features in use to prototype the design of a reuser-oriented data repository that developers can use to improve their data repository interface.

Topics: Open Access, Research Data Management