Data Science & Metadata Research
To be discoverable by today’s online users, traditional library data must be transformed. OCLC Research analyzes bibliographic data to derive new meaning, insights, and services for use by libraries and information seekers. This work includes special projects in metadata enrichment, authorities & identities, linked data, subjects & classification, and data analysis.
Presentations

Lessons from Representing Library Metadata in OCLC Research’s Linked Data Wikibase Prototype (video)
Hamburg, Germany
This presentation highlights key lessons from OCLC Research’s Linked Data Wikibase Prototype (“Project Passage”), a 10-month pilot done in 2018 in collaboration with metadata specialists in 16 US libraries.
Additional Materials:
PowerPoint Slides (11MB)
Topics: Linked Data

What are the entities that matter, and how much should we say about them?
Virtual
This presentation discusses the work of catalogers who participated in OCLC's Project Passage in 2018. It develops the theme of identification of "the entities that matter" and concludes with a brief update on OCLC's post-Passage activities involving resource description in Wikibase.
Topics: Linked Data, Wikimedia

How IIIF standards improve search and discovery for Cultural Heritage collections
Tampa, Florida, USA
IIIF is an emerging standard for sharing digital structural metadata. OCLC is an active member of the IIIF community and has been working to integrate the standard into its services and products. This talk discusses the experimental IIIF work being done by OCLC Research to help test evolving IIIF standards and integrate them into production services.
Topics: IIIF, Linked Data

Introducing the CONTENTdm Linked Data Pilot Project
Indianapolis, IN, USA
The CONTENTdm Linked Data pilot explores how to convert CONTENTdm data into linked data, how to curate the data in the Wikibase infrastructure, and how to use the data to improve end-user experiences in CONTENTdm. This presentation covers the background research that led to the development of the pilot, the plans for the 3 phases of the pilot, and some early feedback from one of the pilot participants.
Topics: Linked Data, IIIF

IIIF Change Discovery in Action: Findings from an OCLC Research Experiment
Göttingen, Germany
OCLC Research is participating in the IIIF Discovery Working Group's ongoing effort to develop a "Change Discovery API". The Change Discovery API will provide the information needed to discover and subsequently make use of IIIF resources.
Topics: IIIF, Linked Data

Fast and Discriminative Semantic Embedding
Gothenburg, Sweden
We present a novel, effective, and efficient method for term and document embedding. Our experiments show that it outperforms state-of-the-art methods on the STS benchmark and on subject prediction when trained on the same datasets, while being computationally cheaper by orders of magnitude.
Topics: Semantic Embedding

An Innovative Approach to Scalable Semantic Embedding
Pittsburgh, Pennsylvania, USA
Semantic search, in addition to keyword-based search, is a desirable feature for many digital library systems. Even in the largely structured world of library data, a lot of tacit information remains locked in free-text fields. Embedding words and texts in compact, semantically meaningful vector spaces makes semantic similarity and relatedness computable, which would make search more intelligent.
Topics: Semantic Embedding

Ideation to Prototype: Turning new ideas into useful services
Boston, Massachusetts, USA
Using the Wikibase Linked Data Prototype as an example, Pace will outline 5 simple steps for managing a complex project that will improve your chances of getting from an experiment to a production service.
Topics: Linked Data

Taking Advantage of Multilingualism Support in Wikidata
Boston, MA (USA)
View highlights of some key lessons from the OCLC Research Linked Data Wikibase Prototype (“Project Passage”) regarding Wikidata’s multilingualism support.
Topics: Wikimedia, Linked Data

What are the "entities that matter" to this object? Reflections on the OCLC Linked Data Wikibase Pilot
Boston, Massachusetts, USA
This presentation shares highlights of the soon-to-be-published OCLC research report on Project Passage, OCLC’s pilot study of the metadata creation workflow using the Wikibase platform, which was completed in September 2018.
Topics: Linked Data