OCLC Linked Data Research
OCLC production units and OCLC Research take part in Linked Data research and standards work and are exploring Linked Data applications. This activity page describes a variety of OCLC Linked Data activities, including those that OCLC Research leads or is closely involved with.
- Publishing Linked Data
  - Schema.org markup in WorldCat.org pages [http://www.oclc.org/news/releases/2012/201238.en.html]
  - Downloadable WorldCat data set [http://www.oclc.org/data/data-sets-services.en.html]
- Participation in Linked Data Standards Work
  - W3C Library Linked Data Incubator Group (Completed)
  - W3C Schema Bib Extend Group [http://www.w3.org/community/schemabibex/]
- Linked Data Webinars
- "Linked Data in VRA Core 4.0: Converting VRA XML Records into RDF/XML" by Jeff Mixter [http://jmixter.s3-website-us-east-1.amazonaws.com/thesis/]
- OCLC Research Linked Data Survey Results (.xlsx: 94kB)
Linked Data Survey Blog Series
- Linked Data Survey results 1 – Who’s doing it
- Linked Data Survey results 2 – Examples in production
- Linked Data Survey results 3 – Why and what institutions are consuming
- Linked Data Survey results 4 – Why and what institutions are publishing
- Linked Data Survey results 5 – Technical details
- Linked Data Survey results 6 – Advice from the implementers
Linked Data is an approach to exposing data in machine-readable form in which the data is "dereferenceable": URIs are an integral part of the exposed data, and external applications can use those URIs to perform actions such as retrieving data or connecting the same, similar, or related data from multiple Linked Data stores.
This approach to exposing, sharing, and connecting data has become increasingly popular in recent years, and more and more agencies are publishing data which adheres to Linked Data principles as articulated by Sir Tim Berners-Lee:
Linked Data: Design Issues (Sir Tim Berners-Lee)
- Use URIs to identify things.
- Use HTTP URIs so that these things can be referred to and looked up ("dereferenced") by people and user agents.
- Provide useful information about the thing when its URI is dereferenced, using standard formats such as RDF/XML.
- Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web.
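As a rough illustration of the second and third principles, the sketch below prepares an HTTP request that asks a server for an RDF description of a URI rather than an HTML page. The function name `build_dereference_request` and the particular `Accept` value are assumptions made for this example, and the VIAF URI is used only as a sample identifier; the request is built but not sent, so nothing here claims how any particular server responds.

```python
from urllib.request import Request

# Media types a Linked Data client might ask for. The ordering and the
# q-values are illustrative assumptions, not a requirement of Linked Data.
RDF_ACCEPT = "application/rdf+xml, text/turtle;q=0.9, application/ld+json;q=0.8"


def build_dereference_request(uri: str) -> Request:
    """Prepare (but do not send) an HTTP GET that asks for RDF instead of HTML."""
    if not uri.startswith(("http://", "https://")):
        # Principle 2: use HTTP URIs so that the thing can be looked up.
        raise ValueError("Linked Data URIs should be HTTP(S) URIs: " + uri)
    return Request(uri, headers={"Accept": RDF_ACCEPT})


request = build_dereference_request("http://viaf.org/viaf/89612684")
print(request.get_method(), request.full_url)
print("Accept:", request.get_header("Accept"))
```

Passing the prepared request to `urllib.request.urlopen` would perform the actual lookup. Whether a given server honors the `Accept` header, and which RDF serialization it returns, depends on that server.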
Linked Data is about communities agreeing on the semantics of their common data, adopting the naming patterns of other communities where their semantics agree, and mapping or extending those vocabularies when necessary. For example, the library community has a dozen semantic distinctions for the word "title" (Uniform Title, Spine Title, Running Title, etc.), but they can probably all map to Dublin Core Title. This allows other communities to take a piece of data marked as a bib:SpineTitle and treat it as a more specific kind of the dc:Title that they have already been using. The community work that makes this all happen is one form of networking. The semantic mapping and data sharing across communities is another form of networking.
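The title example can be sketched as a small lookup table that maps specialised properties onto the general one. This is only an illustration: the `bib:` property names follow the example in the text and are not taken from a published vocabulary, `ex:book1` is a made-up subject, and `REFINES` and `generalize` are hypothetical names.

```python
# Hypothetical mapping from specialised library title properties to the
# general Dublin Core property they map to. Names follow the text's
# example and are illustrative only.
REFINES = {
    "bib:UniformTitle": "dc:Title",
    "bib:SpineTitle": "dc:Title",
    "bib:RunningTitle": "dc:Title",
}


def generalize(statements):
    """Return the statements plus the broader ones implied by REFINES."""
    result = list(statements)
    for subject, prop, value in statements:
        broader = REFINES.get(prop)
        if broader is not None and (subject, broader, value) not in result:
            result.append((subject, broader, value))
    return result


# One made-up statement that uses the specialised property.
record = [("ex:book1", "bib:SpineTitle", "War and Peace")]
for statement in generalize(record):
    print(statement)
```

A consumer that only understands `dc:Title` can then read the generalized statements and ignore the library-specific distinction, while the original statement stays available to those who need it.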
A third form of networking is the web of relations we create in our data when we use URIs to name things and use those URIs where we formerly used strings of content. Instead of using the composer name "Dmitri Shostakovich", which is subject to many misspellings, we can now use a VIAF URI (http://viaf.org/viaf/89612684) to identify him and have a much greater chance of spotting other references to Shostakovich when the same URI is used. Even when a different URI is used, there are ways to indicate that the two URIs identify the same person. This weaving together of our data through URIs is the third form of networking.
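A minimal sketch of this idea follows, assuming three made-up catalogue records and one made-up "same as" statement. The example.org URI, the record fields, and the function names are all hypothetical; only the VIAF URI comes from the text. Matching on name strings would miss the spelling variants, while matching on URIs does not.

```python
# The VIAF URI from the text, used as the shared identifier.
VIAF_SHOSTAKOVICH = "http://viaf.org/viaf/89612684"

# Made-up records from hypothetical catalogues. The name strings differ,
# but each record also carries a URI for the composer.
records = [
    {"title": "Symphony No. 5", "composer_name": "Dmitri Shostakovich",
     "composer_uri": VIAF_SHOSTAKOVICH},
    {"title": "String Quartet No. 8", "composer_name": "Dmitrij Schostakowitsch",
     "composer_uri": VIAF_SHOSTAKOVICH},
    {"title": "Piano Quintet", "composer_name": "Shostakovich, D.",
     "composer_uri": "http://example.org/people/shostakovich"},
]

# Assumed "same as" statement linking a local URI to the VIAF URI.
SAME_AS = {"http://example.org/people/shostakovich": VIAF_SHOSTAKOVICH}


def canonical(uri):
    """Follow same-as links until a URI with no further link is reached."""
    seen = set()
    while uri in SAME_AS and uri not in seen:
        seen.add(uri)  # guard against accidental cycles in the links
        uri = SAME_AS[uri]
    return uri


def works_by(uri, records):
    """Titles of all records whose composer URI resolves to the same URI."""
    target = canonical(uri)
    return [r["title"] for r in records if canonical(r["composer_uri"]) == target]


print(works_by(VIAF_SHOSTAKOVICH, records))
```

The third record uses a different URI, yet it is still found because the same-as link resolves it to the VIAF identifier.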
In our opinion, Schema.org is currently the best/simplest vocabulary to use as a starting point for marking up Linked Data. We are using it in all our new work, including the Virtual International Authority File (VIAF), WorldCat Identities and WorldCat.org.
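To give a feel for what Schema.org markup can look like, the sketch below builds a small description of a musical work and serializes it as JSON-LD. It is an illustration only and does not reproduce the markup OCLC publishes: the choice of JSON-LD as the serialization, the function name `describe_composition`, and the sample work are assumptions. `MusicComposition`, `Person`, `name`, and `composer` are Schema.org terms.

```python
import json


def describe_composition(title, composer_name, composer_uri):
    """Build a Schema.org description of a musical work as a JSON-LD dict."""
    return {
        "@context": "https://schema.org",
        "@type": "MusicComposition",
        "name": title,
        "composer": {
            "@type": "Person",
            "@id": composer_uri,  # a URI instead of a bare name string
            "name": composer_name,
        },
    }


doc = describe_composition(
    "Symphony No. 5",
    "Dmitri Shostakovich",
    "http://viaf.org/viaf/89612684",
)
print(json.dumps(doc, indent=2))
```

Because the composer is identified by a URI in `@id`, a consumer can connect this description to other data about the same person even if the name is spelled differently elsewhere.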
Linked Data offers the potential for agencies and communities to publish information in a manner that permits far greater utility "in the flow" of the network. In particular, unexpected connections, uses and value may be realized by many parties, including parties with which the hosting/publishing agency might not normally have had contact.
OCLC Research is exploring Linked Data from a variety of angles: as a publisher, consumer, applications builder, and project partner, and through our involvement in Linked Data work with standards bodies such as the W3C. This work is shaping and informing OCLC Research's and OCLC's thinking and direction with respect to our prototypes, experimental datasets, products, and services.
OCLC Research Projects
OCLC Production Projects
- OCLC Developer Network (Developer services)
- Linked Data and VIAF, Thom Hickey, OCLC EMEA Regional Council Meeting, 2 March 2011, Deutsche Nationalbibliothek, Frankfurt (Germany)
- OCLC Open Source Linked Data Framework, Ralph LeVan, Access 2010, 15 October 2010, University of Winnipeg (Canada). Speaker's notes are available.
- Linked Data 2, Ralph LeVan, OCLC Research TAI CHI Webinar, 1 July 2010
- A Gentle Introduction to Linked Data, Ralph LeVan, OCLC Research TAI CHI Webinar, 27 May 2010
- OCLC Linked Data
- OCLC Developer Network
- linkeddata.org: Linked Data - Connect Distributed Data across the Web
Most recent updates: Page content: 2014-09-07