project:mutilingual_data_search

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
project:mutilingual_data_search [2018/10/27 11:52] – [Multilingual Data Search based on Wikidata] elenaproject:mutilingual_data_search [2018/11/20 08:25] (current) liowalter
Line 1: Line 1:
-[[http://example.com|External Link]] +===== Wikidata-based multilingual library search ===== 
-===== Multilingual Data Search based on Wikidata =====+ 
 + 
 +{{ :project:poster.jpg?600 |}}
  
 In Switzerland each linguistic region is working with different authority files for authors and organizations, situation which brings difficulties for the end user when he is doing a search. In Switzerland each linguistic region is working with different authority files for authors and organizations, situation which brings difficulties for the end user when he is doing a search.
Line 19: Line 21:
   - create a multilingual search prototype (based on Swissbib model)   - create a multilingual search prototype (based on Swissbib model)
  
- 
-Add sections below if you need more room. Include links to your demo and/or source code, relevant documentation, tools, etc. 
  
 ===== Data ===== ===== Data =====
Line 26: Line 26:
 {{:project:stats.jpg?nolink&300|}} {{:project:stats.jpg?nolink&300|}}
  
 +Number of ID in VIAF
 +  * BNF: 4847978
 +  * GND: 8922043
 +  * RERO: 255779
  
-Number of RERO id in VIAF 255779 +Number of ID in Wikidata from 
- +  * BNF432273 
-Number of GND id in VIAF 8922043 +  GND: 693381 
- +  * RERO: 2145 
-Number of BNF id in VIAF : 4847978+  * ICCU: 30047 
 +  * VIAF: 1319031 (many duplicates: a WD-entity can have more than one VIAF ID)
  
  
Line 59: Line 64:
 </code> </code>
  
 +Email from the GND :
  
  
 +<code>
 +There is currently no process that guarantees 100% coverage of GND entities in wikibase. The existing links between wikibase and GND entries come mostly from manually edited Wikipedia entries.
 +</code>
  
  
-Number of GND id in Wikidata: 693381 
- 
-Number of VIAF id in Wikidata: 1319031  
  
-Number of BNF id in Wikidata: 432273 
  
-Number of ICCU id (SBN ID) in Wikidata: 30047 
  
  
Line 75: Line 79:
  
 ===== User Interface ===== ===== User Interface =====
-There are several different target users: the librarians who currently use all kinds of different systems and the end user, who wants to search for information or to locate a book in a nearby library.+ 
 +**There are several different target users:** the librarians who currently use all kinds of different systems and the end user, who wants to search for information or to locate a book in a nearby library.
  
 **Librarian**: The question of process is the key challenge concerning the librarian user. At present some Swiss librarians create authority records and some don't.  New rules and processes for creating authority files in GND , BNF, etc will change their work methods.  The process of creating local Swiss authority files will be entirely revamped. Fragmented Swiss regional authority files will disappear, and be replaced by either the German, French, Italian, American etc national authority files or by direct creation in Wikidata by the local librarian. (Wikidata will serve as central repository for all autority IDs). **Librarian**: The question of process is the key challenge concerning the librarian user. At present some Swiss librarians create authority records and some don't.  New rules and processes for creating authority files in GND , BNF, etc will change their work methods.  The process of creating local Swiss authority files will be entirely revamped. Fragmented Swiss regional authority files will disappear, and be replaced by either the German, French, Italian, American etc national authority files or by direct creation in Wikidata by the local librarian. (Wikidata will serve as central repository for all autority IDs).
Line 84: Line 89:
 The model for the multilingual user interface is **SwissBib**, the "catalog of Swiss univerity libraries, the Swiss national library, several cantonal libraries and other institutions". **The objective is to keep the look and functionalities of the existing website**, which includes multilingual display of labels in English, French, German and Italian.  The model for the multilingual user interface is **SwissBib**, the "catalog of Swiss univerity libraries, the Swiss national library, several cantonal libraries and other institutions". **The objective is to keep the look and functionalities of the existing website**, which includes multilingual display of labels in English, French, German and Italian. 
  
-**What changes is the source of information about the author** which will in the future be taken from the BNF for French, the GNB for German, and the LCCN for English. (In the proof of concept pilot, only the author name will be concerned.) +**What changes is the source of information about the author** which will in the future be taken from the BNF for French, the GNB for German, and LCCN for English. (In the proof of concept pilot, only the author name will be concerned.) 
  
 The list of books and libraries will continue to function as before, with no changes. The list of books and libraries will continue to function as before, with no changes.
Line 107: Line 112:
 Other design question: Should the selection of the language of the book be moved up on the page? Other design question: Should the selection of the language of the book be moved up on the page?
  
-Screenshots of Swissbib at present+=====Prototype===== 
 + 
 +(Translations from Wikidata into French) 
 + 
 +1. **Schweizerisches Landesmuseum** 
 + 
 +http://feature.swissbib.ch/Record/110393589 
 + 
 +{{ :project:fr_landesmuseum.jpg?nolink |}} 
 + 
 + 
 +2. **Wikimedia Foundation** 
 + 
 +http://feature.swissbib.ch/Record/070092974 
 + 
 +{{ :project:fr_wikimedia_foundation.jpg?nolink |}} 
 + 
 + 
 +3. **Chocoladefabriken Lindt & Sprüngli AG** 
 + 
 +http://feature.swissbib.ch/Record/279360789 
 + 
 +ATTENTION: **Multiple authors** 
 + 
 +{{ :project:fr_chocolate_multiple_authors.jpg?nolink&300 |}} 
 + 
 + 
 +4. **Verband schweizerischer Antiquare und Kunsthändler** 
 + 
 +ATTENTION: **NO French label in Wikidata** 
 + 
 +http://feature.swissbib.ch/Record/107734591 
 + 
 +{{ :project:fr_verband.jpg?nolink |}} 
 + 
 + 
 + 
 + 
  
  
Line 120: Line 163:
  
  
 +Lookup Qcode from GND ID
 +<code>
 +SELECT DISTINCT ?item ?itemLabel  WHERE {
 +  ?item wdt:P227 "1027690041".
 +  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
 +}
 +</code>
 +===== Integration of RERO person/organisation data into Wikidata =====
 +
 +==== Methodology ====
 +
 +4 cases
 +
 +1. **RERO authorities are in Wikidata** with RERO ID
 +
 +  * 2145 items
 +
 +2. **RERO authorities are in Wikidata** without RERO ID but with VIAF ID
 +
 +  * 1316347 items (without deduplication)
 +  * add only ID possibles (PetScan)
 +
 +3. **RERO authorities are in Wikidata** without RERO or VIAF ID
 +
 +  * reconcialiation with OpenRefine
 +
 +4. **RERO authorities are not in Wikidata**
 +
 +  * Quickstatements or mass import
 +
 +
 +===== Demo / Code / Final presentation =====
  
 +  * Demo http://feature.swissbib.ch/Record/317008587 and you can search other records
 +  * Code https://github.com/swissbib/vufind/tree/feature/wikidata-based-multilingual
 +  * Final presentation : {{ :project:presentation.pdf |}}
  
 ===== Team ===== ===== Team =====
Line 128: Line 206:
   * Lionel Walter   * Lionel Walter
   * Laurel Zuckerman   * Laurel Zuckerman
 +  * Jacqueline Martinelli
      
 {{tag>status:concept needs:dev needs:design needs:data needs:expert glam}} {{tag>status:concept needs:dev needs:design needs:data needs:expert glam}}
  
  
-==== Tools to try ==== 
  
-https://www.npmjs.com/package/wikidata-entity-lookup 
  
  
  • project/mutilingual_data_search.1540633958.txt.gz
  • Last modified: 2018/10/27 11:52
  • by elena