project:schauspielhauswikidata

Differences

This shows you the differences between two versions of the page.

project:schauspielhauswikidata [2017/09/16 16:20] – [Performing arts and Wikidata] liowalter
project:schauspielhauswikidata [2020/03/17 11:55] (current) – [Data] birk
Line 5:
 ===== Data =====
  
-  * [[https://opendata.swiss/de/dataset/repertoire-des-schauspielhaus-zurich-1938-1968|Repertoire des Schauspielhaus Zürich, 1938-1968]]
+  * [[https://opendata.swiss/de/dataset/repertoire-des-schauspielhaus-zurich-1938-19682|Repertoire des Schauspielhaus Zürich, 1938-1968]]
  
  
Line 12:
   * OpenRefine to clean and reconcile data with wikidata
   * [[https://tools.wmflabs.org/wikidata-todo/quick_statements.php|Wikidata Quick Statements]] to add new items to wikidata
-  * [[https://tools.wmflabs.org/quickstatements/|Wikidata Quick Statements 2 Beat]] to add new items to wikidata in batch mode
+  * [[https://tools.wmflabs.org/quickstatements/|Wikidata Quick Statements 2 Beta]] to add new items to wikidata in batch mode
  
  
Line 28:
  
      
 +===== Methodology =====
 +
 +  - Load the data into OpenRefine.
 +  - Column by column (starting with the easier ones):
 +    - Reconcile against Wikidata.
 +    - Manually resolve entries that matched multiple Wikidata items.
 +    - Identify the items that are missing from Wikidata.
 +    - Load them into Wikidata using Quick Statements (Quick Statements 2 lets you retrieve the Q numbers of the newly created items).
 +    - Reconcile again in OpenRefine.
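
The "find out what is missing" step can also be checked outside OpenRefine. The following is a minimal Python sketch, not part of the project's tooling. It assumes the reconciled column has been exported to CSV with a hypothetical `name` column and a hypothetical `qid` column that is left empty when no Wikidata match was found; the column names and sample rows are placeholders.

```python
import csv
import io


def missing_labels(csv_text, name_col="name", qid_col="qid"):
    """Return the unique names that have no Q number, in first-seen order.

    The column names are hypothetical; adapt them to the actual export.
    """
    seen = set()
    missing = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        name = (row.get(name_col) or "").strip()
        qid = (row.get(qid_col) or "").strip()
        # Keep a name only if it is unmatched and not already listed.
        if name and not qid and name not in seen:
            seen.add(name)
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Placeholder rows: "Person A" is matched, the others are not.
    sample = "name,qid\nPerson A,Q-placeholder\nPerson B,\nPerson B,\nPerson C,\n"
    for label in missing_labels(sample):
        print(label)
```

Deduplicating before import matters here because the same person can appear in many rows, and each unmatched name should lead to at most one new item.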
 +
 +===== Results =====
 +
 +  * https://www.wikidata.org/wiki/Q39907209
 +  * Code to transform CSV into Quick Statements commands: https://github.com/j4lib/performing-arts/blob/master/create_quick_statements.php
 +===== Screenshots =====
 +
 +
 +==== Raw Data ====
 +{{:project:screenshot_132.png?direct&1000|}}
 +
 +==== Reconcile in OpenRefine ====
 +{{:project:screenshot_133.png?direct&1000|}}
 +
 +==== Choose the corresponding type ====
 +For works, the author can be used as an additional property during reconciliation.
 +
 +{{:project:screenshot_134.png?direct&1000|}}
 +
 +==== Manually resolve multiple matches ====
 +{{:project:screenshot_136.png?direct&1000|}}
 +
 +==== Import into Wikidata with Quick Statements ====
 +
 +Step 1
 +{{:project:screenshot_138.png?direct&1000|}}
 +
 +
 +
 +  * Len: English label
 +  * P31: instance of
 +  * Q5: human
 +  * P106: occupation
 +  * Q1323191: costume designer
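
For illustration, the commands listed above can be generated from a list of names. This is a minimal Python sketch and not the project's PHP script. It assumes the tab-separated Quick Statements version 1 text syntax, in which a `CREATE` line is followed by `LAST` lines that refer to the newly created item. The function names and sample names are hypothetical.

```python
def person_statements(name, occupation_qid="Q1323191"):
    """Build Quick Statements commands that create one human item.

    Assumes the tab-separated version 1 syntax, where LAST refers to the
    item created by the preceding CREATE line.
    """
    # Replace double quotes so the quoted label stays well-formed.
    label = name.replace('"', "'")
    return [
        "CREATE",
        f'LAST\tLen\t"{label}"',          # Len: English label
        "LAST\tP31\tQ5",                  # P31: instance of, Q5: human
        f"LAST\tP106\t{occupation_qid}",  # P106: occupation
    ]


def batch(names):
    """Join the commands for several names into one pasteable block."""
    lines = []
    for name in names:
        lines.extend(person_statements(name.strip()))
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder names, not taken from the repertoire data.
    print(batch(["Person A", "Person B"]))
```

The default occupation is the costume designer item from the list above; another Q number can be passed for other roles.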
 +
 +Step 2
 +{{:project:screenshot_139.png?direct&1000|}}
 +
 +Step 3 (the Q number of the new item can be read from this screen)
 +
 +{{:project:screenshot_140.png?direct&1000|}}
 +
 +
 +
 +
 ===== Team =====
  
  • project/schauspielhauswikidata.1505571652.txt.gz
  • Last modified: 2017/09/16 16:20
  • by liowalter