project:chparlscraping

Differences

This shows you the differences between two versions of the page.

project:chparlscraping [2015/09/06 18:42] – [Team] typos jkproject:chparlscraping [2015/09/07 16:23] (current) yrochat
Line 20: Line 20:
 === Data === === Data ===
 Raw data are available as a single JSON file and its .csv counterpart. Because of size problems, we are exploring ways to produce several .csv files. Raw data are available as a single JSON file and its .csv counterpart. Because of size problems, we are exploring ways to produce several .csv files.
-**The [[https://github.com/douglas-watson/parl-scraping/tree/master/data|final folder for our data is on github]], the [[https://github.com/douglas-watson/parl-scraping/blob/master/data/merged-csv.zip|.csv files are here]].**+**The [[https://github.com/douglas-watson/parl-scraping/tree/master/data|final folder for our data is on github]], the [[https://github.com/douglas-watson/parl-scraping/blob/master/data/with-bio-split-csv.zip|.csv files split by session are here]].**
  
-== Structure of the main JSON (on Giovanni's side, complement with bio data from Jeremie) ==+== Structure of the main JSON (on Giovanni's side, complement with the bio data from the Parliament API) ==
  
 list of interventions, with the following fields: list of interventions, with the following fields:
Line 37: Line 37:
   * Data: transcript of the intervention   * Data: transcript of the intervention
   * Name: of the person speaking   * Name: of the person speaking
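
As a rough illustration of the structure above, the sketch below loads a list of interventions and counts interventions per speaker. Only the `Name` and `Data` fields come from this page; the sample records and the helper `count_by_speaker` are made up for the example and are not part of the project's code.

```python
import json
from collections import Counter

# Hypothetical sample in the shape described above: a JSON list of
# interventions. Only "Name" and "Data" are taken from the page; the
# values are invented for illustration.
SAMPLE = """
[
  {"Name": "Speaker A", "Data": "First transcript."},
  {"Name": "Speaker B", "Data": "Second transcript."},
  {"Name": "Speaker A", "Data": "Third transcript."}
]
"""


def count_by_speaker(interventions):
    """Return a Counter mapping each speaker name to a number of interventions."""
    return Counter(item["Name"] for item in interventions)


interventions = json.loads(SAMPLE)
counts = count_by_speaker(interventions)
for name, n in counts.most_common():
    print(name, n)
```

With the real file, `SAMPLE` would be replaced by the contents of the JSON file read from disk.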
- 
-== Graph data for Yannick == 
- 
-graph.csv: edge list with Source (bio url as id of the person replying) - Destination (bio url as id of the person who spoke before) - Subject (id of the subject under discussion) - Date (of intervention, YY.MM.DD)
-nodes.csv: node list with bio id - name - surname - canton - political group
  
 == Structure of the Parliament API data via Yannick == == Structure of the Parliament API data via Yannick ==
Line 55: Line 50:
 == Structure of the final files from Jérémie == == Structure of the final files from Jérémie ==
  
-1 JSON file + its .csv counterpart for each Parliament session from 1995.+1 JSON file + its .csv counterpart for each Parliament session of the National Council from 1995. 
 +The same datasets are also available split as one JSON/CSV file per legislative session. 
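
One possible way to produce such per-session files from a merged CSV is sketched below. This is an assumption about the approach, not the script the team used: the `Session` column name, the session ids `S1`/`S2`, and the helpers `split_by_session` and `to_csv_text` are all hypothetical.

```python
import csv
import io
from collections import defaultdict

# Hypothetical merged CSV. The "Session" column name and the ids S1/S2
# are assumptions; "Name" and "Data" are the fields listed on this page.
MERGED = """Session,Name,Data
S1,Speaker A,First transcript.
S1,Speaker B,Second transcript.
S2,Speaker A,Third transcript.
"""


def split_by_session(rows, key="Session"):
    """Group rows (dicts) by their session value, preserving row order."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    return dict(groups)


def to_csv_text(rows):
    """Serialize a non-empty list of dicts as CSV text with a header line."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=list(rows[0].keys()),
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


rows = list(csv.DictReader(io.StringIO(MERGED)))
per_session = split_by_session(rows)
session_csv = {s: to_csv_text(g) for s, g in per_session.items()}
for session, group in per_session.items():
    # In practice each text would be written to its own file.
    print(session, len(group))
```

Keeping the grouping separate from the serialization makes it easy to write the same groups out as JSON as well.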
 + 
 +== Graph data for Yannick == 
 + 
 +graph.csv: edge list with Source (bio url as id of the person replying) - Destination (bio url as id of the person who spoke before) - Subject (id of the subject under discussion) - Date (of intervention, YY.MM.DD) 
 +nodes.csv: node list with bio id - name - surname - canton - political group
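
The two files described above could be read into a simple weighted reply graph along these lines. This is a minimal sketch under stated assumptions: the header names (`Source`, `Destination`, `Subject`, `Date`, `id`, `name`, `surname`, `canton`, `group`) are guessed from the description, and every id and value in the sample is invented.

```python
import csv
import io
from collections import Counter

# Hypothetical file contents. Header names are assumed from the
# description above; all ids and values are invented.
GRAPH_CSV = """Source,Destination,Subject,Date
bio/2,bio/1,subj-10,15.09.01
bio/3,bio/2,subj-10,15.09.01
bio/2,bio/1,subj-11,15.09.02
"""
NODES_CSV = """id,name,surname,canton,group
bio/1,Ann,Alpha,VD,G1
bio/2,Bob,Beta,GE,G2
bio/3,Cyd,Gamma,ZH,G1
"""


def load_rows(text):
    """Parse CSV text with a header line into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text)))


edges = load_rows(GRAPH_CSV)
nodes = {row["id"]: row for row in load_rows(NODES_CSV)}

# Edge weight: how many times Source replied to Destination.
weights = Counter((e["Source"], e["Destination"]) for e in edges)

# In-degree: how many replies each person received.
replied_to = Counter(e["Destination"] for e in edges)

for person, n in replied_to.most_common():
    print(nodes[person]["name"], nodes[person]["surname"], n)
```

The `weights` counter can be handed to a graph library or exported again as a weighted edge list.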
  
 === Results visualization === === Results visualization ===
Line 71: Line 72:
  
   * A simple gender gap visualization for the current Parliament that kind of summarizes it all: <[[https://docs.google.com/spreadsheets/d/1MiO6w331UMGX4vYTyhsMs5uUAgAUCJfyzPkRqAUSjww/edit?usp=sharing|gsheet shalf]]>   * A simple gender gap visualization for the current Parliament that kind of summarizes it all: <[[https://docs.google.com/spreadsheets/d/1MiO6w331UMGX4vYTyhsMs5uUAgAUCJfyzPkRqAUSjww/edit?usp=sharing|gsheet shalf]]>
- 
 ===== Team ===== ===== Team =====
   * Giovanni Colavizza, [[https://github.com/Giovanni1085|github: Giovanni1085]]   * Giovanni Colavizza, [[https://github.com/Giovanni1085|github: Giovanni1085]]
Line 77: Line 77:
   * [[http://shalf.me|Yann Heurtaux]] [[https://twitter.com/shalf|@shalf]], [[https://github.com/shalf|github: shalf]]   * [[http://shalf.me|Yann Heurtaux]] [[https://twitter.com/shalf|@shalf]], [[https://github.com/shalf|github: shalf]]
   * Fabrice Hong, [[https://github.com/fabricehong|github: fabricehong]]   * Fabrice Hong, [[https://github.com/fabricehong|github: fabricehong]]
 +  * Jan Iwaszkiewicz, [[https://github.com/jan44|github: jan44]]
   * Jérémie Knüsel [[https://twitter.com/ambystome|@ambystome]], [[https://github.com/knuesel|github: knuesel]]   * Jérémie Knüsel [[https://twitter.com/ambystome|@ambystome]], [[https://github.com/knuesel|github: knuesel]]
   * Sylvain Moesching   * Sylvain Moesching
Line 82: Line 83:
   * [[http://yro.ch|Yannick Rochat]] [[https://twitter.com/yrochat|@yrochat]], [[https://github.com/yrochat|github: yrochat]]   * [[http://yro.ch|Yannick Rochat]] [[https://twitter.com/yrochat|@yrochat]], [[https://github.com/yrochat|github: yrochat]]
   * Douglas Watson, [[https://github.com/douglas-watson|github: douglas-watson]]   * Douglas Watson, [[https://github.com/douglas-watson|github: douglas-watson]]
-  * Jan Iwaszkiewicz, [[https://github.com/jan44|github: jan44]]+
  
 ===== Links ===== ===== Links =====
  • project/chparlscraping.1441557741.txt.gz
  • Last modified: 2015/09/06 18:42
  • by jk