project:cumulizer

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
project:cumulizer [2013/03/22 16:32] lolegproject:cumulizer [2019/02/03 21:16] (current) – old revision restored (2014/08/26 23:55) loleg
Line 1: Line 1:
 ===== Cumulizer ===== ===== Cumulizer =====
  
-This is a Make OpenData.ch Hackday Bern 2013 Finance-themed project with a focus on analyzing personal shopping data. We have started by aggregating data available from the [[http://www.migros.ch/cumulus/de/|Cumulus]] purchase points program run by Migros, but would be interested in expanding the concept to Coop Superpunkt and others.+This is a Make OpenData.ch [[event:2013-03|2013 finance hackdays]] project with a focus on analyzing personal consumer data. We have started by aggregating shopping receipts available from the [[http://www.migros.ch/cumulus/de/|Cumulus]] incentive points program run by Migros, one of the largest supermarkets chains in the country and would be interested in expanding the concept to others.
  
-Fundamentally, we believe that the data we help to collect when we make purchases is relevant and useful to us as individual shoppers, and we want to try provoking popular action to aggregate this (anonymously) with others in our community. We want to learn some of what the shops are already learning about us - what, when and where we make our purchases. Since their data is not open, we need to liberate our personal data by uploading it to a shared repository.+{{:project:cumulizer.png?400|}}
  
-Codehttps://github.com/cstuder/cumulizer+This project is currently no longer live. We are looking for support to bring this to the general public. 
 + 
 +We believe that the data we help to collect while making purchases can be relevant and useful to us individual shoppers, and we want to try starting popular action to aggregate this (anonymously) with others in our community. Although it is unlikely that we will get broad access to this highly valuable data, we want to at least learn some of the same things that shops are already learning about us: what, when and where we make our purchases, what are the patterns, how do we compare. To get around commercial closed data policy, we can liberate, i.e. open our personal data by uploading it to a shared repository. 
 + 
 +{{:project:finance:cumulus-sketch-1.jpg?200|}} {{:project:cumulizer_heatmap.jpg?275|Cumulizer Heatmap}}
  
 ===== Data ===== ===== Data =====
  
-Right now any customer who takes part in the Cumulus program can access detailed data about their purchases in CSV format. This is an excellent initiative from Migros, but the web site is not very user friendly so we would want to automate this process to make it easier for users. +Right now any customer who takes part in the Cumulus program can access detailed data about their purchases in CSV format. This is an excellent initiative from Migros which is somewhat hampered by a not very user friendly web application - we want to automate this process to make it easier for users.
  
-Right now here are the steps to collect your data:+Right now here are the steps to collect your personal data:
  
   - Log into the [[http://www.migros.ch/cumulus/de/secure/punktestand.html|Cumulus program]] using your customer number and a password on a paper mail-out to create an M-Connect account if you have not already   - Log into the [[http://www.migros.ch/cumulus/de/secure/punktestand.html|Cumulus program]] using your customer number and a password on a paper mail-out to create an M-Connect account if you have not already
   - Under **Mein Konto - Kassenbons** you can browse and view details of your [[https://www.migros.ch/cumulus/de/secure/kassenbons.html|shopping receipts]]   - Under **Mein Konto - Kassenbons** you can browse and view details of your [[https://www.migros.ch/cumulus/de/secure/kassenbons.html|shopping receipts]]
-  - Browsing month-by-month, and for every page, you need to click **Alle**, and then **Ausgewählte Kassenbons als Excel-Liste (csv)** to download a file with the details for those shopping trips +  - Browsing month-by-month, and for every page, you need to click **Alle**, and then **Ausgewählte Kassenbons als Excel-Liste (csv)** (not "Übersicht"to download a file with the details for those shopping trips
-  - You are now ready to share your personal data with us.. details on next steps forthcoming+
  
 +{{:project:finance:cumulus-howto.png?100|}}
 +
 +You are now ready to share your personal data with us. At the moment we do not have a live application, but if you are really eager to help, please send us your [[data@cumulizer.eu|CSV files directly]].
 +
 +===== Legal status =====
 +
 +Here is what Migros has to say about data usage in their [[http://www.migros.ch/cumulus/de/ueber-cumulus/datenschutz.html|data protection policy]]:
 +
 +> Das Datenschutzgesetz bestimmt in Artikel 8, dass jede Person von Inhabern einer Datensammlung Auskunft über die zu ihrer Person gespeicherten und bearbeiteten Daten verlangen kann. Diese Auskunftsbegehren müssen schriftlich eingereicht werden. Auf der [[http://www.edoeb.admin.ch/index.html?lang=de|Homepage des eidgenössischen Datenschutz- und Öffentlichkeitsbeauftragten]] finden Sie entsprechende Musterschreiben.
 +
 +(rough translation: they link to the government standard form which you can fill out to request an export of all your personal data)
 +
 +In the [[http://www.migros.ch/cumulus/de/ueber-cumulus/geschaeftsbedingungen.html|Terms and Conditions]] and [[http://www.migros.ch/de/rechtliches-impressum/rechtliches-impressum.html|Impressum]] there is no mention of restrictions on the use of the personal Cumulus data. At the moment we assume, and will contact their management to check, that they will not restrict us from using the receipt data from our personal accounts any way we can, including sharing it publicly.
 +
 +We will obviously not try to collect any other person's data without their full cooperation, attempt to circumvent any of their security, or put any automated scraping/spidering in place. This requires clear statements and terms of use on the site. We are hoping to find a way to cooperate with the Cumulus program, and ensure that we do not abuse the allowances they have made that make this project possible.
 +
 +If you are able to provide additional legal input, [[data@cumulizer.eu|please do]].
 +
 +===== Project status =====
 +
 +We have created an initial working prototype which gives a view into the data of five Cumulus users who have volunteered to submit one year of data anonymously to seed the project.
 +
 +  * To complete Phase I we need to finish developing the dashboard tools that should conveniently and accurately visualize their purchases over time, sorted by category, and so on.
 +  * In Phase II we are aiming to let users upload their own data, so that we get aggregate data from the community, allowing comparison between users who may wish to add a simple profile, and show common trends.
 +  * In Phase III we will be able to link our data with other sources, showing additional information about user purchases, and how buying trends align with, for example [[http://worldbank.270a.info/classification/indicator.html|semantically interlinked economic data]] (world bank via [[data:270a]]) and open [[https://explore.data.gov/Income-Expenditures-Poverty-and-Wealth/Consumer-Expenditure-Survey/3chw-d2fd|consumer surveys]] (data.gov). We will also create APIs and, having a sufficient user base, the project itself will be interesting to social scientists who are reportedly starved for accurate personal economic data.
 +
 +If you'd like to take part in the planning, or otherwise contribute your expertise, please [[data@cumulizer.eu|contact us]].
 +
 +===== Technical information =====
 +
 +Here is a basic guide to our JSON API:
 +
 +^Link^Info^
 +|/dashboard/simpleupload|Simple upload form|
 +|/dashboard/heatmap|Heatmap of the stores|
 +|/maintenance/geocodestores|Geocode store adresses|
 +|/maintenance/autocategorize|Start autocategorization|
 +|/api?action=summary|General statistics|
 +|/api?action=stores|Stores and sales|
 +|/api?action=monthlypurchases|List of purchases by month|
 +|/api?action=spendings|Monthly spending sums by category|
 +|/api?action=categories|List of all categories|
 +
 +For more details, please see the source.
 +
 +<GITHUB cstuder/cumulizer>
 +
 +==== Installation ====
 +
 +  - Download Cumulizer sources from [[https://github.com/cstuder/cumulizer|GitHub]]
 +  - Install [[http://getcomposer.org/|composer]] and [[http://getcomposer.org/doc/00-intro.md#using-composer|run it]] at the root of the project
 +  - Install the project on your Apache/MySQL/PHP server
 +  - Create a database, modify the top of the import script if you don't use 'cumulizer' as the name
 +  - Import the initial schema (_docs/cumulizer.sql)
 +  - Modify username and password (application/config/database.php)
 +  - Upload your CSV receipts (dashboard/simpleupload)
 +  - Run geotagging generator (admin/geocodestores)
 +  - Your dashboard will be ready at /
  
 ===== Team ===== ===== Team =====
Line 27: Line 88:
 ===== Links ===== ===== Links =====
  
 +  * [[http://migros.ch|Migros]] is one of Switzerland's top three supermarkets and has been immensely helpful in creating a legal interface to download our personal data
   * [[http://solikarte.ch/de/|Solikarte]] is a related initiative for anonymously collecting points for charitable causes, and would be a possible data source and partner for this project   * [[http://solikarte.ch/de/|Solikarte]] is a related initiative for anonymously collecting points for charitable causes, and would be a possible data source and partner for this project
-  * [[http://blog.okfn.org/2013/03/08/opening-product-data-for-a-more-responsible-world/|Opening Product Data for a more Responsible World]] (OKFN Blog) talks about the opportunities for open product data +  * [[http://www.onereceipt.com/|One Receipt]] lets customers in the US to see all their purchases in one place 
-  * [[https://www.eff.org/deeplinks/2012/09/deep-dive-facebook-and-datalogix-whats-actually-getting-shared-and-how-you-can-opt|A Deep Dive into Facebook and Datalogix: What's Actually Getting Shared and How You Can Opt Out]] from the Electronic Frontier Foundation explores the marketing bonanza that is consumer data collection, and warns of the privacy risks +  * [[http://blog.okfn.org/2013/03/08/opening-product-data-for-a-more-responsible-world/|Opening Product Data for a more Responsible World]] (OKFN Blog) talks about the opportunities for open product data, and the [[http://product-open-data.com/|Product Open Data]] project 
 +  * [[https://www.eff.org/deeplinks/2012/09/deep-dive-facebook-and-datalogix-whats-actually-getting-shared-and-how-you-can-opt|A Deep Dive into Facebook and Datalogix]] (Electronic Frontier Foundationexplores the marketing bonanza that is consumer data collection, and warns of the privacy risks 
 +  * [[http://www.theatlantic.com/technology/archive/2013/03/what-does-the-consumer-data-industry-know-about-you/273829/|What does the consumer data industry know about you]] (The Atlantic) explores a few more perspectives on the subject
  
-{{tag>status:concept needs:design needs:expert}}+{{tag>status:demo needs:expert finance}}
  • project/cumulizer.1363966355.txt.gz
  • Last modified: 2013/03/22 16:32
  • by loleg