| project:cumulizer [2013/03/23 15:35]  –  birdy1976 | project:cumulizer [2019/02/03 21:16] (current)  – old revision restored (2014/08/26 23:55) loleg | 
|---|
| ===== Cumulizer ===== | ===== Cumulizer ===== | 
|  |  | 
| This is a Make OpenData.ch Hackday Bern 2013 Finance-themed project with a focus on analyzing personal shopping data. We have started by aggregating data available from the [[http://www.migros.ch/cumulus/de/|Cumulus]] purchase points program run by Migros, but would be interested in expanding the concept to Coop Superpunkt and others. | This is a Make OpenData.ch [[event:2013-03|2013 finance hackdays]] project with a focus on analyzing personal consumer data. We have started by aggregating shopping receipts available from the [[http://www.migros.ch/cumulus/de/|Cumulus]] incentive points program run by Migros, one of the largest supermarket chains in the country, and would be interested in expanding the concept to others. | 
|  |  | 
| {{:project:finance:cumulus-sketch-1.jpg?200|}} | {{:project:cumulizer.png?400|}} | 
|  |  | 
| Fundamentally, we believe that the data we help to collect when we make purchases is relevant and useful to us as individual shoppers, and we want to try provoking popular action to aggregate this (anonymously) with others in our community. We want to learn some of what the shops are already learning about us - what, when and where we make our purchases. Since their data is not open, we need to liberate our personal data by uploading it to a shared repository. | This project is no longer live. We are looking for support to bring this to the general public. | 
|  |  | 
| <GITHUB cstuder/cumulizer> | We believe that the data we help to collect while making purchases can be relevant and useful to us as individual shoppers, and we want to try starting popular action to aggregate this (anonymously) with others in our community. Although it is unlikely that we will get broad access to this highly valuable data, we want to at least learn some of the same things that shops are already learning about us: what, when and where we make our purchases, what the patterns are, and how we compare. To get around closed commercial data policies, we can liberate (i.e. open) our personal data by uploading it to a shared repository. | 
|  |  | 
|  | {{:project:finance:cumulus-sketch-1.jpg?200|}} {{:project:cumulizer_heatmap.jpg?275|Cumulizer Heatmap}} | 
|  |  | 
| ===== Data ===== | ===== Data ===== | 
| {{:project:finance:cumulus-howto.png?100|}} | {{:project:finance:cumulus-howto.png?100|}} | 
|  |  | 
| You are now ready to share your personal data with us. At the moment we do not have a live application, but if you are really eager to help, please send us your CSV files directly to //cstuder AT existenz DOT ch// | You are now ready to share your personal data with us. At the moment we do not have a live application, but if you are really eager to help, please send us your [[data@cumulizer.eu|CSV files directly]]. | 
|  |  | 
| ===== Legal status ===== | ===== Legal status ===== | 
| (rough translation: they link to the government standard form which you can fill out to request an export of all your personal data) | (rough translation: they link to the government standard form which you can fill out to request an export of all your personal data) | 
|  |  | 
| In the [[http://www.migros.ch/cumulus/de/ueber-cumulus/geschaeftsbedingungen.html|Terms and Conditions]] and [[http://www.migros.ch/de/rechtliches-impressum/rechtliches-impressum.html|Impressum]] there is no mention of restrictions on the use of the personal Cumulus data. At the moment we assume, and will contact their management to check, that they will not restrict us from using the receipt data from our personal accounts any way we can, including sharing it publicly. We will obviously not try to collect any other person's data without their full cooperation, attempt to circumvent any of their security, or put any automated scraping/spidering in place. We are really hoping to find a way to cooperate with the Cumulus program, not abuse the allowances they have made that make this project possible. | In the [[http://www.migros.ch/cumulus/de/ueber-cumulus/geschaeftsbedingungen.html|Terms and Conditions]] and [[http://www.migros.ch/de/rechtliches-impressum/rechtliches-impressum.html|Impressum]] there is no mention of restrictions on the use of the personal Cumulus data. At the moment we assume, and will contact their management to check, that they will not restrict us from using the receipt data from our personal accounts any way we can, including sharing it publicly. | 
|  |  | 
| ===== Project status ===== | We will obviously not try to collect any other person's data without their full cooperation, attempt to circumvent any of their security, or put any automated scraping/spidering in place. This requires clear statements and terms of use on the site. We are hoping to find a way to cooperate with the Cumulus program, and ensure that we do not abuse the allowances they have made that make this project possible. | 
|  |  | 
| We are working on the initial prototype, aiming to complete a basic Phase I by tomorrow and give a view into the user's own data. | If you are able to provide additional legal input, [[data@cumulizer.eu|please do]]. | 
|  |  | 
| In Phase II we are aiming to aggregate data from the community, allowing multiple users to share their data anonymously, and show common trends. | ===== Project status ===== | 
|  |  | 
| In Phase III we will be able to link our data with other sources, showing additional information about users' purchases, and how buying trends align with, for example [[http://worldbank.270a.info/classification/indicator.html|semantically interlinked economic data]] (World Bank via [[data:270a]]) and open [[https://explore.data.gov/Income-Expenditures-Poverty-and-Wealth/Consumer-Expenditure-Survey/3chw-d2fd|consumer surveys]] (data.gov). | We have created an initial working prototype which gives a view into the data of five Cumulus users who have volunteered to submit one year of data anonymously to seed the project. | 
|  |  | 
| ==== Example ==== | * To complete Phase I we need to finish developing the dashboard tools that should conveniently and accurately visualize their purchases over time, sorted by category, and so on. | 
| A heatmap of a user's purchases, sorted geographically | * In Phase II we are aiming to let users upload their own data, so that we get aggregate data from the community, allowing comparison between users (who may wish to add a simple profile) and showing common trends. | 
|  | * In Phase III we will be able to link our data with other sources, showing additional information about user purchases, and how buying trends align with, for example [[http://worldbank.270a.info/classification/indicator.html|semantically interlinked economic data]] (World Bank via [[data:270a]]) and open [[https://explore.data.gov/Income-Expenditures-Poverty-and-Wealth/Consumer-Expenditure-Survey/3chw-d2fd|consumer surveys]] (data.gov). We will also create APIs and, given a sufficient user base, the project itself will be interesting to social scientists who are reportedly starved for accurate personal economic data. | 
|  |  | 
| {{:project:cumulizer_heatmap.jpg?200|Cumulizer Heatmap}} | If you'd like to take part in the planning, or otherwise contribute your expertise, please [[data@cumulizer.eu|contact us]]. | 
|  |  | 
|  | ===== Technical information ===== | 
|  |  | 
|  | Here is a basic guide to our JSON API: | 
|  |  | 
| ===== Development information ===== |  | 
| ^Link^Info^ | ^Link^Info^ | 
| |/dashboard/simpleupload|Simple upload form| | |/dashboard/simpleupload|Simple upload form| | 
| |/api?action=categories|List of all categories| | |/api?action=categories|List of all categories| | 
|  |  | 
| ===== Install information ===== | For more details, please see the source. | 
|  |  | 
| 1) Download Cumulizer from GitHub and upload to your site | <GITHUB cstuder/cumulizer> | 
| 2) Create a database (DB) for Cumulizer |  | 
| 3) Modify "USE cumulizer" with your DB name (_docs/cumulizer.sql) | ==== Installation ==== | 
| 4) Modify username and password (application/config/database.php) |  | 
| 5) Upload receipts (dashboard/simpleupload) | - Download Cumulizer sources from [[https://github.com/cstuder/cumulizer|GitHub]] | 
| 6) Run (admin/geocodestores) | - Install [[http://getcomposer.org/|composer]] and [[http://getcomposer.org/doc/00-intro.md#using-composer|run it]] at the root of the project | 
| 7) View the heatmap (dashboard/heatmap) | - Install the project on your Apache/MySQL/PHP server | 
|  | - Create a database; if you don't use 'cumulizer' as its name, modify the top of the import script accordingly | 
|  | - Import the initial schema (_docs/cumulizer.sql) | 
|  | - Modify username and password (application/config/database.php) | 
|  | - Upload your CSV receipts (dashboard/simpleupload) | 
|  | - Run the geotagging generator (admin/geocodestores) | 
|  | - Your dashboard will be ready at / | 
|  |  | 
| ===== Team ===== | ===== Team ===== | 
| ===== Links ===== | ===== Links ===== | 
|  |  | 
|  | * [[http://migros.ch|Migros]] is one of Switzerland's top three supermarkets and has been immensely helpful in creating a legal interface to download our personal data | 
| * [[http://solikarte.ch/de/|Solikarte]] is a related initiative for anonymously collecting points for charitable causes, and would be a possible data source and partner for this project | * [[http://solikarte.ch/de/|Solikarte]] is a related initiative for anonymously collecting points for charitable causes, and would be a possible data source and partner for this project | 
| * [[http://www.onereceipt.com/|One Receipt]] lets customers in the US see all their purchases in one place | * [[http://www.onereceipt.com/|One Receipt]] lets customers in the US see all their purchases in one place | 
| * [[http://blog.okfn.org/2013/03/08/opening-product-data-for-a-more-responsible-world/|Opening Product Data for a more Responsible World]] (OKFN Blog) talks about the opportunities for open product data, and the [[http://product-open-data.com/|Product Open Data]] project | * [[http://blog.okfn.org/2013/03/08/opening-product-data-for-a-more-responsible-world/|Opening Product Data for a more Responsible World]] (OKFN Blog) talks about the opportunities for open product data, and the [[http://product-open-data.com/|Product Open Data]] project | 
| * [[https://www.eff.org/deeplinks/2012/09/deep-dive-facebook-and-datalogix-whats-actually-getting-shared-and-how-you-can-opt|A Deep Dive into Facebook and Datalogix]] (Electronic Frontier Foundation) explores the marketing bonanza that is consumer data collection, and warns of the privacy risks | * [[https://www.eff.org/deeplinks/2012/09/deep-dive-facebook-and-datalogix-whats-actually-getting-shared-and-how-you-can-opt|A Deep Dive into Facebook and Datalogix]] (Electronic Frontier Foundation) explores the marketing bonanza that is consumer data collection, and warns of the privacy risks | 
| * [[http://www.theatlantic.com/technology/archive/2013/03/what-does-the-consumer-data-industry-know-about-you/273829/|What does the consumer data industry know about you]] (The Atlantic) explores a few more perspectives on the subject | * [[http://www.theatlantic.com/technology/archive/2013/03/what-does-the-consumer-data-industry-know-about-you/273829/|What does the consumer data industry know about you]] (The Atlantic) explores a few more perspectives on the subject | 
|  |  | 
|  | {{tag>status:demo needs:expert finance}} | 
| {{tag>status:concept needs:design needs:expert consumer products}} |  |
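
As a rough illustration of the JSON API listed under Technical information, the sketch below builds the ''/api?action=categories'' URL and fetches it as JSON. It is only a sketch under assumptions: the base URL ''http://localhost/cumulizer'' and the helper names ''build_api_url'' and ''fetch_json'' are hypothetical and not part of the project, and nothing is assumed about the shape of the response.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Hypothetical base URL of a local Cumulizer install; adjust to your server.
BASE_URL = "http://localhost/cumulizer"


def build_api_url(base_url: str, action: str) -> str:
    """Build a URL of the form <base>/api?action=<action>."""
    return f"{base_url.rstrip('/')}/api?{urlencode({'action': action})}"


def fetch_json(url: str, timeout: float = 10.0):
    """Fetch a URL and decode the body as JSON.

    The response structure is not documented on this page, so the
    decoded value is returned as-is without further interpretation.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a running install, ''fetch_json(build_api_url(BASE_URL, "categories"))'' would return whatever the categories endpoint serves; for the actual response format, please see the source.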