Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
project:openfooddna [2015/06/06 12:18] – heluc | project:openfooddna [2015/06/06 16:30] (current) – heluc | ||
---|---|---|---|
Line 1: | Line 1: | ||
===== Open Food DNA ===== | ===== Open Food DNA ===== | ||
- | (screenshots or sketches up here) | + | The basic idea is to carry out a citizen science project to compile an open dataset of DNA information about food and beverages. |
- | The basic idea is to create an open dataset of DNA information | + | Many more details |
Take beer. The assumption is that the DNA content (genetic or genomic information about every single living organism that was present during the brewing) may correlate with the type and taste of the brew. | Take beer. The assumption is that the DNA content (genetic or genomic information about every single living organism that was present during the brewing) may correlate with the type and taste of the brew. | ||
- | We want to build a data workflow from sampling | + | Somebody else (working for a [[http:// |
- | Somebody else (at a [[http://www.whitelabs.com|Yeast provider]] in California) seems to be working on this, too. The american press (including the [[http://www.nytimes.com/2014/05/27/science/craft-beer-at-the-genetic-level.html?_r=0|NYT]]) did cover his project, but there is no data or report available so far. | + | ===== Team ===== |
+ | |||
+ | * [[user:heluc|Luc Henry]], ideator/scientist [[https://twitter.com/heluc|@heluc]] | ||
+ | * [[user: | ||
+ | * Soraia Binz, designer [[https://twitter.com/supsi|@supsi]] | ||
+ | * [[user: | ||
+ | |||
+ | Disclaimer: Luc Henry and Gianpaolo Rando run the BeerDeCoded | ||
===== Data ===== | ===== Data ===== | ||
- | | + | In this project, genetic data will initially be qualitative sequencing data. The idea is to transform this sequencing data into binary data about the presence=1/ |
+ | |||
+ | ==== Random data generation ==== | ||
+ | |||
+ | Since we do not have data yet, we generated a random dataset using this Octave/ | ||
+ | (generates a table containing random binary data for 10 samples and 30 parameters and calculate the euclidian distance matrix) | ||
+ | |||
+ | < | ||
+ | # | ||
+ | % lines : beer samples | ||
+ | li = 10; | ||
+ | % columns : species and sub species | ||
+ | co = 30; | ||
+ | M = round(rand(li, | ||
+ | O = zeros(li); | ||
+ | for i = 1:li | ||
+ | for j = 1:i | ||
+ | O(i,j) = norm( M(:, | ||
+ | endfor | ||
+ | endfor | ||
+ | O | ||
+ | </ | ||
+ | |||
+ | We did not plot this data yet. | ||
+ | |||
+ | ==== Metadata ==== | ||
+ | |||
+ | We searched for metadata to add to the data generated and making it look more real. Few of our findings: | ||
+ | |||
+ | | ||
Webpage of [[http:// | Webpage of [[http:// | ||
Download data [[http:// | Download data [[http:// | ||
- | ===== Team ===== | ||
- | * [[user: | + | ==== Workflow ==== |
- | * [[user:randogp|Gianpaolo Rando]], scientist [[https://twitter.com/randogp|@randogp]] | + | |
- | * Soraia Binz, designer [[https://twitter.com/supsi|@supsi]] | + | Starting from a table containing m samples (S1-Sm) and a [m x n] matrix of binary data (D1-Dn species are either present or not) for each sample, we can build a [m x m] matrix of Euclidian distances between the samples. |
- | * [[user:alog|Antoine Logean]], engineer [[https://twitter.com/ecolix|@ecolix]] | + | |
- | * and other team members | + | {{ :project: |
+ | |||
+ | The metadata entries (M1-Mn) can be attached to this matrix and used to generate a plot that contains sample points with associated characteristics. This plot contains every single sample present in the database. | ||
+ | |||
+ | A user friendly interface has to be built in order for the beer lover to choose the beers they know, and compare this " | ||
+ | |||
+ | {{ :project: | ||
+ | |||
+ | ===== Prototype ===== | ||
+ | |||
+ | We " | ||
+ | |||
+ | We have a main display with three visualisation types you can switch from: Plot (display beer samples based on Euclidian distance), Tree (same but in a tree fashion), Rank (forget distances and rank based on metadata, such as alcohol %, bitterness). | ||
+ | |||
+ | Here is the Plot: | ||
+ | |||
+ | {{ : | ||
+ | |||
+ | Here is the Tree (can switch from linear to circular representation): | ||
+ | |||
+ | {{ : | ||
+ | |||
+ | Here is the Plot being transformed into a Rank: | ||
+ | |||
+ | {{ : | ||
+ | |||
+ | Transformation from Plot to Rank was taken from the following example: | ||
+ | http://work.interactivethings.com/nzz-swiss-maps/ | ||
===== Links ===== | ===== Links ===== | ||
+ | The original pad where we worked is here (many more info from web searches) | ||
* https:// | * https:// | ||
- | * Relevant documentation ... | + | |
- | * Blog or forum posts ... | + | The kickstarter campaign of the BeerDeCoded project is here: |
- | * Tools you used ... | + | * https://www.kickstarter.com/ |
- | + | ||
+ | The wiki of the DIY biology Hackuarium association is here: | ||
+ | * http://wiki.hackuarium.ch | ||
{{tag> | {{tag> |