In this third -and final- part of the way that digg stories are analyzed, i will present an example of the co-occurrence table used to find statistically significant correlation of words.
In the previous part i outlined the way stories from digg are collected and how these stories are transformed in a way suitable for analysis. This last step of the analysis, is finding out about what subjects people seem to really like and which subjects are not 'digged' so much. To do that, a co-occurrence table is used which maps words to two categories : HighDiggs (=stories that are interesting) and LowDiggs (=stories that are not interesting).
This is an example of a word co-occurrence table from IBM's UI Modeler
The statistical significance between words and categories of interestingness is denoted by colors.The more intense the color, the more higher the affinity between the word and the category.
In the example table above we see that people are interested (and therefore 'digged' more) in :
1) Stories that have pictures
2) US President George W Bush
3) Apple Leopard
4) Ron Paul (not shown in table)
On the other hand people do not 'digg' stories about :
1) Microsoft (..!)
2) Blogs (not shown in table)
In the previous part i outlined the way stories from digg are collected and how these stories are transformed in a way suitable for analysis. This last step of the analysis, is finding out about what subjects people seem to really like and which subjects are not 'digged' so much. To do that, a co-occurrence table is used which maps words to two categories : HighDiggs (=stories that are interesting) and LowDiggs (=stories that are not interesting).
This is an example of a word co-occurrence table from IBM's UI Modeler
In the example table above we see that people are interested (and therefore 'digged' more) in :
1) Stories that have pictures
2) US President George W Bush
3) Apple Leopard
4) Ron Paul (not shown in table)
On the other hand people do not 'digg' stories about :
1) Microsoft (..!)
2) Blogs (not shown in table)


