This visualisation scrapes data from theguardian.com website, and aims to show how one news article leads to many more in a 'butterfly effect' phenomenon.

The user can choose a seed news article, from which subsequent news articles 'sprout' off in different directions. These articles are related, but aim to show the story evolving in different directions. The user can keep on exploring the news in this way, expanding each story down different paths.

This visualisation was created using the D3.js library, whilst the backend analysis of news articles is done in Python/Flask. On the backend, all World News articles from the past ~3 years have been scraped using The Guardian's API, and analysed for content. As a user selects an article, around 50 similar articles occuring in the future relative to this article are gathered (the similarity matrix is pre-calculated, and incrementally updated every day). To show the user which 'directions' the story takes, K-means clustering is applied to the 50 similar articles, splitting them into 2 or 3 clusters. Each cluster should be a slightly different 'direction' for the story to subsequently follow. One article from each of the clusters is shown to the user (currently based only on similarity score, though ideally the decision of which article to show should also be based on article popularity, timescale, pagerank, etc.). See this blog entry for more detail.

We start the visualisation with one 'seed' article from The Guardian.

World News



(Or click here to pick some random articles from 2012)

Click on an article to start:

Click on an article to see how a news story evolved in different directions...


Click to start afresh