From publisher to platform: How the Guardian embraced the internet using content, search, and Open Source

Stephen Dunn, Guardian News and Media stephen.dunn@guardian.co.uk, 25th May, 2011 Twitter: @cuica, @openplatform

Thursday, 26 May 2011

The publishing era

We started a long time ago:

[Diagram labels: Swine flu; keyword page, live blogs, apps, mobile site, Twitter updates, comment, content partnerships, newspapers, audio, video, Open Platform API]

To secure the financial and editorial independence of the Guardian in perpetuity. To promote freedom in the press and liberal journalism globally. To become the world's leading liberal voice.

Open Web Principles

2009

1. Permanent

http://www.flickr.com/photos/fstorr/

"A cool URI is one that does not change" (Tim Berners-Lee, 1998)

1.5 million resources redirected to new scheme

2. Addressable
Resources are about something - ready for the social web. We live in the age of point-at-things (Coates 2005)

3. Discoverable

Multiple routes to content

Tagging drives discovery

4. Open

Example: The Hackable Guardian


http://www.guardian.co.uk/....
/technology/internet/rss
/technology/all/rss
/environment/climatechange+business/globaleconomy/rss
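The URL patterns above can be read as a small rule: take one or more section/keyword paths, join them with "+", and append /rss. The sketch below illustrates that rule only. The function name feed_url is hypothetical, the base address is taken from the slide, and whether a given combination still resolves on the live site is not something this example can confirm.

```python
# Base address as shown on the slide.
BASE = "http://www.guardian.co.uk"


def feed_url(*tags: str) -> str:
    """Build an RSS feed URL for one or more section/keyword paths.

    Several paths are joined with "+", following the slide's
    /environment/climatechange+business/globaleconomy/rss example.
    """
    if not tags:
        raise ValueError("at least one tag path is required")
    # Strip stray slashes so "/technology/internet/" and
    # "technology/internet" behave the same way.
    path = "+".join(tag.strip("/") for tag in tags)
    return f"{BASE}/{path}/rss"


if __name__ == "__main__":
    print(feed_url("technology/internet"))
    print(feed_url("environment/climatechange", "business/globaleconomy"))
```

Because the rule is purely a matter of URL structure, readers could compose their own feeds without any special tooling, which is the sense in which the site was "hackable".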

Results...

Site traffic growth

[Chart: unique users from Sep 2005 to Dec 2008 on an axis running to 30,000,000, with the pre-project period, first release and final release marked, and a 40M annotation]

However...

1 Billion+ Internet Users!

...How I stopped worrying about my website and learned to love the whole internet. Matt McAlister

The Open Strategy


OPEN IN: Bring in data and apps from the Internet

OPEN OUT: Enable partners to build applications using Guardian content and services for other platforms

"Our most interesting experiments lie in combining what we know with the experience, opinions and expertise of the people who want to participate rather than passively receive."

Jack Shenker
The Guardian alongside Al Jazeera was the one news source that everybody on the streets in Tahrir - not just in Cairo but in surrounding cities and major centers of revolutionary activity that people were talking about.

The Open Platform

The suite of services enabling partners to build applications with the Guardian

CONTENT API
A service for selecting and collecting content from the Guardian for re-use

DATA STORE
A directory of useful data curated by Guardian editors

POLITICS API
Open database of candidates, voting records, constituencies, election results, live data on election day
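To make "selecting and collecting content" a little more concrete, here is a minimal sketch that only builds a search request URL for the Content API and sends nothing over the network. The endpoint and the parameter names q, page-size and api-key reflect the publicly documented API as best understood here and should be treated as assumptions to check against the current documentation. The function name search_url and the key value "test" are placeholders.

```python
from urllib.parse import urlencode

# Assumption: the public search endpoint of the Content API.
SEARCH_ENDPOINT = "https://content.guardianapis.com/search"


def search_url(query: str, api_key: str, page_size: int = 10) -> str:
    """Return a search URL for the given free-text query.

    Only the URL is built here; fetching and parsing the response
    is left to the caller.
    """
    params = {
        "q": query,              # free-text search terms
        "page-size": page_size,  # number of results per page
        "api-key": api_key,      # access key (placeholder in this sketch)
    }
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    print(search_url("swine flu", api_key="test"))
```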

Mutualised news!

<OBLIGATORY DOGFOOD SLIDE>

Open for Business

3 Tiers of access, 3 Revenue models

Keyless: Take our headlines. You keep associated revenues.

Approved: Take our full article content, but with an advert. Guardian keeps ad revenue, you keep rest-of-page revenue.

Bespoke: Take, reformat, augment our content. Revenue model to be negotiated. Combination of Media, Fees, Downloads.

What this means


Open Out:
Developers can now access full content APIs on demand, with keys post-approved
Platform is positioned as a place to do business
So rapid scalability, reliability and performance are now core requirements

OPEN IN: Bring in data and apps from the internet

OPEN OUT: Allow partners to build applications using Guardian content and services for other platforms

MICROAPPS
A framework for integrating 3rd party applications into guardian.co.uk

Simple REST/HTTP framework allows lightweight development
Applications proxied for performance
Apps generally hosted in the cloud, allows hot deployment into production
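One way to picture "applications proxied for performance" is a short-lived cache sitting between the page and a remotely hosted app that returns an HTML fragment. The sketch below is illustrative only and is not the Guardian's framework: FragmentProxy, recipe_app and the 60-second lifetime are hypothetical, and a plain function stands in for the HTTP call to the external app.

```python
import time
from typing import Callable, Dict, Tuple


class FragmentProxy:
    """Serve microapp fragments, reusing a cached copy while it is fresh."""

    def __init__(
        self,
        fetch: Callable[[str], str],
        ttl_seconds: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch      # stand-in for an HTTP request to the app
        self._ttl = ttl_seconds  # how long a cached fragment stays valid
        self._clock = clock      # injectable so the cache can be tested
        self._cache: Dict[str, Tuple[float, str]] = {}

    def get(self, path: str) -> str:
        now = self._clock()
        cached = self._cache.get(path)
        if cached is not None and now - cached[0] < self._ttl:
            return cached[1]  # fresh enough: skip the remote call
        body = self._fetch(path)
        self._cache[path] = (now, body)
        return body


def recipe_app(path: str) -> str:
    """Hypothetical microapp: returns an HTML fragment for the page."""
    return f"<div class='microapp'>Recipes for {path.strip('/')}</div>"


if __name__ == "__main__":
    proxy = FragmentProxy(recipe_app)
    print(proxy.get("/tomatoes"))
```

Keeping the app behind a proxy like this means a slow or briefly unavailable external host affects only requests that miss the cache.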

What could I cook?

Bringing it together

App showcase

From publisher to platform


Seeking massive growth, but no longer only broadcasting content on the website
User/partner engagement & contribution on: journalism, data, software applications, revenue and ads
Support developers and partners with data and APIs: need scalability, reliability, speed

Evolving the architecture

[Architecture diagram: three web servers, three app servers, Memcached (added later), Oracle, CMS]

Why RDBMS?

5 years ago, fewer alternatives
Understand operations procedures
Can easily recruit DBAs / devs
Developer/ops tools
Business critical system: a safe choice

[Background diagram: web servers, app servers, Memcached, CMS, Oracle]

Scaling traffic

[Chart: unique users from Sep 2005 to Sep 2008 on an axis running to 30,000,000]

We chose Solr/Lucene
Can perform complex queries, including full-text search
We can change the schema with no downtime
Most queries are of similar cost
Scales very well horizontally
Just worked in the cloud
No strange control processes/engines
Developers just loved working with it!
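As a rough illustration of a query that combines full-text search with filtering, the sketch below builds a request URL for Solr's standard /select handler. The parameters q, fq, sort, rows and wt are standard Solr query parameters. The host name and the field names body, tag and publication_date are assumptions made up for this example and are not the Guardian's schema.

```python
from urllib.parse import urlencode


def solr_select_url(base: str, text: str, tag: str, rows: int = 10) -> str:
    """Build a Solr /select URL: full-text match, tag filter, newest first."""
    params = [
        ("q", f"body:({text})"),             # full-text search (hypothetical field)
        ("fq", f'tag:"{tag}"'),              # filter query (hypothetical field)
        ("sort", "publication_date desc"),   # newest first (hypothetical field)
        ("rows", rows),                      # maximum number of documents returned
        ("wt", "json"),                      # ask for a JSON response
    ]
    return f"{base}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical Solr core address.
    print(solr_select_url(
        "http://localhost:8983/solr/content",
        text="climate change",
        tag="environment/climatechange",
        rows=5,
    ))
```

Putting the tag restriction in fq rather than q lets Solr cache the filter independently of the text query, which is one reason filtered searches tend to stay cheap as traffic grows.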

[Architecture diagram: web servers, app server, Memcached, RDBMS and CMS, plus an API with multiple Solr instances in the cloud (EC2)]

What about Open In?


[Architecture diagram: web servers, proxy, six apps on external hosting (app engine etc.), app server, Memcached, RDBMS, CMS]

[Architecture diagram, labelled Core, In and Out: web servers, app server, Memcached, RDBMS and CMS; a proxy to six apps on external hosting (app engine etc.); multiple Solr instances in the cloud (EC2)]
