Code is a commodity
http://www.flickr.com/photos/ecstaticist/1120119742/
What's the central myth underlying big data?
The myth that drove the gold rush
All we need is a fat pipe and pans working in parallel
You change an org by acting with, through others, not alone.
Evolution of data
50s-60s: data as product
70s-80s: data as byproduct
90s-00s: data as asset
2010s+: data as substrate
The real data revolution is in business structure and processes and how they use information.
Everything is so different now
Many current approaches miss the point
Using Big Data
It's not about big
And big is often not as big as you think it is.
It's not really about data, either
If there's no process for applying information in a specific context, then you are producing expensive trivia.
Where does the value in data come from?
For most of us in non-data businesses, this translates to "How can we use information to improve the decisions made in our organization?"
We need to focus on that singularly bad decision-making entity, the group.
Organizations seem to amplify innate decision-making flaws.
Decision-making realities
The operating model in senior management is primarily intuition and pattern based.
The mode for middle management is political, bureaucratic.
New data is destabilizing, which is why you may hit a wall trying to push your data-driven agenda.
Data is contextual, so we need stories to explain how we think the world works, why my data is better than yours, and why your theory sucks.
Cognitive bias creates a morass for interpretation.
A very abstract business intelligence model
Who are the people making decisions?
Strategic, Tactical, Operational
What is the nature of their decisions?
Scope, time frame of decision, time scale of data, data volume, breadth of data, frequency, pattern vs. fact based
Analytic complexity
The process aspect of decisions ties to people
Scope of control for people in most organizations aligns: in process, on process, over process
What kind of support do they have today?
Other people
Email, meetings
Reality of most reports and dashboards is that they provide basic monitoring at best.
How and where can you apply data solutions?
High single value, less frequent, so improve the effectiveness of individual decisions.
Fuzzy middle ground
Low single value, frequent, can improve the efficiency or the effectiveness for large aggregate improvement.
Analytic complexity
What do people do with data?
1. Describe: use data to characterize a current or prior state of the system, for example monitoring and identifying exceptions
2. Investigate: explore data to discover the boundaries and characteristics of a system, frame a problem or find supporting/discrediting evidence.
3. Explain: use data and analytic methods to determine causes and effects, build models and construct stories.
4. Predict: apply analytic models to determine possible/probable future states of the system
5. Prescribe: use data in models to define policy, procedure, and rules for taking action, and possibly automate them
Data infrastructure and tool support for these activities in most organizations is uneven at best, decreasing as you move down.
If you want to be a data scientist, or build software to support them, read this paper
[Figure: Pirolli and Card, 2005; axis labels: Structure, Effort]
Third Nature is a research and consulting firm focused on new and emerging technology and practices in business intelligence, data integration and information management. If your question is related to BI, open source, web 2.0 or data integration then you're at the right place. Our goal is to help companies take advantage of information-driven management practices and applications. We offer education, consulting and research services to support business and IT organizations as well as technology vendors. We fill the gap between what the industry analyst firms cover and what IT needs. We specialize in product and technology analysis, so we look at emerging technologies and markets, evaluating the products rather than vendor market positions.