Vous êtes sur la page 1sur 11

Data Science in General

An Introduction to Data Science


Jeffrey Stanton, 2013

School of Data Handbook


School of Data, 2015

Data Jujitsu: The Art of Turning Data into Product


DJ Patil, 2012

Interviews with Data Scientists

The Data Science Handbook


[Buy on Amazon]
Carl Shan, Henry Wang, William Chen, & Max Song, 2015

The Data Analytics Handbook


Brian Liou, Tristan Tao, & Declan Shener, 2015

Forming Data Science Teams

Data Driven: Creating a Data Culture


[Buy on Amazon]
Hilary Mason & DJ Patil, 2015

Building Data Science Teams


[Buy on Amazon]
DJ Patil, 2011

Understanding the Chief Data Officer


Julie Steele, 2015

Data Analysis

The Elements of Data Analytic Style


[Buy on Amazon]
Jeff Leek, 2015

Distributed Computing Tools

Hadoop: The Definitive Guide


[Buy on Amazon]
Tom White, 2011

Data-Intensive Text Processing with MapReduce


[Buy on Amazon]
Jimmy Lin & Chris Dyer, 2010

Learning Languages
Python

Think Python: How to Think Like a Computer Scientist


Allen Downey, 2012

Python Programming
Wikibooks, 2015

Automate the Boring Stuff with Python: Practical Programming for Total Beginners
[Buy on Amazon]
Al Sweigart, 2015

Learn Python the Hard Way


[Buy on Amazon]
Zed A. Shaw, 2013

R Programming for Data Science


Roger D. Peng,

R Programming
Wikibooks, 2014

Advanced R
[Buy on Amazon]
Hadley Wickham, 2014

SQL

Learn SQL The Hard Way


Zed. A. Shaw, 2010

SQL Tutorial
Tutorials Point

Data Mining and Machine Learning

Introduction to Machine Learning


Amnon Shashua, 2008

Machine Learning
Abdelhamid Mellouk & Abdennacer Chebira, 450

Machine Learning The Complete Guide


Wikipedia

Social Media Mining An Introduction


[Buy on Amazon]
Reza Zafarani, Mohammad Ali Abbasi, & Huan Liu, 2014

Data Mining: Practical Machine Learning Tools and Techniques


[Buy on Amazon]
Ian H. Witten & Eibe Frank, 2005

Mining of Massive Datasets


[Buy on Amazon]
Jure Leskovec, Anand Rajaraman, & Jeff Ullman, 2014

A Programmers Guide to Data Mining


Ron Zacharski, 2015

Data Mining with Rattle and R


[Buy on Amazon]
Graham Williams, 2011

Data Mining and Analysis: Fundamental Concepts and Algorithms


[Buy on Amazon]
Mohammed J. Zaki & Wagner Meria Jr., 2014

Probabilistic Programming & Bayesian Methods for Hackers


[Buy on Amazon]
Cam Davidson-Pilon, 2015

Data Mining Techniques For Marketing, Sales, and Customer Relationship


Management
[Buy on Amazon]
Michael J.A. Berry & Gordon S. Linoff, 2004

Inductive Logic Programming: Techniques and Applications


[Buy on Amazon]
Nada Lavrac & Saso Dzeroski, 1994

Pattern Recognition and Machine Learning


[Buy on Amazon]
Christopher M. Bishop, 2006

Machine Learning, Neural and Statistical Classification


[Buy on Amazon]
D. Michie, D.J. Spiegelhalter, & C.C. Taylor, 1999

Information Theory, Inference, and Learning Algorithms


[Buy on Amazon]
David J.C. MacKay, 2005

Data Mining and Business Analytics with R


[Buy on Amazon]
Johannes Ledolter, 2013

Bayesian Reasoning and Machine Learning


[Buy on Amazon]
David Barber, 2014

Gaussian Processes for Machine Learning


[Buy on Amazon]
C. E. Rasmussen & C. K. I. Williams, 2006

Reinforcement Learning: An Introduction


[Buy on Amazon]
Richard S. Sutton & Andrew G. Barto, 2012

Algorithms for Reinforcement Learning


[Buy on Amazon]
Csaba Szepesvari , 2009

Big Data, Data Mining, and Machine Learning


[Buy on Amazon]
Jared Dean, 2014

Modeling With Data


[Buy on Amazon]
Ben Klemens, 2008

KB Neural Data Mining with Python Sources


[Buy on Amazon]
Roberto Bello, 2013

Deep Learning
Yoshua Bengio, Ian J. Goodfellow, & Aaron Courville, 2015

Neural Networks and Deep Learning


Michael Nielsen, 2015

Data Mining Algorithms In R


Wikibooks, 2014

Data Mining and Analysis: Fundamental Concepts and Algorithms


[Buy on Amazon]
Mohammed J. Zaki & Wagner Meira Jr., 2014

Theory and Applications for Advanced Text Mining


Shigeaki Sakurai, 2012

Statistics and Statistical Learning

Think Stats: Exploratory Data Analysis in Python


[Buy on Amazon]
Allen B. Downey, 2014

Think Bayes: Bayesian Statistics Made Simple


[Buy on Amazon]
Allen B. Downey, 2012

The Elements of Statistical Learning: Data Mining, Inference, and Prediction


[Buy on Amazon]
Trevor Hastie, Robert Tibshirani, & Jerome Friedman, 2008

An Introduction to Statistical Learning with Applications in R


[Buy on Amazon]
Gareth James, Daniela Witten, Trevor Hastie, & Robert Tibshirani, 2013

A First Course in Design and Analysis of Experiments


[Buy on Amazon]
Gary W. Oehlert, 2010

Data Visualization

D3 Tips and Tricks


[Buy on Amazon]
Malcolm Maclean, 2015

Interactive Data Visualization for the Web


[Buy on Amazon]
Scott Murray, 2013

Big Data

Disruptive Possibilities: How Big Data Changes Everything [Buy on Amazon]


Jeffrey Needham, 2013

Real-Time Big Data Analytics: Emerging Architecture


[Buy on Amazon]
Mike Barlow, 2013

Big Data Now: 2012 Edition


[Buy on Amazon]
OReilly Media, Inc., 2012

Computer Science Topics

Natural Language Processing with Python [Buy on Amazon]


Steven Bird, 2009

Computer Vision [Buy on Amazon]


Richard Szeliski, 2010

Concise Computer Vision [Buy on Amazon]


Reinhard Klette, 2010

Artificial Intelligence A Modern Approach, 1st Edition


[Buy on Amazon (3rd Edition)]
Stuart Russell, 1995

Well, there you have it. Thousands of e-pages to read through. We hope theres something there
for everyone, no matter what level youre starting at. If you have any suggestions of free books
to include or want to review a book mentioned, please comment below and let us know!
By Alex Ivanovs, CodeCondo, Apr 29, 2014.
Data mining, data analysis, these are the two terms that very often make the impressions of being
very hard to understand complex and that youre required to have the highest grade education
in order to understand them.
I can only disagree, and as with anything in this wonderful life of ours, we only need to spend a
certain amount of time learning something, practicing it, before we realize that its not really all
that hard.
No doubt that there are very smart people in this World, working for large corporations such as
Google, Apple, Microsoft and plenty more (including security agencies), but if we continue to
look up to them; we will always think its hard, because we have never given ourselves the
chance to look at real examples and facts.
By learning from these books, you will quickly uncover the secrets of data mining and data
analysis, and hopefully be able to make better judgement of what they do, and how they can help
you in your working projects, both now and in the future.
I just want to say that, in order to learn these complex subjects, you need to have a completely
open mind, be open to every possibility, because that is usually where all the learning happens,
and no doubt your brain is going to set itself on fire; multiple times.
Data Jujitsu: The Art of Turning Data into Product

DJ Patil gives us brief introduction on the complexity of data


problems, how to look at them from a better perspective, and whether we should bother trying to
solve the impossible. He gives perfectly good and understandable examples, and is a nice little
data book to add to your collection, its quality knowledge at free of charge.
You can grab a copy of this book by filling out the fields on the right hand site. (I think filling
them blank also works)
Data Mining Algorithms In R

This Wikibook aims to fill this gap by integrating three pieces of information for each technique:
description and rationale, implementation details, and use cases.
The description and rationale of each technique provide the necessary background for
understanding the implementation and applying it to real scenarios. The implementation details
not only expose the algorithm design, but also explain its parameters, in the light of the rationale
provided previously.
Finally, the use cases provide an experience of the algorithms use on synthetic and real datasets.
A Programmers Guide to Data Mining

This book is exactly what I was talking about at the beginning of


this post, it features plenty of real-life experiences, that are aimed at beginners to help you better
understand the whole process of data manipulation, and how algorithms work.
Its apparently a work in progress, but there are plenty of chapters already available, though it
seems that the last one is a few months overdue right now. Nonetheless, the first few chapters are
essential to grasp the basics and highly recommended.
Data Mining and Analysis: Fundamental Concepts and Algorithms

This is a very high quality book that has more advanced


techniques and ways of doing things included, its still being edited / written and is set to be
released at some point, later this year. You can view the official draft by following this link
(PDF), youll be amazed at how much information there is to browse!
Its perfect for those learners who like to learn from illustrations and plenty of real-life examples.
Data Mining & Analysis in Internet Advertising
I mentioned some large companies like Google, and Apple, and the reason for that is very
simple: we see data mining and analysis everywhere, not just specific sciences and subjects.
In reality, platforms such as Google Analytics heavily depend on algorithms that have been built

on top of high quality data science knowledge, and the same goes for advertising companies,
which is the main topic of discussion in this white-paper / eBook.
An Introduction to Data Science

Jeffrey M. Stanton briefs of us on data science, and how it essentially is


more than just a set of tasks related to data mining. In his own words, its more of an art form
that, an interacts with more industries than some may believe.
In addition, data science is much more than simply analyzing data. There are many people who
enjoy analyzing data and who could happily spend all day looking at histograms and averages,
but for those who prefer other activities, data science offers a range of roles and requires a range
of skills. Lets consider this idea by thinking about some of the data involved in buying a box of
cereal.

Mining of Massive Datasets


In couple of short words, this book is perfect for those who want to learn more about data mining
on the web, and it discusses the most common set of problems when designing for the web and
working with data that the web is giving us.
It will provide you with plenty of examples and tasks to do at the end of each section, and is also
a fairly beginner friendly book; requiring of you to have some previous experience with data
algorithms, some math and database experience wouldnt hurt either.

School of Data Handbook

School of Data is a great place to be, they offer a wide variety of


courses targeted at all levels of expertise, and this Handbook is perfect alongside their course
material. What I really love about this handbook is that it gives you plenty of follow-up links on
the web, to make project creation easier.
A good example is links to websites that have previously built data sets, essential to those who
want to learn more about data and how it works!

Theory and Applications for Advanced Text Mining


We are going to conclude our list of free books for learning data mining and data analysis, with a
book that has been put together in nine chapters, and pretty much each chapter is written by
someone else; but it all makes perfect sense together.
The main focus of this book is text mining, and the evolution of web technology and how that is
making an impact on data science and overall analysis. Great book to have!
Learn Data Science from Free Books
There is no better way to learn than from books, and then going out in the world and putting that
newly found knowledge to the test, or otherwise were bound to forget what we actually had
learned. This is a beautiful list of books that every aspiring data scientist should take note of, and
add to his list of learning materials.
What books have you read in order to help you begin your own journey in data mining and
analysis? Im sure that the community would love to hear more, and Im eager to see what I
potentially let slip through my fingers myself.

COURSE BREAKDOWN
Week One
Introduction to data science and its applications. Python and SQL to manage and manipulate
data.

Week Two
Basics of statistics, probability, and linear algebra. These are the mathematical foundations of
machine learning.

Week Three & Four


Supervised Learning. Fit models to labeled data, including regression, regularization, and
classification methods.

Week Five
Unsupervised learning. We apply dimensionality reduction and clustering to unlabeled data.

Week Six
Time series. ARIMA models and other methods are used on time-dependent data.

Week Seven
Big Data. Hands-on experience with tools like Hadoop, Hive, and Spark for managing extremely
large data sets in a parallel computing environment.

Week Eight
NLP, web scraping, and topic modeling. This is data science as applied to the natural-language
text and recommendation engines.

Week Nine
Deep Learning. We cover the emerging world of artificial neural networks.

Weeks Ten, Eleven & Twelve


Capstone Project. Demonstrate learned skills and build portfolios in a final project.
Preparations for interviews, meet with recruiters, and resume experts.

Vous aimerez peut-être aussi