Learning Spark

Transféré par

eshwar152

33% ont trouvé ce document utile (9 votes)

851 vues3 pages

spark

Copyright

Formats disponibles

PDF, TXT ou lisez en ligne sur Scribd

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Signaler ce document

spark

Droits d'auteur :

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

33% ont trouvé ce document utile (9 votes)

851 vues3 pages

Learning Spark

Transféré par

eshwar152

spark

Droits d'auteur :

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

Passer à la page

Vous êtes sur la page 1sur 3

Rechercher à l'intérieur du document

Learning

Spark

Table of Contents
1. introduction

Learning Spark

Learning Spark: Lightning-Fast Big Data Analysis

Chinese translation
Translation the book of Learning Spark: Lightning-Fast Big Data Analysis is only for spark developer educational purposes.
If I violated your copyright, please let me know.
Learning Spark: Lightning-Fast Big Data AnalysisSpark

About the Author

Holden Karau is a software development engineer at Databricks and is active in open source. She is the author of an
earlier Spark book. Prior to Databricks she worked on a variety of search and classification problems at Google,
Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelors of Mathematics in Computer
Science. Outside of software she enjoys playing with fire, welding, and hula hooping.
Most recently, Andy Konwinski co-founded Databricks. Before that he was a PhD student and then postdoc in the AMPLab
at UC Berkeley, focused on large scale distributed computing and cluster scheduling. He co-created and is a committer on
the Apache Mesos project. He also worked with systems engineers and researchers at Google on the design of Omega,
their next generation cluster scheduling system. More recently, he developed and led the AMP Camp Big Data Bootcamps
and first Spark Summit, and has been contributing to the Spark project.
Patrick Wendell is an engineer at Databricks as well as a Spark Committer and PMC member. In the Spark project, Patrick
has acted as release manager for several Spark releases, including Spark 1.0. Patrick also maintains several subsystems
of Spark's core engine. Before helping start Databricks, Patrick obtained an M.S. in Computer Science at UC Berkeley. His
research focused on low latency scheduling for large scale analytics workloads. He holds a B.S.E in Computer Science
from Princeton University
Matei Zaharia is the creator of Apache Spark and CTO at Databricks. He holds a PhD from UC Berkeley, where he started
Spark as a research project. He now serves as its Vice President at Apache. Apart from Spark, he has made research and
open source contributions to other projects in the cluster computing area, including Apache Hadoop (where he is a
committer) and Apache Mesos (which he also helped start at Berkeley).

Examples for Learning Spark

codes https://github.com/gaoxuesong/learning-spark/ forked from https://github.com/databricks/learning-spark

introduction

Vous aimerez peut-être aussi

Fast Data Processing with Spark 2 - Third Edition
D'Everand
Fast Data Processing with Spark 2 - Third Edition
Krishna Sankar
Pas encore d'évaluation
Frank Kane's Taming Big Data with Apache Spark and Python
D'Everand
Frank Kane's Taming Big Data with Apache Spark and Python
Frank Kane
Pas encore d'évaluation
Learning Spark
Document4 pages
Learning Spark
roblim1
100% (1)
Spark For Python Developers - Sample Chapter
Document32 pages
Spark For Python Developers - Sample Chapter
Packt Publishing
100% (6)
Pyspark PDF
Document239 pages
Pyspark PDF
New Mahoutsukai
Pas encore d'évaluation
Apach Spark With Scala Slides
Document187 pages
Apach Spark With Scala Slides
senthilj82
Pas encore d'évaluation
Introduction To Spark For Data Engineers / Data Scientists
Document100 pages
Introduction To Spark For Data Engineers / Data Scientists
Gabriel Vieira
100% (1)
Spark Devops
Document301 pages
Spark Devops
topimaster
0% (1)
Apache Hive Essentials - Essenti - Dayong Du
Document293 pages
Apache Hive Essentials - Essenti - Dayong Du
Anjali Kaushik
100% (1)
GuideToApacheAirflow PDF
Document6 pages
GuideToApacheAirflow PDF
Piyush Kushvaha
100% (1)
Apache Spark 24 Hours PDF
Document1 129 pages
Apache Spark 24 Hours PDF
Andrei Zhoroven
100% (5)
Learning Apache Kafka - Second Edition - Sample Chapter
Document12 pages
Learning Apache Kafka - Second Edition - Sample Chapter
Packt Publishing
Pas encore d'évaluation
Kafka and Spark Streaming
Document45 pages
Kafka and Spark Streaming
manish_gupta22
Pas encore d'évaluation
Apache Spark Python Slides
Document186 pages
Apache Spark Python Slides
Douglas Leite
Pas encore d'évaluation
Apache Kafka Cookbook - Sample Chapter
Document14 pages
Apache Kafka Cookbook - Sample Chapter
Packt Publishing
100% (1)
Cassandra Datastax
Document10 pages
Cassandra Datastax
Víctor Mandujano Gutierrez
Pas encore d'évaluation
Apache Spark Tutorial
Document36 pages
Apache Spark Tutorial
vietpine
100% (3)
Developing Elegant Workflows in Python Code With Apache Airflow
Document35 pages
Developing Elegant Workflows in Python Code With Apache Airflow
Piyush Kushvaha
100% (1)
Mastering Apache Spark
Document1 831 pages
Mastering Apache Spark
Yoganand Reddy Sankepalli
100% (1)
Mastering Apache Spark - Sample Chapter
Document24 pages
Mastering Apache Spark - Sample Chapter
Packt Publishing
Pas encore d'évaluation
Mastering Apache Spark
Document1 044 pages
Mastering Apache Spark
Arjun Singh
100% (6)
Top 200 Data Engineer Interview Question PDF
Document482 pages
Top 200 Data Engineer Interview Question PDF
Surbhi Mantri
100% (1)
Data Engineering Cookbook
Document88 pages
Data Engineering Cookbook
Faton
100% (2)
Data Engineering Cookbook
Document125 pages
Data Engineering Cookbook
JosephAffonso
100% (1)
Spark in Action: Covers Apache Spark 3 with Examples in Java, Python, and Scala
D'Everand
Spark in Action: Covers Apache Spark 3 with Examples in Java, Python, and Scala
Jean-Georges Perrin
Pas encore d'évaluation
Airflow Introduction
Document9 pages
Airflow Introduction
Paresh Bhatia
Pas encore d'évaluation
Nifi HDP PDF
Document121 pages
Nifi HDP PDF
surendra s
Pas encore d'évaluation
Data Pipelines with Apache Airflow
D'Everand
Data Pipelines with Apache Airflow
Julian de Ruiter
Pas encore d'évaluation
Modern Data Engineering: The Infoq Emag / Issue #92 / February 2021
Document43 pages
Modern Data Engineering: The Infoq Emag / Issue #92 / February 2021
kolleru
Pas encore d'évaluation
8888888888888888888
Document131 pages
8888888888888888888
kumar kumar
100% (1)
Pyspark Tutorial
Document27 pages
Pyspark Tutorial
balha
100% (1)
DataOps A Complete Guide - 2020 Edition
D'Everand
DataOps A Complete Guide - 2020 Edition
Gerardus Blokdyk
Pas encore d'évaluation
Spark Databricks Summary
Document100 pages
Spark Databricks Summary
Yolanda De la Hoz Simon
75% (4)
Hadoop With Python
Document71 pages
Hadoop With Python
CarlosEduardoC.daSilva
100% (5)
Data Science With Python Workflow: Click The Links For Documentation
Document2 pages
Data Science With Python Workflow: Click The Links For Documentation
Aditya Pisupati
Pas encore d'évaluation
Airflow DAG - Best Practices: DAG As Configuration File
Document6 pages
Airflow DAG - Best Practices: DAG As Configuration File
Deepak Mane
100% (1)
Pyspark Material
Document16 pages
Pyspark Material
gokul
Pas encore d'évaluation
Apache Spark Interview Questions Book
Document15 pages
Apache Spark Interview Questions Book
Praneeth Krishna
100% (1)
Apache Spark Interview Questions
Document12 pages
Apache Spark Interview Questions
varun3dec1
Pas encore d'évaluation
Apache Airflow
Document8 pages
Apache Airflow
Bhanu Prakash
50% (2)
User Manual - DATA IKU
Document6 pages
User Manual - DATA IKU
YARA
Pas encore d'évaluation
Spark Lab Guide Ver 3.0
Document25 pages
Spark Lab Guide Ver 3.0
ahmed_sft
Pas encore d'évaluation
Mastering Apache Spark PDF
Document541 pages
Mastering Apache Spark PDF
PratiK's ViNti
75% (4)
Apache Flink
Document116 pages
Apache Flink
Aylin Koroglu
Pas encore d'évaluation
Intro To Spark Development
Document172 pages
Intro To Spark Development
dangoldin
Pas encore d'évaluation
20 Best Practices For Working With Apache Kafka at Scale - DZone Big Data
Document10 pages
20 Best Practices For Working With Apache Kafka at Scale - DZone Big Data
Suresh Maruthirao
Pas encore d'évaluation
A Deep Dive Into Query Execution Engine of Spark SQL
Document88 pages
A Deep Dive Into Query Execution Engine of Spark SQL
maghnus
100% (2)
Hadoop MapReduce v2 Cookbook - Second Edition
D'Everand
Hadoop MapReduce v2 Cookbook - Second Edition
Thilina Gunarathne
Pas encore d'évaluation
Azure Databricks A Complete Guide - 2020 Edition
D'Everand
Azure Databricks A Complete Guide - 2020 Edition
Gerardus Blokdyk
Pas encore d'évaluation
Pyspark Commands
Document12 pages
Pyspark Commands
Rambabu Giduturi
Pas encore d'évaluation
Apache Cassandra Essentials
D'Everand
Apache Cassandra Essentials
Padalia Nitin
Évaluation : 4 sur 5 étoiles
4/5 (1)
Cloudera Developer Training
Document483 pages
Cloudera Developer Training
equest7916
100% (1)
Databricks - Spark Streaming
Document55 pages
Databricks - Spark Streaming
SlavimirVesić
Pas encore d'évaluation
Spark Interview Questions
Document7 pages
Spark Interview Questions
Rajesh Sugumaran
100% (1)
Scala and Spark For Big Data Analytics
Document874 pages
Scala and Spark For Big Data Analytics
Sneha Steevan
Pas encore d'évaluation
Spark Intreview FAQ
Document21 pages
Spark Intreview FAQ
haranadhc
100% (1)
MLOps Engineering at Scale
D'Everand
MLOps Engineering at Scale
Carl Osipov
Pas encore d'évaluation
Stream Processing Using Kafka
Document46 pages
Stream Processing Using Kafka
1himaniarora
Pas encore d'évaluation
Big Data Lake
Document218 pages
Big Data Lake
Truc Nguyen Xuan
100% (4)
10 SparkBasics
Document45 pages
10 SparkBasics
Petter P
Pas encore d'évaluation
Cec l-109-14
Document5 pages
Cec l-109-14
meta
Pas encore d'évaluation
Artificial Intelligence - Edureka
Document37 pages
Artificial Intelligence - Edureka
Technical Novice
Pas encore d'évaluation
C++ Project For Graphic Scientific Calculator
Document34 pages
C++ Project For Graphic Scientific Calculator
Manish Dey
Pas encore d'évaluation
Gas Solubility
Document59 pages
Gas Solubility
somsubhra
100% (1)
Chapter Two Second Order Ordinary Differential Equation (SOODE)
Document11 pages
Chapter Two Second Order Ordinary Differential Equation (SOODE)
Benny
Pas encore d'évaluation
Science & Cooking: From Haute Cuisine To Soft Matter Science (Chemistry)
Document2 pages
Science & Cooking: From Haute Cuisine To Soft Matter Science (Chemistry)
Truc Tran
Pas encore d'évaluation
Second Periodical Exam
Document19 pages
Second Periodical Exam
Maynard Lee Estrada Gomintong
Pas encore d'évaluation
Minggu 5 Teori Akt
Document69 pages
Minggu 5 Teori Akt
HILDA
Pas encore d'évaluation
US6362718 Meg Tom Bearden 1
Document15 pages
US6362718 Meg Tom Bearden 1
Mihai Daniel
Pas encore d'évaluation
Solution To Q9 (Vii) Tut-Sheet 3 (By Professor Santanu Dey)
Document1 page
Solution To Q9 (Vii) Tut-Sheet 3 (By Professor Santanu Dey)
Prayas Jain
Pas encore d'évaluation
HowToExcel Ebook - 50 Tips To Master Excel 2017-06-11
Document41 pages
HowToExcel Ebook - 50 Tips To Master Excel 2017-06-11
Lakshmi Meruva
Pas encore d'évaluation
Yagi Antenna Desig 00 Un Se
Document232 pages
Yagi Antenna Desig 00 Un Se
frankmhowell
100% (1)
Additive Manufacturing 2
Document24 pages
Additive Manufacturing 2
Classic Printers
Pas encore d'évaluation
785 Truck Electrical System: 8GB418-UP
Document2 pages
785 Truck Electrical System: 8GB418-UP
Edwin Ruiz Vargas
Pas encore d'évaluation
Chapter 13
Document5 pages
Chapter 13
Shrey Mangal
Pas encore d'évaluation
253 968 2 SP
Document16 pages
253 968 2 SP
Alvin MR
Pas encore d'évaluation
Transformer Health Indices
Document12 pages
Transformer Health Indices
Ingenieria APA
Pas encore d'évaluation
Chapter 08
Document30 pages
Chapter 08
Max
Pas encore d'évaluation
Finishing Engl
Document49 pages
Finishing Engl
Salim Ngaos
Pas encore d'évaluation
Tutorial 4
Document3 pages
Tutorial 4
chinnu rokz
Pas encore d'évaluation
Amt 113 - Weight and Balance Lec
Document67 pages
Amt 113 - Weight and Balance Lec
Nino Angob
Pas encore d'évaluation
2010-12 600 800 Rush Switchback RMK Service Manual PDF
Document430 pages
2010-12 600 800 Rush Switchback RMK Service Manual PDF
BrianCook
73% (11)
Part Number 27-60 Revision B: Installation, Operation, and Maintenance With Illustrated Parts Breakdown
Document66 pages
Part Number 27-60 Revision B: Installation, Operation, and Maintenance With Illustrated Parts Breakdown
Luis Eduardo Albarracin Rugeles
Pas encore d'évaluation
Solutions Manual 4th Edition
Document57 pages
Solutions Manual 4th Edition
abdul5721
100% (6)
Tactical Missile Design Presentation Fleeman
Document422 pages
Tactical Missile Design Presentation Fleeman
farhadi
100% (16)
gp2 Speed Increaser
Document2 pages
gp2 Speed Increaser
mayur22785
Pas encore d'évaluation
Calin o An Informal Introduction To Stochastic Calculus With
Document331 pages
Calin o An Informal Introduction To Stochastic Calculus With
jldelafuente
100% (5)
Questions
Document9 pages
Questions
Pluto
Pas encore d'évaluation
Worksheet Binomial Distribution Problems
Document7 pages
Worksheet Binomial Distribution Problems
onek1ed
50% (2)
Wordpress The Right Way
Document62 pages
Wordpress The Right Way
Adela CC
Pas encore d'évaluation