Vous êtes sur la page 1sur 7

A

Project Report
On

Processing Performance on Apache Haddop Hive Pig and MySql Cluster


Submitted in partial fulfillment of the
Requirements for the award of degree of
Master of Technology
In
Computer Science
By
K RAMESH
14H61D0504
Under the Guidance of
Mr. K. Raghavendra Rao
Assistant Professor, CSE Dept.

Department of Computer Science and Engineering

ANURAG GROUP OF INSTITUTIONS


(Formerly CVSR College of Engineering)
(An Autonomous Institution, Approved by AICTE and NBA Accredited)
1

Venkatapur, Ghatkesar, RR Dist., T.S-500088


(2014-2016)

Department of Computer Science and Engineering

ANURAG GROUP OF INSTITUTIONS


(Formerly CVSR College of Engineering)
(An Autonomous Institution, Approved by AICTE and NBA Accredited)
Venkatapur, Ghatkesar, RR Dist., T.S-500088

CERTIFICATE
This is to certify that the project entitled Processing Performance on Apache Hadoop Hive
Pig and Mysql Cluster being submitted by Mr. K Ramesh bearing the Hall Ticket Number
14H61D0504 in partial fulfillment of the requirements for the award of the degree of the Master
of Technology in Computer Science to Anurag Group of Institutions (Formerly CVSR
College of Engineering), Hyderabad is a record of bonafide work carried out by him under my
guidance and supervision from 2015 to 2016.
The results presented in this Project have been verified and found to be satisfactory. The
results embodied in this project report have not been submitted to any other University for the
award of any other degree or diploma.

Internal Guide
Mr. K. Raghavendra Rao
Assistant Professor, CSE

External Examiner

Dr. G. Vishnu Murthy


HOD, CSE Dept.

ACKNOWLEDGEMENT

It is my privilege and pleasure to express my profound sense of respect, gratitude and


indebtedness to my guide Mr. K. Raghavendra Rao, Assistant Professor, Department of
Computer Science, Anurag Group of Institutions (Formerly CVSR College of Engineering),for
his indefatigable inspiration, guidance, cogent discussion, constructive criticisms and
encouragement throughout this dissertation work.
I express my sincere gratitude to Dr. G. Vishnu Murthy, HOD and Professor,
Department of Computer Science and Engineering, Anurag Group of Institutions (Formerly
CVSR College of Engineering), for his suggestions, motivations and co-operation for the
successful completion of the work.

K RAMESH
14H61D0504

DECLARATION
I hereby declare that the project work entitled Processing Performance on Apache Hadoop
Hive Pig and MySql Cluster submitted to the JNTUH in partial fulfillment of the requirements
for the award of the degree of M.Tech in Computer Science is a record of an original work done
by me under the guidance of Mr. K. Raghavendra Rao and this project work have not been
submitted to any other university for the award of any other degree or diploma.

K RAMESH
14H61D0504
Date:

Processing Performance on Apache Hadoop Hive Pig and Mysql Cluster

ABSTRACT
MySql Cluster is a famous clustered database that is used to store and manipulate
data, The problem with MySql Cluster is that as the data grows larger, the time required to
process the data increases and additional resources may be needed. with hadoop and hive, pig
processing time can be faster than MySql Cluster, In this paper three data testers with same data
model will run simple queries and to find out at how many rows Hive or pig is faster than
MySql Cluster. The data model taken from GroupLens Research Project Showed a result that
Hive is the most appropriate for this data model in a low cost hardware environment.

Vous aimerez peut-être aussi