Vous êtes sur la page 1sur 5

Top 5

destinations

ABSTRACT AIM
D
C
B
Most
Cancellation AIM BIG DATA
Top 10
Departure
TOOLS
Delay A

OUTPUTS DISCUSSION
Maximum
Diversion
SOLUTION :-
K= FOREACHA= air_eventair_event
FOREACH
F= FOREACH GENERATE
air_event
GENERATE
DepDelay,Origin; L= Month,
FILTER
GENERATE Year, KFlightNum,
BY

B=
FlightNum,Cancelled,Cance
(Origin IS NOT null) AND
Origin,Dest;
(DepDelayllationCode;
IS NOT null);
G=FILTER
FILTER FA
BYBY Dest IS NOT
TOOLS
M= GROUP(CancellationCode=='B');
N= FOREACH
L BY Origin;
null; M GENERATE
H= GROUP group,
G BY Month;
H
C=I= GROUP
AVG(L.DepDelay);
$1 desc; D=
FOREACHO=HBORDER
BY Dest;
FOREACH C GENERATE
group,
N BY
GENERATE
G
B
Result_air3=COUNT(G.Cancelled);
group,LIMIT O 10; (B.FlightNum);
COUNT
J= ORDER I BY $1 desc; F
E=Result_air2=
ORDER DLIMITBY $1
J 1; desc;
Result_air1= LIMIT E 5;
E

PIG TABLEAU
Problem Statement
Which route 3 has seen the maximum diversion?
(origin & destination)

Top tenP=origins
FOREACHwith the highest AVG
air_event
Top ten
GENERATE
Which
Which route (origin origins
Find out the
month
& with
has
destination) topthe5has
seen highest
most AVG
visited
the most destinations.
number
departure delay
departuredue
Origin,Dest,Diverted;
ofFILTER
cancellations delay
seen the maximum
Q= (Origin IS NOT to bad weather?
P BYdiversion?
null) AND (Dest IS NOT null)
AND (Diverted==1);
R= GROUP Q BY (Origin,Dest);
S= FOREACH R GENERATE
group, COUNT(Q.Diverted);
T= ORDER S BY $1 desc;
Result _air4= LIMIT T 1;
B

M L K J I
DISCUSSION Hence in the given Project we have analyzed a
given flight data. There are around four
queries.

These queries mainly consistent delay number


of flight, origins, number of flight, destination.
..THANK YOU

Vous aimerez peut-être aussi