Vous êtes sur la page 1sur 3

Abstract:

The main aim of this paper is to expound about the knowledge discovery and
data mining.Datawarehouse brings the information from multiple
sources so as to provide a consistent databasesource for decision
support queries and off-load decision support applications from the
on-linetransaction system. Here, data is available but not information and
not the right information at ther i g h t t i m e . D a t a m i n i n g i s
extracting interest information or patte rns from data in
l a r g e databases.For processing the data now there are many traditional and
statistical methods of dataanalyses and spreadsheets are used to obtained
informative reports from data but they can’t givethe knowledge from
data. So, different algorithms are used to get knowledge from
database.Closet is an efficient algorithm which is scalable on large
databases in order to get the importantknowledge hidden inside the data.
This algorithm uses association rule. which implies
certainassociation relationships among a set of objects in a database. In this
process a set of associationrules are discovered at multiple levels of
abstraction from the relevant sets of data in a database.Association is applied
on frequent closed set items i.e... instead of mining association rules on
allthe item sets one can mine association rules on frequent closed item sets
only. This is interestinga l t e r n a t i v e i n t h i s C l o s e t a l g o r i t h m
, wh i c h i s p r o p o se d r e c e n t l y a n d h a s t h e s a me p o w e r
a s association mining but substantially reduces the number of rules
to be present and increase bothefficiency and effectiveness of mining.

Introduction:
Today computerization of many business and government
transactions related to activities anddecisions generates tremendous
amounts of data stored in databases, data warehouses and
other information repositories by large and simple transaction i.e. tax
returns, telephone calls, businesst r i p s , p e r f o r ma n c e t e s t s a n d
p r o d u c t w a r r a n t y r e g i s t r a t i o n wh i c h a r e b e i n g h a n d l e d
t h r o u g h computer .So, there is an urgent need for a new generation of
computational theories and tools toassist humans in extracting useful
information (knowledge) from the rapidly growing volumes of digital data.
A data warehouse is a subject-oriented, integrated, time-variant,
and nonvolatilecollection of data in support of management’s
decision-making process. The warehouse usuallyresides on its own
server and is separate from the transaction-processing or “run-the-
business”s ys t e ms . We a r e d r o wn i ng i n d a t a , b u t s t a r v i n g f o r
k n o wl e d g e ! S o , w e e x t r a c t i n t e r e s t i n g knowledge from data in
large databases which is known as data mining.
Knowledge Discovery in Databases:
Knowledge Discovery in Databases (KDD) is the non-trivial process of
identifying valid, novel, potentially useful, and ultimately
understandable patterns in data A KDD process includes
datacleaning, data integration, data selection, transformation, data
mining, pattern evaluation, andknowledge presentation.

Knowledge Discovery in Databases (KDD) has made great progress


incommercial, industrial, administrative and other applications. The
emerging technology KDDhaving a multi step process which uses
Data Mining Methods (Algorithms)to extract (Identify) what is hidden
knowledge in the data according to specifications of measures The basic
problemaddressed by the KDD process is one of mapping glow level
data into other forms that might bemore compact, more abstract ,or
more useful.Data mining is a step in the KDD process that consists of
applying data analysis and discoveringalgorithms. Understanding data
mining and model induction at this component level clarifies the behavior
of any data mining algorithm and makes it easier for the user to
understand its overallc o n t r i b u t i o n a n d a p p l i c a b i l i t y t o t h e
K D D p r o c e s s . Da t a mi n i n g , a l s o k no wn a s k n o wl e d g e -
discovery in databases (KDD), is the practice of automatically
searching large stores of data for

p a t t e r n s . To d o t h i s , d a t a mi n i n g u s e s c o mp u t a t i o na l
t e c h n i q u e s f r o m s t a t i s t i c s a n d p a t t e r n recognition.Problem
Analysis is based on manual procedure. The main function is to
understandingapplication domain and requirements of user related to
developing prior knowledge for domain.Selection of Target data is for
creating target data set and Selecting a data set or its subset
onwhich discovery is to be performed by automatic way. Data Processing
which is the third step of KD D p r o c e s s i n v o l v e s r e mo v i n g n o i s e /
h a n d l i n g mi s s i n g d a t a b a s e d o n a u t o ma t i c
p r o g r a m. T r a n s f o r ma t i o n o f Da t a i s a p r o c e d u r e wh i c h i s
ma d e ma n u a l l y wh e r e d a t a r e d u c t i o n a n d projection are made
and finding useful fields/features/attributes of data according to
goal of the problem. Data Mining selects data mining goal and
chooses a method according to task andknowledge .It analyzes and
verifies knowledge .It is based on automatic manner. Output Analysisand
Review evaluates the knowledge and transforms knowledge.

Vous aimerez peut-être aussi