Académique Documents
Professionnel Documents
Culture Documents
Abstract— Data mining is the process of extracting the useful data, patterns and trends from a large amount of data by
using techniques like clustering, classification, association and regression. In this paper we are going to discuss about various
Data mining applications and techniques used recently.
1. INTRODUCTION
Data mining is the procedure of extracting knowledge from available data. The result extracted from those data can be used
in various applications like Market analysis, Data analysis in various industries like information technology, health care and
various other areas. There are two categories of functions involved in data mining they are Descriptive model and Predictive
model. Whereas Predictive model helps to analyse the data which is available assuming the unknown or unavailable data in
future, in case of Descriptive model it analyse the data with patterns and comes with new and significant information. They
follow different pattern in analysing the data. Data mining is the process of finding the information from unknown facts
and data which helps in decision making.
Data come from various resources are combined as single entity data called target data.
Data from target data are pre-processed and transformed into processed data.
The result evolved is been transformed to form some patterns by applying some data mining algorithms.
Finally those results are interpreted to form some useful information called knowledge.
Prediction is a method used in predicting the outcome based on available data by incorporating with unavailable
data sets of future.IT is mainly used in Market Analysis, Medical diagnosis, Fraud detection etc.,
Time series Analysis is a sequence of event based on the earlier events the next event is analysed. It replicates the
process being measured and there are certain components that affect the performance of a process. This includes
methods to analyse time-series data in order to extract useful patterns, trends, rules and statistics. Stock market
prediction is an important application of time- series analysis.[19]
Association rule analysis is a descriptive data mining function, which involves discovering patterns or associations
between the data. It is a method for finding the interesting relations between variables in large databases. It is
intended to identify strong rules discovered in databases using some measures of interestingness. Associative
classification evolved by combining associative rule and classification methodology. This helps to identify interesting
correlations, frequent patterns, associations or casual structures.[9]
Clustering is a process of partitioning a data set into clusters which contains set of meaningful sub-classes. It is a
descriptive data mining functions which assembles similar objects in the same cluster and dissimilar objects in
different clusters. It is a main task of exploratory data mining, and a common technique for statistical data analysis,
used in pattern, machine learning, image analysis, information retrieval, and bioinformatics.
Summarization is a simplification of data where a set of data is been summarized that results the combined
information of the data. This method helps us to identify the customer behaviour in sales and purchase of the
product.
7. CONCLUSIONS
Data mining plays a significant role in finding the different patterns and helps in discovering new ideas from a large datasets.
In this we try to group all the different Data Mining techniques based on their category of functions. With the help of these
techniques it is helping the business to take decisions on available data sets. It is also been said about the various application
areas. In future this paper aims to combine different algorithms used in different techniques and various applications that is
involved in those categories. Thus Data Mining is said to be a promising interdisciplinary area in the field of research.
REFERENCES
[1]. PhridviRaj MSB., GuruRao CV (2013) Data mining – past, present and future – a typical survey on data streams. INTER-ENG
Procedia Technology 12:255 – 263
[2]. Nikhil Bhongade.,ShrikantAkarte.,GauravSawaleA Review On:Data Mining Tools and Application.IJSTMR.,Volume 2, Issue
5, May 2017
[3]. Demšar J, Zupan B (2013) Orange: Data Mining Fruitful and Fun - A Historical Perspective. Informatica 37:55–60
[4]. SangeetaGoele, NishaChanana, “Data Mining Trend In Past, Current And Future,” International Journal of Computing & Business
Research, in Proc. I-Society 2012, 2012.
[5]. https://www.dragon1.com/terms/data-mining-definition
[6]. Spatial Data Analysis: Models, Methods and Techniques,by Jinfeng Wang and Manfred M. Fischer,5 August 2011
[7]. Usama Fayyad, Gregory Piatetsky-Shapiro, and Padhraic SmythFrom Data Mining to knowledge discovery in databases
[8]. DhruvMadanGopal, Aditya R, C Vishnu Kumar Reddy, Gautham S, Nagarathna N, “A Survey on Data Mining Applications,
Techniques and Challenges in Healthcare,” International Journal of Emerging Technologies and Innovative Research, April 2015, Volume 2,
Issue 4
[9]. P.Haripriya, R. Porkodi,A Survey Paper on Data mining Techniques and Challenges in Distributed DICOM, IJARCCE, Vol. 5,
Issue 3, March 2016
[10]. Indexing Techniques for Data Warehouses’ Queries,SirirutVanichayobon Le Gruenwald.,The University of Oklahoma School of
Computer Science Norman, OK, 73019.
[11]. R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, and A. InkeriVerkamo. Fast discovery of association rules. In U. Fayyad and
et al, editors, Advances in Knowledge Discovery and Data Mining, pages 307– 328. AAAI Press, Menlo Park, CA, 1996.
[12]. Indexing and Data Access Methods for Database Mining,Ganesh Ramesh , William A. Maniatty,Dept. of Computer
Science,University at Albany ,Albany, NY 12222,Mohammed J. Zaki ,Dept. of Computer Science ,Rensselaer Polytechnic Institute,Troy, NY
12180.
[13]. Gary M. Weiss, Ph.D.,Brian D. Davison, Ph.D.,Data Mining,To appear in the Handbook of Technology Management, H. Bidgoli
(Ed.), John Wiley and Sons, 2010
[14]. Jerome H.Friedman, Department of statistics,Stanford Linear Accelerator Center, Stanford University, Stanford, CA 94305.
[15]. Tukey, J.W.(1962).The Future of Data Analysis, Ann.Statist.33,1-67.
[16]. David J. HAND., Data Mining: Statistics and More?
[17]. Bhandari,I.,Colet,E.,Parker,J.,Pines,Z.,Pratap,R., and Ramanujam,K.(1997),.Advanced Scout:Data Mining and Knowledge
Discovery in NBA Data.,Data Mining and Knowledge Discovery,1,121.125.
[18]. https://www.ibm.com/developerworks/library/ba-data-mining-techniques/
[19]. http://www.wideskills.com/data-mining/data-mining-applications
[20]. http://www.comp.dit.ie/btierney/Oracle11gDoc/datamine.111/b28129/regress.htm
[21]. Arun K Pujari, “Data Mining Techniques”, Universities India Private Limited, Hyderabad, 2001.
[22]. R. Tamilselvi., S. Kalaiselvi.,An Overview of Data Mining Techniques and Applications.,IJSR, India Online ISSN: 2319‐7064.
[23]. Jiawei Han and MichelineKamber (2006), Data Mining Concepts and Techniques, published by Morgan Kauffman, 2nd ed.
[24]. Mrs.Bharati M. Ramageri.,Data Mining Techniques and Applications.,Indian Journal of Computer Science and Engineering Vol. 1
No. 4 301-305
[25]. R.Chinnaiyan, Dr.S.Somasundaram, “Reliability Assessment of component based software systems using Test Suite -
A Review”, Journal of Computer Applications, Vol – 1, No.4, Oct – Dec 2008.
[26]. R.Chinnaiyan, Dr.S.Somasundaram, “An Experimental Study on Reliability Estimation of GNU Compiler
Components – A Review”, International Journal of Computer Applications (0975 – 8887) Volume 25– No.3, July 2011.
[27]. R.Chinnaiyan, Dr.S.Somasundaram, “Reliability of Component Based Software With Similar Software Components
- A Review”, i-Manager's Journal on Software Engineering; Nagercoil5.2 (Oct-Dec 2010).
[28]. R Chinnaiyan, S Somasundaram, “Monte Carlo Simulation For Reliability Assessment of Component Based Software
Systems”, i-Manager's Journal on Software Engineering, 2010.
[29]. R Chinnaiyan, S Somasundaram, “RELIABILITY ESTIMATION OF COMPONENT BASED
SOFTWARE SYSTEMS THROUGH MARKOV PROCESS”, International Journal of Mathematics, Computer Sciences and
Information Technology.
[30]. R.Chinnaiyan, S.Somasundaram(2010) “Evaluating the Reliability of Component Based Software Systems
“ ,International Journal of Quality and Reliability Management , Vol. 27, No. 1., pp. 78-88
[31]. R.Chinnaiyan, S.Somasundaram(2011), “An Experimental Study on Reliability Estimation of GNU
Compiler Components - A Review”, International Journal of Computer Applications,Vol.25, No.3, July 2011, pp.13-16.
[32]. R.Chinnaiyan, S.Somasundaram(2009) “Reliability of Object Oriented Software Systems using communication
variables – a review “ ,International Journal of Software Engineering, Vol.2, No.2,PP.87-96