
An Approach of Data Mining Applications towards Health Care

Swati Pant (1), Gopal Datt (2)
(1, 2) Dept. of Computer Science, Surajmal Agarwal Pvt. Kanya Mahavidyalaya, Kichha, Dist. U. S. Nagar, Uttarakhand, India
E-mail: swra.525@gmail.com, gdatt1986@gmail.com

ABSTRACT

Generally, data mining is the process of analyzing data from different perspectives and summarizing it into useful information: information that can be used to increase revenue, cut costs, or both. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases. Nowadays data mining is a new field of research. The main objective of data mining is to acquire knowledge from a pool of data. In medical science, regulations and the availability of computers mean that a data pool is becoming available. Doctors, for their part, expect to use all this data in their work, but such a large amount of data cannot be processed unaided, so it cannot yet support diagnosis, prognosis, and treatment scheduling. In this paper we describe data mining tools in medical science and health care applications that can help us make accurate decisions.

KEYWORDS: Data pool, Relational databases, Data mining

INTRODUCTION

Today's information explosion on the internet has placed high [...] and data mining can be performed on data represented in quantitative, textual, or multimedia forms. Data mining is also known as knowledge discovery in databases. Data mining should be applicable to any kind of data repository. Data mining is a synonym of another popularly used term, knowledge discovery from data (KDD). This process consists of an iterative sequence of the following steps:

1. Data cleaning (to remove noise and inconsistent data)
2. Data integration (where multiple data sources may be combined)
3. Data selection (where data relevant to the analysis task are retrieved from the database)
4. Data transformation (where data are transformed or consolidated into forms appropriate for mining by performing summary or aggregation operations)
5. Data mining (an essential process where intelligent methods are applied in order to extract data patterns)
6. Pattern evaluation (to identify the truly interesting patterns representing knowledge, based on some interestingness measures)
7. Knowledge presentation (where visualization and knowledge representation techniques are used to present the mined knowledge to the user)

The term "data mining" was introduced in the 1990s, but data mining is the evolution of a field with a long history. The final step of knowledge discovery from data is to verify that the patterns produced by the data mining algorithms occur in the wider data set.

The roots of data mining are traced back along three family lines: classical statistics, artificial intelligence, and machine learning. Statistics is the foundation of most of the technologies on which data mining is built, such as regression analysis, standard distribution, standard deviation, standard variance, discriminant analysis, cluster analysis, and confidence intervals. These are used to study data and data relationships. Artificial intelligence is built on heuristics as opposed to statistics, and attempts to apply human-thought-like processing to statistical problems. Machine learning is the union of statistics and AI. It could be considered an evolution of AI, because it blends AI heuristics with advanced statistical analysis.

Although data mining is still in its infancy, companies in a wide range of industries (including retail, finance, healthcare, manufacturing, transportation, and aerospace) are already using data mining tools and techniques to take advantage of historical data. There are two main reasons to use data mining:

- There is too much data and too little information.
- There is a need to extract useful information from the data and to interpret the data.
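The seven KDD steps listed in the introduction can be made concrete with a small sketch. The Python code below is illustrative only: the two toy clinic sources, the field names, the age bands, and the minimum-count threshold are all assumptions made for this example and do not come from any real hospital system. Each function stands in for one step, with a simple frequency count playing the role of the "mining" step.

```python
from collections import Counter

# Hypothetical toy records from two sources; every field name is illustrative.
clinic_a = [
    {"id": 1, "age": 64, "diagnosis": "diabetes"},
    {"id": 2, "age": None, "diagnosis": "asthma"},  # noisy record: missing age
    {"id": 3, "age": 71, "diagnosis": "diabetes"},
]
clinic_b = [
    {"id": 4, "age": 35, "diagnosis": "asthma"},
    {"id": 5, "age": 68, "diagnosis": "diabetes"},
    {"id": 6, "age": 29, "diagnosis": "asthma"},
]

def clean(records):
    """Step 1, data cleaning: drop records that contain missing values."""
    return [r for r in records if all(v is not None for v in r.values())]

def integrate(*sources):
    """Step 2, data integration: combine several sources into one list."""
    return [r for source in sources for r in source]

def select(records, fields):
    """Step 3, data selection: keep only the fields relevant to the task."""
    return [{f: r[f] for f in fields} for r in records]

def transform(records):
    """Step 4, data transformation: consolidate exact ages into coarse bands."""
    return [
        {"age_band": "60+" if r["age"] >= 60 else "under 60",
         "diagnosis": r["diagnosis"]}
        for r in records
    ]

def mine(records):
    """Step 5, data mining: count how often each (age band, diagnosis) pair occurs."""
    return Counter((r["age_band"], r["diagnosis"]) for r in records)

def evaluate(patterns, min_count):
    """Step 6, pattern evaluation: keep patterns that meet a simple count threshold."""
    return {p: c for p, c in patterns.items() if c >= min_count}

def present(patterns):
    """Step 7, knowledge presentation: format the patterns as readable lines."""
    return [f"{band} / {dx}: {count} records"
            for (band, dx), count in sorted(patterns.items())]

data = integrate(clean(clinic_a), clean(clinic_b))
data = transform(select(data, ["age", "diagnosis"]))
patterns = evaluate(mine(data), min_count=2)
report = present(patterns)
print("\n".join(report))
```

In practice each step would be far more involved, and the process is iterative: a poor result at the evaluation step usually sends the analyst back to selection or transformation.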

Techniques in data mining:

Association rule: Association discovers interesting associations between attributes contained in a database. This technique is also known as market basket analysis. It is based on frequency counts of the items that occur together in an event.

Clustering: Clustering is often used to find an appropriate grouping of elements for a set of data. Clustering is a kind of undirected knowledge discovery or unsupervised learning; that is, there is no target field, and the relationships among the data are identified by a bottom-up approach.

Decision tree: A decision tree performs classification by constructing a tree based on training instances, with leaves having class labels. The tree is traversed for each instance to find a leaf, and the class of the leaf is the predicted class. This is directed knowledge discovery in the sense that there is a specific field whose value we want to predict.

Neural networks: A neural network is often represented as a layered set of interconnected processors. The processor nodes are frequently referred to as neurodes to indicate a relationship with the neurons of the brain. Each node has weighted connections to several other nodes.

In the short term, the results of data mining will be profitable. In the medium term, data mining may be as common and easy to use as e-mail.

Data mining consists of five major elements:

- Extract, transform, and load transaction data onto the data warehouse system.
- Store and manage the data in a multidimensional database system.
- Provide data access to business analysts and information technology professionals.
- Analyze the data by application software.
- Present the data in a useful format, such as a graph or table.
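The decision tree description above (traverse the tree for each instance until a leaf is reached, then predict the class of that leaf) can be sketched in a few lines of Python. The tree here is built by hand purely for illustration: the field names, thresholds, and risk labels are hypothetical, are not learned from training data, and are not clinical guidance.

```python
# A hand-built tree for illustration only. The fields, thresholds, and labels
# are hypothetical; a real tree would be constructed from training instances.
tree = {
    "field": "fasting_glucose", "threshold": 126,
    "low": {"label": "low risk"},
    "high": {
        "field": "bmi", "threshold": 30,
        "low": {"label": "medium risk"},
        "high": {"label": "high risk"},
    },
}

def predict(node, instance):
    """Walk from the root to a leaf and return that leaf's class label."""
    while "label" not in node:
        # Follow the "high" branch when the tested field meets the threshold.
        branch = "high" if instance[node["field"]] >= node["threshold"] else "low"
        node = node[branch]
    return node["label"]

patients = [
    {"fasting_glucose": 95, "bmi": 33},
    {"fasting_glucose": 140, "bmi": 24},
    {"fasting_glucose": 150, "bmi": 35},
]
predictions = [predict(tree, p) for p in patients]
print(predictions)
```

The sketch covers only the prediction side. Constructing the tree from training instances, which is where the actual learning happens, is left out.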

Data mining is primarily used today by companies with a strong consumer focus: retail, financial, communication, and marketing organizations. It enables these companies to determine relationships among internal factors, such as corporate positioning or staff skills, and external factors, such as economic indicators, competition, and customer demographics. It also enables them to determine the impact on sales, customer satisfaction, and corporate profits. Finally, it enables them to drill down into summary information to view detailed transaction data.

How it works: While large-scale information technology has been evolving separate transaction and analytical systems, data mining provides the link between the two. Data mining software analyzes relationships and patterns in stored transaction data based on open-ended user queries. Several types of analytical software are available: statistical, machine learning, and neural networks. Generally, four types of relationships are sought:

- Classes: stored data is used to locate data in predetermined groups.
- Clusters: data items are grouped according to logical relationships or consumer preferences.
- Associations: data can be mined to identify associations.
- Sequential patterns: data is mined to anticipate behavior patterns and trends.
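Of the four relationship types, associations are the easiest to illustrate with frequency counts. The Python sketch below computes the two standard measures used in market basket analysis: support (the fraction of transactions containing an itemset) and confidence (how often the consequent appears when the antecedent does). The transactions and item names are invented for the example, and the brute-force pair search is an assumption chosen for brevity, not an efficient algorithm such as Apriori.

```python
from itertools import combinations

# Hypothetical transactions: each set holds the items recorded in one visit.
transactions = [
    {"insulin", "test_strips", "lancets"},
    {"insulin", "test_strips"},
    {"inhaler", "spacer"},
    {"insulin", "lancets"},
    {"inhaler", "test_strips"},
]

def support(itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)

def confidence(antecedent, consequent):
    """Estimate of P(consequent | antecedent) from the frequency counts."""
    return support(antecedent | consequent) / support(antecedent)

def frequent_pairs(min_support):
    """Brute-force search for item pairs whose support meets the threshold."""
    items = sorted(set().union(*transactions))
    return [(a, b) for a, b in combinations(items, 2)
            if support({a, b}) >= min_support]

pairs = frequent_pairs(min_support=0.3)
rule_conf = confidence({"insulin"}, {"test_strips"})
print(pairs, round(rule_conf, 2))
```

A rule such as "insulin implies test_strips" is usually reported only when both its support and its confidence exceed thresholds chosen by the analyst.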

[Figure: data mining, with the labels "Raw information" and "Hidden information"]

The health care industry has a large amount of data, but it is not properly used. The health care industry is one of the largest industries. There are thousands of hospitals, clinics, and several other types of facilities. Through data mining we can find the hidden information in the database. There is some unusual data in the health care industry, and this data can be removed using data mining techniques. Data mining is also used by retail, marketing, financial, and communication organizations, where it helps in decision making. Knowledge discovery in databases is commonly defined with the following stages:

1. Selection
2. Pre-processing
3. Transformation
4. Data mining
5. Interpretation

Feelders, Daniels, and Holsheimer (2000) give six important steps in the data mining process:

1. Problem definition
2. Knowledge acquisition
3. Data selection
4. Data processing
5. Analysis and interpretation
6. Reporting and use

Michael Berthold and David Hand (1999) identify the data mining process as:

1. Definition of the objectives of analysis
2. Selection and pretreatment of the data
3. Exploratory analysis
4. Specification of statistical methods
5. Analysis of the data
6. Evaluation and comparison of the methods
7. Interpretation of the chosen model

Applications:

Generally, data mining is the process of analyzing data from different perspectives and summarizing it into meaningful information. Data mining software is one of a number of analytical tools. Nowadays the health care industry is one of the most information-intensive industries of all, where medical information, knowledge, and data keep growing daily. Medical informatics plays a very important role in the use of clinical data. In principle, data mining is not specific to one type of media or data. Data mining should be applicable to any kind of data or information repository. However, algorithms and approaches may differ when applied to different types of data. Indeed, the challenges presented by different types of data vary significantly. Data mining is being put into use and studied for databases, including relational databases, object-relational databases and object-oriented databases, data warehouses, transactional databases, and unstructured and semi-structured repositories such as the World Wide Web.

Data, Information & Knowledge

Data: The term data is often used to distinguish binary machine-readable information from textual human-readable information. Data are raw facts, figures, or text that can be processed by a computer. Nowadays organizations are accumulating vast and growing amounts of data in different formats and different databases, such as:

- operational and transactional data, such as sales, cost, inventory, payroll, and accounting
- non-operational data, such as industry sales, forecast data, and economic data
- metadata, such as logical database design or data dictionary definitions

Information: Basically, processed data is called information; the patterns, associations, or relationships among all this data can provide information. In its most technical sense, information is a sequence of symbols that can be interpreted as a message. Information is any kind of event that affects the state of a dynamic system. Conceptually, information is the message being conveyed. Generating information requires a massive collection of data. The data can range from simple numerical figures and text documents to more complex information.

Knowledge: Information can be converted into knowledge about historical patterns and future trends. Knowledge is a familiarity with someone or something, which can include facts, information, descriptions, or acquired skills. Knowledge acquisition involves complex cognitive processes.

Data mining is primarily used today by organizations and companies with a strong consumer focus. With the help of data mining, a retailer could use point-of-sale records of customer purchases to send targeted promotions based on an individual's purchase history. Data mining tools can answer questions that traditionally were too time consuming to resolve.

The healthcare industry is one of the most information-intensive industries. Medical information, knowledge, and data keep growing on a daily basis. It has been estimated that a hospital may generate five terabytes of data within a year. Computer-assisted information retrieval may help support quality decision making and avoid errors. Human decision making is often optimal, but it is poor when there are huge amounts of data to be classified. Consider a doctor who examines 5 patient records: he or she will go through them with ease. But if the number of records increases from 5 to 50 under a time constraint, it is almost certain that the accuracy with which the doctor delivers the results will not be as high as it was when there were only five records to be analyzed.

This leads to the use of data mining in medical informatics. The databases found in hospitals, namely the hospital information systems, contain large amounts of information, including patient information and data from laboratories, and they keep on growing. With the help of data mining methods and tools, useful patterns of information can be found within the data, which can be utilized for further research and evaluation of reports. Data mining for healthcare management has been instrumental in detecting patterns of diagnosis, decisions, and treatments in healthcare. Data mining has aided in several aspects of healthcare management, including disease diagnosis, decision making for treatments, medical fraud prevention and detection, fault detection in medical devices, and healthcare quality improvement strategies. Data mining was initially a success in the healthcare industry because it was used to detect fraudulent claims processing. Since then, however, large collections of transactional data, as well as data resulting from mergers and acquisitions, have enabled businesses to reduce costs by analyzing and extracting hidden patterns.

In healthcare, data mining is gradually becoming more essential, and several factors have motivated the use of data mining applications in healthcare. Data mining, the efficient discovery of valuable information from a large collection of data, was introduced as a term in the 1990s, but it is the evolution of a field with a long history. In medical science and the health care industry there is large scope for the application of data mining, for example in diagnosis of disease, health care, and patient profile and history generation. In recent years data mining has been widely used in areas of science and engineering such as bioinformatics, genetics, medicine, education, and electrical power engineering. Data mining methods have also been applied to dissolved gas analysis (DGA) in power transformers. Healthcare management has been broadly classified into these categories:

Diagnosis and treatment: Studies indicate that unaided human analysis of data for decision making is unintentionally flawed. Applying data mining to even small data sets can provide protection against error-prone unaided human inference and could consequently support improved treatment decisions. Data mining could be particularly useful in medicine when there is no dispositive evidence favoring a particular treatment option. Other key areas where data mining has proved to be an effective tool are disease diagnosis, detection, and prediction.

Health care resource management: Here the goal is to manage resource allocation effectively by identifying high-risk areas and predicting the need for and usage of various resources. A key problem in health care is measuring the flow of patients through hospitals and other health care facilities. If the inpatient length of stay can be predicted efficiently, the planning and management of hospital resources can be greatly enhanced.

Customer relationship management: The principles of applying data mining for customer relationship management in other industries are also applicable to the health care industry. The identification of usage and purchase patterns, and the eventual satisfaction they result in, can be used to improve overall customer satisfaction. The customer could be a patient, pharmacist, physician, or clinic. In many cases, prediction of purchasing and usage behavior can aid in designing proactive initiatives to reduce overall cost and increase customer satisfaction.

Fraud and anomaly detection: Data mining has been used very successfully in aiding the prevention and early detection of medical insurance fraud. The generic framework and prominent models. A sample case study is presented to demonstrate how a certain application challenge can be addressed and the value of using data mining as a tool.
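As a rough illustration of the fraud and anomaly detection category, the Python sketch below flags unusually large claim amounts using a modified z-score based on the median and the median absolute deviation (MAD). This is one common, simple outlier test, swapped in here for illustration. It is not the method of any system cited in this paper, and the provider names and amounts are invented. The constant 0.6745 and the cutoff of 3.5 follow the usual convention for this score.

```python
from statistics import median

# Hypothetical average claim amount per provider; all values are invented.
claims = {
    "provider_a": 120.0,
    "provider_b": 135.0,
    "provider_c": 110.0,
    "provider_d": 128.0,
    "provider_e": 940.0,
    "provider_f": 118.0,
}

def flag_anomalies(values, cutoff=3.5):
    """Return the names whose modified z-score exceeds `cutoff`.

    The score is 0.6745 * |x - median| / MAD, where MAD is the median
    absolute deviation. Unlike a mean-based z-score, it is not distorted
    by the very outliers it is trying to find.
    """
    med = median(values.values())
    mad = median(abs(v - med) for v in values.values())
    if mad == 0:
        return []  # all values (nearly) identical: nothing stands out
    return sorted(
        name for name, v in values.items()
        if 0.6745 * abs(v - med) / mad > cutoff
    )

flagged = flag_anomalies(claims)
print(flagged)
```

A flagged provider is only a candidate for manual review. An unusual amount may have a legitimate explanation, so a test like this supports investigators rather than replacing them.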

When medical institutions apply data mining to their existing data, they can discover new, useful, and potentially life-saving knowledge that otherwise would have remained hidden in their databases. By mining hospital records, safety issues could be flagged and addressed by hospital management and government regulators.

Conclusion:

In this paper we examined how data mining tools and methods are used in healthcare management and the healthcare industry. The paper describes how data mining is an effective method in the healthcare industry that helps doctors, patients, pharmacists, and clinics with decision making. We focused on the application of data mining to analyze data from the healthcare industry.