Académique Documents
Professionnel Documents
Culture Documents
Data mining has attracted a great deal of attention in the information industry
and in society as a whole in recent years, due to the availability of huge amounts
of data and the imminent need for turning such data into useful information and
knowledge. Today as more data are gathered, with the amount of data doubling
every three years, Data Mining is becoming an increasingly important tool to
transform these data into information. It is commonly used in a wide range of
profiling practices, such as marketing, surveillance, fraud detection and
scientific discovery.
INTRODUCTION
Data mining is the process of extracting patterns from data It is the process
of extraction of interesting (nontrivial, implicit, previously unknown and
potentially useful) patterns or knowledge from huge amount of data. It is the set
of activities used to find new, hidden or unexpected patterns in data or unusual
patterns in data. Using information contained within data warehouse, data
mining can often provide answers to questions about an organization that a
decision maker has previously not thought to ask.
Data mining techniques are the result of a long process of research and product
development. This evolution began when business data was first stored on
computers, continued with improvements in data access, and more recently,
generated technologies that allow users to navigate through their data in real
time. Data mining takes this evolutionary process beyond retrospective data
access and navigation to prospective and proactive information delivery. From
the user’s point of view, the following four steps were revolutionary because
they allowed new business questions to be answered accurately and quickly.
The field of data mining has been growing in leaps and bounds, and has shown
great potential for the future. What is the future of data mining? Certainly, the
field has made great strides in past years, and many industry analysts and
experts in the area feel that the future will be bright. There is definite growth
in the area of data mining. Many industry analysts and research firms have
projected a bright future for the entire data mining area, and its related area of
CRM (customer relationship management). The growth in the CRM Analytic
application market had approached 54.1% per year through 2003. In addition,
data mining projects had grown by more than 300% by the year 2002. By
2003, over 90% of consumer-based industries with e-commerce orientation
had utilized some kind of data mining model. As mentioned previously, the
field of data mining is very broad, and there are many methods and
technologies which have become dominant in the field.
• Rule induction: The extraction of useful if-then rules from data based
on statistical significance.
1. Identify The Objective -- Before you begin, be clear on what you hope
to accomplish with your analysis. Know in advance the business goal of the
data mining. Establish whether or not the goal is measurable. Some possible
goals are to
2. Select The Data -- Once you have defined your goal, your next step is
to select the data to meet this goal. This may be a subset of your data
warehouse or a data mart that contains specific product information. It may
be your customer information file. Segment it as much as possible the
scope of the data to be mined. Here are some key issues.
• Are the data adequate to describe the phenomena the data mining
analysis is attempting to model?
• Can you enhance internal customer records with external lifestyle and
demographic data?
• Are the data stable—will the mined attributes be the same after the analysis?
• If you are merging databases can you find a common field for linking them?
• How current and relevant are the data to the business goal?
3.Prepare The Data -- Once you've assembled the data, you must
decide which attributes to convert into usable formats. Consider the input of
domain experts—creators and users of the data.
You can postpone some of these decisions until you select a data-mining
tool. For example, if you need a neural network or polynomial network you
may have to transform some of your fields.
Balance the objective assessment of the structure of your data against your
users' need to understand the findings. Neural nets, for example, don't explain
their results.
10. Integrate The Solution -- Share the findings with all interested
end-users in the appropriate business units. You might wind up incorporating
the results of the analysis into the company's business procedures. Some of
the data mining solutions may involve
Although data mining tools automate database analysis, they can lead to faulty
findings and erroneous conclusions if you're not careful. Bear in mind that data
mining is a business process with a specific goal—to extract a competitive
insight from historical records in a database.
The telecommunication industry has quickly evolved from offering local and
long distance telephone services to provide many other comprehensive
communication services including voice, fax, pager, cellular phone, images, e-
mail, computer and web data transmission and other data traffic. The integration
of telecommunication, computer network, Internet and numerous other means of
communication and computing are underway. Moreover, with the deregulation
of the telecommunication industry in many countries and the development of
new computer and communication technologies, the telecommunication market
is rapidly expanding and highly competitive. This creates a great demand from
data mining in order to help understand business involved, identify
telecommunication patterns, catch fraudulent activities, make better use of
resources, and improve the quality of services.
• Healthcare
• Application Exploration:
Earlier data mining was mainly used for helping businesses gain a competitive
edge. But as data mining is becoming more popular it is gaining wide
acceptance in other fields also such as biomedicine, stock market, fraud
detection, telecommunication and many more. And many new explorations are
being done for this purpose. In addition for data mining for business continues
to expand as e-commerce and marketing becomes mainstream elements of the
retail industry. As generic data mining systems may have limitations in dealing
with application-specific problems, we may see a trend toward the development
of more application– specific data mining systems.
The current data mining methods capable of handling only a particular type of
data and limited amount of data, but as data is expanding at a massive rate, there
is a need to develop new data mining methods which are scalable and can
handle different types of data and large volume of data. The data mining
methods should be more interactive and user friendly. One important direction
towards improving the repair efficiency of the timing process while increasing
user interaction is constraint-based mining. This provide user with more control
by allowing the specification and use of constraints to guide data mining
systems in their search for interesting patterns.
DATA MINING: Applications and Trends 13
• Combination of data mining with database
systems, data warehouse systems, and web database
systems
Database systems, data warehouse systems, and WWW are loaded with huge
amounts of data and have thus become the major information processing
systems. It is important to make sure that data mining serves as essential data
analysis component that can be easily included in to such an information-
processing environment. The desired architecture for data mining system is the
tight coupling with database and data warehouse systems. Transaction
management query processing, online analytical processing and online
analytical mining should be integrated into one unified framework.
The complex types of data like geospatial, multimedia, time series, sequence
and text data poses an important research area in field of data mining. There is
still a huge gap between the needs for these applications and the available
technology.
• Web mining
DATA MINING: Applications and Trends 14
The World Wide Web is huge collection of globally distributed collection of
news, advertisements, consumer records, financial, education, government, e-
commerce and many other services. The WWW also contains huge and dynamic
collection hyper linked information, providing a huge source for data mining.
Based on the above facts, the
Web also poses great challenges for efficient resource and knowledge discovery.
5.REFERENCES