In this article you will learn what data mining is, in simple terms, and how it works. A data warehouse can be regarded as a type of mine, where the data is the ore and the new
useful information is the precious find. Data mining is the process of selecting, exploring and modeling large amounts of data to discover previously unknown relationships that can support decision making.
Stages in Data Mining:
The phases start with the raw data and finish with the extracted knowledge, which is acquired as a result of the following stages:
-Selection:
Selecting or segmenting the data according to some criteria, e.g. all those people who own a car. In this way subsets of the data can be determined.
-Preprocessing:
This is the data cleansing stage, where information that is deemed unnecessary and may slow down queries is removed; for example, it is unnecessary to note the sex of a patient when studying pregnancy. The data is also reconfigured to ensure a consistent format, because data drawn from several sources may be formatted inconsistently, e.g. sex may be recorded as f or m and also as 1 or 0.
-Transformation:
The data is not merely transferred across but transformed, in that overlays may be added, such as the demographic overlays commonly used in market research. The data is made usable and navigable.
-Data Mining:
This stage is concerned with the extraction of patterns from the data. A pattern can be defined as follows: given a set of facts F, a language L, and some measure of certainty C, a pattern is a statement S in L that describes relationships among a subset Fs of F with a certainty c, such that S is simpler in some sense than the enumeration of all the facts in Fs.
-Interpretation and evaluation:
The patterns identified by the system are interpreted into knowledge, which can then be used to support human decision making, e.g. prediction and classification tasks, summarizing the contents of a database, or explaining observed phenomena.
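The first two stages above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the records, the field names, and the helper functions `select_car_owners` and `normalize_sex` are made up for this example, and the text does not say which of 1 and 0 stands for which sex, so the mapping below is a guess.

```python
# Hypothetical records merged from two sources that code sex differently.
records = [
    {"name": "Ann", "sex": "f", "owns_car": True},
    {"name": "Bob", "sex": "M", "owns_car": False},
    {"name": "Cara", "sex": 0, "owns_car": True},
    {"name": "Dan", "sex": 1, "owns_car": True},
]

# Assumed mapping: the article does not state what 1 and 0 mean.
SEX_CODES = {"f": "F", "m": "M", 0: "F", 1: "M"}


def select_car_owners(rows):
    """Selection: keep only the rows that meet the criterion."""
    return [row for row in rows if row["owns_car"]]


def normalize_sex(rows):
    """Preprocessing: rewrite every sex value into one consistent format."""
    cleaned = []
    for row in rows:
        value = row["sex"]
        # Letter codes may differ in case; numeric codes are used as they are.
        key = value.lower() if isinstance(value, str) else value
        cleaned.append({**row, "sex": SEX_CODES[key]})
    return cleaned


car_owners = select_car_owners(records)
prepared = normalize_sex(car_owners)
for row in prepared:
    print(row["name"], row["sex"])
```

Selection runs first here so that the cleaning step only touches the subset that will actually be mined; with real data the order and the cleaning rules depend on the sources involved.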
Use of Data Mining:
Some data mining tools are complex statistical analysis applications, and others use additional tools which go beyond statistical analysis and hypothesis testing. While some tools help find predefined relationships and ratios, other techniques are also used in data mining, including artificial intelligence techniques in decision support and expert systems.
To illustrate the difference between traditional queries and data mining queries, consider the following examples. A typical traditional query would be: "What is the relationship between the amount of Product A and Product B that we sold over the past quarter?" A typical data mining query would be: "Discover two products most likely to sell together on a weekend." The latter query lets the software find patterns that would otherwise not be detected through observation. While data has traditionally been used to see whether this or that pattern exists, data mining allows you to ask what patterns exist. Thus, some experts say that in data mining you let the computer answer questions that you do not know to ask. The combination of data warehousing techniques and data mining software makes it easier to predict future outcomes based on patterns discovered within historical data.
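As a rough illustration of the weekend query above, the sketch below counts how often each pair of products appears in the same weekend purchase and reports the most frequent pair. The transactions, product names, and the function `top_weekend_pair` are invented for this example, and simple pair counting is just one basic way to approach such a question, not the method any particular data mining tool uses.

```python
from collections import Counter
from datetime import date
from itertools import combinations

# Hypothetical sales: (purchase date, set of products bought together).
transactions = [
    (date(2024, 6, 1), {"bread", "butter", "jam"}),   # Saturday
    (date(2024, 6, 2), {"bread", "butter"}),          # Sunday
    (date(2024, 6, 3), {"bread", "jam"}),             # Monday, ignored
    (date(2024, 6, 8), {"butter", "milk"}),           # Saturday
    (date(2024, 6, 9), {"bread", "butter", "milk"}),  # Sunday
]


def top_weekend_pair(rows):
    """Return the most frequent product pair in weekend purchases and its count."""
    pair_counts = Counter()
    for day, basket in rows:
        if day.weekday() >= 5:  # weekday() gives 5 for Saturday, 6 for Sunday
            # Sorting makes ("bread", "butter") and ("butter", "bread") one key.
            for pair in combinations(sorted(basket), 2):
                pair_counts[pair] += 1
    return pair_counts.most_common(1)[0] if pair_counts else None


print(top_weekend_pair(transactions))
```

Note that nobody told the program which two products to compare; it examined every pair and surfaced the strongest one, which is the difference from the traditional query about Product A and Product B.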