information from the pool of data is termed as data mining. There is humungous
data available in the information industry that is useless unless converted
into beneficial information and analyzed to discover any fraudulence, buyer’s
choice, to control the manufacturing of products and understand the market
mining helps the entrepreneurs to know their customers better in a way of their
choices, the deals for their money, their income and criteria by which they
like to spend. It also gives an idea how often a customer likes to spend and
makes one capable to relate different people with similar choices.
from these it also assists in cooperate sector.
mining is categorized as “Descriptive and Classification and production” on the
basis of the type of the data.
It describes the basic feature of information in database such as:
-Mining of frequent patterns
-Mining of association
-Mining of correction
-Mining of clusters
Class- The products to be sold by the company, for example, clothes.
Concept- The money being spent by the customer, shoppers or the ones who buy in
They can be gathered in
– Data Characterization: Review the data of the class to be studied namely the ‘Target
– Data Discrimination: Comparison of the class with a designated class.
MINING OF FREQUENT PATTERNS
The products (patterns) that usually are seen in transactional data are
termed as frequent patterns.
– Frequent item set: The products that are enlisted with one another such as
top and bottom wear in clothing section.
– Frequent sub sequence: The products that are generally bought with the main
item such as buying pet food followed by pet treats.
-Frequent sub structure: Graphs, trees or various other structural forms that
are attached to sub sequences.
MINING OF ASSOCIATION
The item that are generally bought together are included in this category. With
the help of this a businessman discovers a percentage of association between
products bought together such as 60 percent of times a mobile phone is bought
with a mobile cover and 40 percent of times with screen guards.
MINING OF CORRELATION
It reveals the effect of purchase of one product over another whether it has a
negative, positive or no effect at all.
MINING OF CLUSTERS
It is grouping the like similar products from one another. Each cluster
varies from the other.
class label of some items may be unknown. Classification and prediction is one
such procedure that can be utilized to uncover the data class or concepts.
This procedure is presented as:
(a) Classification (If-Then) rules
(b) Decision trees
(c) Mathematical formulae
(d) Neural networks
model that differentiates the class or concept of the information. This model
is based on the object with a well known class label.
– Prediction: Regression analysis is brought to practice to predict the
numerical values that are unknown rather than the class label. Also it is used
to identify sale trends on the basis of data available.
-Outlier analysis: The data that does not abide by the model of data available
is an outlier data.
-Evolution analysis: It
refers to those subjects which are transitional in nature.
HOW DOES THE CLASSIFICATION WORK?
incorporates two stages:
-Building the classifier or model
– Using classifier for classification
BUILDING THE CLASSIFIER
-It is a
calculations assemble the classifier
-set made from
database tuples and related class labels
-each type is
called as classification or class are known as test/question or information
USING THE CLASSIFIER
Classifier is utilized for arrangements that include
analyzing the relevance and exactness of characterization rules and thus
linking the older and new information tuples if considered adequate.
DATA MINING TASK