KEMBAR78
Data Mining | PDF | Business | Computers
0% found this document useful (0 votes)
49 views1 page

Data Mining

Data mining involves using sophisticated data analysis tools to discover previously unknown patterns and relationships in large datasets. It uses statistical models, algorithms, and machine learning methods to analyze and predict patterns. Data mining can examine data that is quantitative, textual, or multimedia, and uses techniques like association, sequence analysis, classification, clustering, and forecasting. Unlike traditional data analysis which tests hypotheses, data mining uses discovery approaches to simultaneously examine multiple relationships in data to identify unique or common patterns.

Uploaded by

sexy_ram22
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
49 views1 page

Data Mining

Data mining involves using sophisticated data analysis tools to discover previously unknown patterns and relationships in large datasets. It uses statistical models, algorithms, and machine learning methods to analyze and predict patterns. Data mining can examine data that is quantitative, textual, or multimedia, and uses techniques like association, sequence analysis, classification, clustering, and forecasting. Unlike traditional data analysis which tests hypotheses, data mining uses discovery approaches to simultaneously examine multiple relationships in data to identify unique or common patterns.

Uploaded by

sexy_ram22
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 1

Data Mining: An Overview

What is Data Mining?


Data mining involves the use of sophisticated data analysis tools to discover previously unknown, valid patterns and relationships in large data sets.1 These tools can include statistical models, mathematical algorithms, and machine learning methods (algorithms that improve their performance automatically through experience, such as neural networks or decision trees). Consequently, data mining consists of more than collecting and managing data, it also includes analysis and prediction. Data mining can be performed on data represented in quantitative, textual, or multimedia forms. Data mining applications can use a variety of parameters to examine the data. They include association (patterns where one event is connected to another event, such as purchasing a pen and purchasing paper), sequence or path analysis (patterns where one event leads to another event, such as the birth of a child and purchasing diapers), classification (identification of new patterns, such as coincidences between duct tape purchases and plastic sheeting purchases), clustering (finding and visually documenting groups of previously unknown facts, such as geographic location and brand preferences), and forecasting (discovering patterns from which one can make reasonable predictions regarding future activities, such as the prediction that people who join an athletic club may take exercise classes).2 As an application, compared to other data analysis applications, such as structured queries (used in many commercial databases) or statistical analysis software, data mining represents a difference of kind rather than degree. Many simpler analytical tools utilize a verification-based approach, where the user develops a hypothesis and then tests the data to prove or disprove the hypothesis. For example, a user might hypothesize that a customer who buys a hammer, will also buy a box of nails. The effectiveness of this approach can be limited by the creativity of the user to develop various hypotheses, as well as the structure of the software being used. In contrast, data mining utilizes a discovery approach, in which algorithms can be used to examine several multidimensional data relationships simultaneously, identifying those that are unique or frequently represented. For example, a hardware store may compare their customers tool purchases with home ownership, type of automobile driven, age, occupation, income, and/or distance between residence and

You might also like