0% found this document useful (0 votes)

43 views3 pages

Question On Data Mining

The document provides an overview of the R programming language, including its data types, structures, and functions for data manipulation. It also covers concepts in data mining, such as classification, regression, and clustering, along with machine learning techniques like supervised and unsupervised learning. Additionally, it introduces RStudio as an IDE for R and explains how to create user-defined functions.

Uploaded by

Surajit Acharya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views3 pages

Question On Data Mining

Uploaded by

Surajit Acharya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 3

1. What is R?

R is a programming language and environment widely used for solving data science
problems and particularly designed for statistical

2. List and define some basic data types in R.

There are a few data types in R, including:
Numeric—decimal numbers.
Integer—whole numbers.
Character—a letter,number, or symbol, or any combination of them, enclosed in
regular or single quotation marks.
Factor—categories from a predefined set of possible values, often with an intrinsic
order.
Logical—the Boolean values TRUE and FALSE, represented under the hood as 1 and 0,
respectively.

3. List and define some basic data structures in R.

Vector => a one-dimensional data structure used for storing values of the same data
type.
List => a multi-dimensional data structure used for storing values of any data type
and/or other data structures.
Matrix => a two-dimensional data structure used for storing values of the same data
type.
Data frame => a two-dimensional data structure used for storing values of any data
type, but each column must store values of the same data type.

4. How to import data in R?

The base R provides essential functions for importing data:
read.table()—the most general function of the base R for importing data, takes in
tabular data with any kind of field separators, including specific ones, such as |.
read.csv()—comma-separated values (CSV) files with . as the decimal separator.

5. What is a package in R, and how do you install and load packages?

To install an R package directly from CRAN, we need to pass the package name
enclosed in quotation marks to the install.packages()
To load an installed R package in the working R environment, we can use either
library() or require() functions.

6. How do you add a new column to a data frame in R?

Using the $ symbol:

df <- data.frame(col_1=10:13, col_2=c("a", "b", "c", "d"))
print(df)

df$col_3 <- c(5, 1, 18, 16)

print(df)

Using square brackets:

df <- data.frame(col_1=10:13, col_2=c("a", "b", "c", "d"))
print(df)

df["col_3"] <- c(5, 1, 18, 16)

print(df)

Using the cbind() function:

df <- data.frame(col_1=10:13, col_2=c("a", "b", "c", "d"))
print(df)

df <- cbind(df, col_3=c(5, 1, 18, 16))

print(df)
7. What is RStudio?
RStudio is an open-source IDE (integrated development environment) that is widely
used as a graphical front-end for working with the R programming language starting
from version 3.0.1. It has many helpful features that make it very popular among R
users:
User-friendly
Flexible
Multifunctional
Allows creating reusable scripts
Tracks operational history
Autocompletes the code
Offers detailed and comprehensive help on any object

8. How to create a user-defined function in R?

To create a user-defined function in R, we use the keyword function and the
following syntax:
function_name <- function(parameters){
function body
}

9.What is Data Mining?

Data mining refers to extracting or mining knowledge from large amounts of data. In
other words, Data mining is the science, art, and technology of discovering large
and complex bodies of data in order to discover useful patterns.

10. What are the different tasks of Data Mining?

The following activities are carried out during data mining:

Classification
Clustering
Association Rule Discovery
Sequential Pattern Discovery
Regression
Deviation Detection

11. What is Classification?

Classification is the processing of finding a set of models (or functions) that
describe and distinguish data classes or concepts, for the purpose of being able to
use the model to predict the class of objects whose class label is unknown.
Classification can be used for predicting the class label of data items.

12. What is Prediction?

Prediction can be viewed as the construction and use of a model to assess the class
of an unlabeled object, or to measure the value or value ranges of an attribute
that a given object is likely to have. In this interpretation, classification and
regression are the two major types of prediction problems where classification is
used to predict discrete or nominal values, while regression is used to predict
incessant or ordered values.

13. What is Decision tree?

A Decision tree is a classification scheme that generates a tree and a set of
rules, representing the model of different classes, from a given data set.

14. Explain Bayesian classification in Data Mining?

A Bayesian classifier is a statistical classifier. They can predict class
membership probabilities, for instance, the probability that a given sample belongs
to a particular class. Bayesian classification is created on the Bayes theorem. A
simple Bayesian classifier is known as the naive Bayesian classifier to be
comparable in performance with decision trees and neural network classifiers.

15. What do you understand by the term Cluster Analysis?

In the context of Data Mining, the term cluster analysis is an important type of
analysis that is used in market research, pattern recognition, data analysis, and
image processing, etc.

16. What is regression in Data mining?

Regression is used to evaluate or measure the change in one variable with respect
to another, establishing a linear relationship between them.

17. What is KMeans clustering?

The KMeans algorithm clusters data by trying to separate samples in n groups of
equal variance, minimizing a criterion known as the inertia or within-cluster sum-
of-squares. This algorithm requires the number of clusters to be specified. It
scales well to large number of samples and has been used across a large range of
application areas in many different fields.

18. What is supervised learning?

Supervised learning is the machine learning task of inferring a function from
labeled training data. The training data consist of a set of training examples. In
supervised learning, each example is a pair consisting of an input object
(typically a vector) and a desired output value (also called the supervisory
signal).

19. What is unsupervised learning?

Unsupervised learning is a type of machine learning algorithm used to draw
inferences from datasets consisting of input data without labeled responses. The
most common unsupervised learning method is cluster analysis, which is used for
exploratory data analysis to find hidden patterns or grouping in data.

Data Science Interview Questions
No ratings yet
Data Science Interview Questions
31 pages
R Programming for Data Science
No ratings yet
R Programming for Data Science
13 pages
Data Science Selection Questions and Their Answers 2022
No ratings yet
Data Science Selection Questions and Their Answers 2022
5 pages
Ds Revision 1
No ratings yet
Ds Revision 1
5 pages
Da Question Bank
No ratings yet
Da Question Bank
7 pages
UNIT 4 Data Science Notes
100% (1)
UNIT 4 Data Science Notes
4 pages
Dwdmsem 6 QB
No ratings yet
Dwdmsem 6 QB
13 pages
B Ei
No ratings yet
B Ei
44 pages
Cls10datascience 24082024 113123
No ratings yet
Cls10datascience 24082024 113123
4 pages
Data Science Interview
No ratings yet
Data Science Interview
132 pages
A) What Is Big Data?
No ratings yet
A) What Is Big Data?
7 pages
R Viva Questions
100% (1)
R Viva Questions
4 pages
BA Viva Questions
No ratings yet
BA Viva Questions
8 pages
Data Science Q&A for Class X
No ratings yet
Data Science Q&A for Class X
4 pages
Data Science
No ratings yet
Data Science
49 pages
Statistics and ML
No ratings yet
Statistics and ML
11 pages
Crack Data Science Interview 1731300339
No ratings yet
Crack Data Science Interview 1731300339
132 pages
Important Questions
No ratings yet
Important Questions
26 pages
Simplified Viva EDA
No ratings yet
Simplified Viva EDA
7 pages
Data Analytics 2marks PDF
100% (1)
Data Analytics 2marks PDF
13 pages
Quiz 4 5 6
No ratings yet
Quiz 4 5 6
11 pages
Unit 1
No ratings yet
Unit 1
34 pages
Da 1733591326
No ratings yet
Da 1733591326
132 pages
Minor Unit 3-5 2 Marks
No ratings yet
Minor Unit 3-5 2 Marks
4 pages
Data Science
100% (1)
Data Science
7 pages
Data Science
No ratings yet
Data Science
32 pages
5 What Is Data-WPS Office
No ratings yet
5 What Is Data-WPS Office
19 pages
Unit 2
No ratings yet
Unit 2
2 pages
2 Marks With Answers
No ratings yet
2 Marks With Answers
12 pages
Data Science QnA
No ratings yet
Data Science QnA
15 pages
UNIT 4 Data Science
No ratings yet
UNIT 4 Data Science
7 pages
Data Science Tool Box Important Viva Question
No ratings yet
Data Science Tool Box Important Viva Question
14 pages
DM - Midsem - Question Bank
No ratings yet
DM - Midsem - Question Bank
5 pages
Data Science Short Notes
No ratings yet
Data Science Short Notes
21 pages
Data Warehouse 1
No ratings yet
Data Warehouse 1
21 pages
Top 80 R Programming Interview Questions
No ratings yet
Top 80 R Programming Interview Questions
11 pages
ADS Viva
No ratings yet
ADS Viva
55 pages
Top Data Science Interview Questions and Answers in 2023 PDF
100% (1)
Top Data Science Interview Questions and Answers in 2023 PDF
14 pages
R Basic and Data Mining Methods Basics
No ratings yet
R Basic and Data Mining Methods Basics
2 pages
Machine Learning Viva Questions
No ratings yet
Machine Learning Viva Questions
6 pages
R Basic Viva Questions
No ratings yet
R Basic Viva Questions
3 pages
R Programming 2 MARKS
No ratings yet
R Programming 2 MARKS
12 pages
R Programming
No ratings yet
R Programming
7 pages
Data Minig Anwers
No ratings yet
Data Minig Anwers
37 pages
Data Science
No ratings yet
Data Science
28 pages
DS - Sample Questions (Practical)
No ratings yet
DS - Sample Questions (Practical)
8 pages
Datascience Interview
100% (1)
Datascience Interview
31 pages
PI Kit - MBA Admissions 2023
No ratings yet
PI Kit - MBA Admissions 2023
50 pages
Data Analysis Questions
No ratings yet
Data Analysis Questions
6 pages
Ch-04: Data and Analysis - Short Question and Answers - PDF
No ratings yet
Ch-04: Data and Analysis - Short Question and Answers - PDF
10 pages
CS3352-QB Fds
No ratings yet
CS3352-QB Fds
12 pages
Data Science and Big Data Analysis Mcqs
No ratings yet
Data Science and Big Data Analysis Mcqs
53 pages
Data Scientist Interview Questions and Answers PDF
No ratings yet
Data Scientist Interview Questions and Answers PDF
37 pages
4 (John Stredwick) Introduction To Human Resource Ma
No ratings yet
4 (John Stredwick) Introduction To Human Resource Ma
61 pages
DS Notes BCA
No ratings yet
DS Notes BCA
16 pages
Daily
No ratings yet
Daily
1 page
App List Ta
No ratings yet
App List Ta
6 pages
Chapter 17 Trigonometric Ratios
No ratings yet
Chapter 17 Trigonometric Ratios
72 pages
MTech CSE Syllabus June2019
No ratings yet
MTech CSE Syllabus June2019
60 pages
SC Lab Manual 2017 18
No ratings yet
SC Lab Manual 2017 18
36 pages
ICSE Bluej Logical Programs
No ratings yet
ICSE Bluej Logical Programs
23 pages
Java Number Exercises
0% (1)
Java Number Exercises
24 pages
IY3 QTQ FM LJTJA0 LL
No ratings yet
IY3 QTQ FM LJTJA0 LL
5 pages
Nokia E72 UG Ar
No ratings yet
Nokia E72 UG Ar
147 pages
Sprayit Gravity Feed Spray Gun SPRAYIT
No ratings yet
Sprayit Gravity Feed Spray Gun SPRAYIT
8 pages
MCQ Bank For Promotion Test - UDC LDC Assistant DEO DPS Associate Steno
No ratings yet
MCQ Bank For Promotion Test - UDC LDC Assistant DEO DPS Associate Steno
354 pages
The Sugar Revolution Activity Part 1
No ratings yet
The Sugar Revolution Activity Part 1
7 pages
Belimo NMV-D3-MFT
No ratings yet
Belimo NMV-D3-MFT
10 pages
Covid Certificate
No ratings yet
Covid Certificate
1 page
Serrano v. Central Bank of The Philippines
No ratings yet
Serrano v. Central Bank of The Philippines
2 pages
Intermediate Microeconomics: Market Demand
No ratings yet
Intermediate Microeconomics: Market Demand
4 pages
Disopacija Pornih Pritisaka Delft
No ratings yet
Disopacija Pornih Pritisaka Delft
11 pages
ELECTIVE
No ratings yet
ELECTIVE
5 pages
BoardingCard 345631232 VNO BVA
No ratings yet
BoardingCard 345631232 VNO BVA
1 page
MANM519 - Week 3 AI Jobs and Future of Work - Lecture Notes
No ratings yet
MANM519 - Week 3 AI Jobs and Future of Work - Lecture Notes
12 pages
Led (0603) R
No ratings yet
Led (0603) R
11 pages
Ujian Bulan Mac: Bahasa Inggeris Kertas 1 Tahun 4
No ratings yet
Ujian Bulan Mac: Bahasa Inggeris Kertas 1 Tahun 4
9 pages
5 - MEP - Fire Protection-Rev
100% (11)
5 - MEP - Fire Protection-Rev
64 pages
CET415 - M2 - Ktunotes - in
No ratings yet
CET415 - M2 - Ktunotes - in
78 pages
Wilm's Tumor
100% (2)
Wilm's Tumor
17 pages
ZD 180B Rescue Chopper
No ratings yet
ZD 180B Rescue Chopper
20 pages
Ultrasonic Test Set Manual 42A12D
No ratings yet
Ultrasonic Test Set Manual 42A12D
7 pages
Business Intelligence For Big Data Analytics
No ratings yet
Business Intelligence For Big Data Analytics
8 pages
EL Form
No ratings yet
EL Form
1 page
Creative Arts Grade 6 Curriculum Design - 240115 - 133144
0% (1)
Creative Arts Grade 6 Curriculum Design - 240115 - 133144
57 pages
AWS Case Study Sumit
No ratings yet
AWS Case Study Sumit
8 pages
Bimaks Water Treatment Catalog 1718370506
No ratings yet
Bimaks Water Treatment Catalog 1718370506
28 pages
FM Midterm Chapter4
No ratings yet
FM Midterm Chapter4
13 pages
Establishing A TSFP
100% (1)
Establishing A TSFP
2 pages
On-Premise - PCE SAC Financial Planning
100% (2)
On-Premise - PCE SAC Financial Planning
71 pages
The Art of Startup Fundraising 1st Edition Alejandro Cremades Newest Edition 2025
100% (4)
The Art of Startup Fundraising 1st Edition Alejandro Cremades Newest Edition 2025
155 pages
Jurnal Geologi Struktur
No ratings yet
Jurnal Geologi Struktur
29 pages
Yeast - WPS Office
No ratings yet
Yeast - WPS Office
4 pages

Question On Data Mining

Uploaded by

Question On Data Mining

Uploaded by

1. What is R?

2. List and define some basic data types in R.

3. List and define some basic data structures in R.

4. How to import data in R?

5. What is a package in R, and how do you install and load packages?

6. How do you add a new column to a data frame in R?

Using the $ symbol:

df$col_3 <- c(5, 1, 18, 16)

Using square brackets:

df["col_3"] <- c(5, 1, 18, 16)

Using the cbind() function:

df <- cbind(df, col_3=c(5, 1, 18, 16))

8. How to create a user-defined function in R?

9.What is Data Mining?

10. What are the different tasks of Data Mining?

11. What is Classification?

12. What is Prediction?

13. What is Decision tree?

14. Explain Bayesian classification in Data Mining?

15. What do you understand by the term Cluster Analysis?

16. What is regression in Data mining?

17. What is KMeans clustering?

18. What is supervised learning?

19. What is unsupervised learning?

You might also like