Assignment 1

The document is an assignment for students at Presidency University, detailing various tasks involving the creation and manipulation of vectors and matrices using R programming. It includes exercises on statistical calculations, matrix operations, and data analysis with specific datasets. The assignment covers a wide range of topics, from basic vector creation to advanced statistical concepts and matrix algebra.

Uploaded by

Soumyadeep Majumdar

Presidency University

Assignment 1
October, 2024

1. Create the following vectors:

(a) (1,3,5,7,......19)
(b) (1,2,3,.....10,9,8,7,.....,1)
(c) (2,5,7,2,5,7,2,5,7,2,5,7,2,5,7)
(d) (2,2,2,5,5,5,7,7,7)
(e) $(2, 2^2/2, 2^3/3, 2^4/4, \ldots, 2^{15}/15)$
(f) $(0.1^2\,0.9^8, 0.1^3\,0.9^7, 0.1^4\,0.9^6, \ldots, 0.1^7\,0.9^3, 0.1^8\,0.9^2)$

2. Find the sum:

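(a) $1 + \frac{2}{3} + \frac{2}{3}\cdot\frac{4}{5} + \frac{2}{3}\cdot\frac{4}{5}\cdot\frac{6}{7} + \cdots + \frac{2}{3}\cdot\frac{4}{5}\cdots\frac{38}{39}$
(b) $\sum_{i=10}^{100} (i^2 + i)$
(c) $\sum_{i=0}^{100} \frac{2^i}{i!}$, and compare it with $e^2$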

3. Run the following chunk of R code:

set.seed(50)
x=runif(250)
y=rnorm(250)

Suppose $x = (x_1, x_2, \ldots, x_n)$ and $y = (y_1, y_2, \ldots, y_n)$. Then

(a) Create the vector $(y_2 - x_1, y_3 - x_2, \ldots, y_n - x_{n-1})$.
(b) Create the vector $\left(\frac{\sin(y_1)}{\cos(x_2)}, \frac{\sin(y_2)}{\cos(x_3)}, \ldots, \frac{\sin(y_{n-1})}{\cos(x_n)}\right)$.
(c) Find $\sum_{i=1}^{n-1} \frac{e^{-x_{i+1}}}{x_i + 10}$.
(d) Pick out the values in y which are > 1
(e) What are the index positions in y of the values which are > 1?
(f) What are the values in x which correspond to the values in y which
are > 1?
(g) How many values in x are more than 0.5?
(h) Sort the numbers in the vector x in the order of increasing values in
y.
(i) Pick out the elements in y at index positions 1, 4, 7, 10, 13,
and so on.
(j) Create the vector $(\sqrt{|x_1 - \bar{x}|}, \sqrt{|x_2 - \bar{x}|}, \ldots, \sqrt{|x_n - \bar{x}|})$ where $\bar{x}$ is
the mean of the $x_i$'s.
(k) Define $z_i = y_i I(|y_i| < 1)$. Find the mean and variance of z and y
and compare.
(l) How many x’s are more than y’s ?
(m) Round the vectors x and y to decimal places.
(n) Find the number of common elements in those rounded vectors.

In questions where you need to print long vectors as output, write only
the first few values as output in the answer. But the code should be for
generating the entire vector.
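
Several parts of this question rest on logical indexing. The following is only a minimal sketch on a toy vector `v` (hypothetical values, not the `x` and `y` generated above), showing how a comparison yields a logical vector that can select values, locate positions, count matches, or act as an indicator function. All variable names here are chosen for illustration.

```r
# Toy vector standing in for y (hypothetical values, not the assignment data)
v <- c(-2, 0.5, 3, -0.2, 1.7)

big      <- v > 1          # logical vector: TRUE where the condition holds
values   <- v[big]         # the values that satisfy the condition
position <- which(big)     # their index positions
count    <- sum(big)       # TRUE counts as 1, FALSE as 0

# Multiplying by a logical vector implements an indicator function:
# entries with |v| >= 1 become 0, the rest are kept unchanged.
w <- v * (abs(v) < 1)

print(values)
print(position)
print(count)
print(w)
```

The same pattern should carry over to the full vectors, since all of these operations are vectorized.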

4. Execute the following lines of R code:

set.seed(1)
x=sample(1:5,20,T)

(a) Convert x into a factor.

(b) Rename the levels of the factor as Brand1, Brand2, Brand3, Brand4,
Brand5.
(c) Execute summary() on the factor and explain the output.

5. Suppose
\[ A = \begin{pmatrix} 1 & 1 & 3 \\ 5 & 2 & 6 \\ -2 & -1 & -3 \end{pmatrix}. \]

(a) Check that $A^3 = 0$.

(b) Replace the third column of A by the sum of the second and third
columns.

6. Create the following matrix B with 15 rows:
\[ B = \begin{pmatrix} 10 & -10 & 10 \\ \vdots & \vdots & \vdots \\ 10 & -10 & 10 \end{pmatrix} \]

Calculate the matrix $B^T B$.

7. Create a 6 × 6 matrix named null with every entry equal to 0. Check what
the functions row and col return when applied to null. Hence create the
6 × 6 matrix:
\[ \begin{pmatrix}
0 & 1 & 0 & 0 & 0 & 0 \\
1 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 & 1 & 0 \\
0 & 0 & 0 & 1 & 0 & 1 \\
0 & 0 & 0 & 0 & 1 & 0
\end{pmatrix} \]

8. Explore the help for the function outer. Hence create the following
patterned matrix:
\[ \begin{pmatrix}
0 & 1 & 2 & 3 & 4 \\
1 & 2 & 3 & 4 & 5 \\
2 & 3 & 4 & 5 & 6 \\
3 & 4 & 5 & 6 & 7 \\
4 & 5 & 6 & 7 & 8
\end{pmatrix} \]
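
As a hedged illustration of how outer behaves (using a multiplication table and a power table rather than the matrix asked for above), here is a minimal sketch. The names `i`, `j`, `tab` and `pow` are chosen here purely for illustration.

```r
# outer(X, Y, FUN) applies FUN to every pair (X[i], Y[j]) and returns
# a length(X) x length(Y) matrix; FUN defaults to multiplication.
i <- 1:3
j <- 1:4

tab <- outer(i, j)        # tab[i, j] equals i * j
pow <- outer(i, j, "^")   # pow[i, j] equals i^j

print(tab)
print(pow)
```

Any vectorized binary function can be passed as the third argument, which is what makes outer convenient for patterned matrices.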

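9. Create the following patterned matrices. In each case, your solution should
make use of the special form of the matrix; this means that the solution
should easily generalize to creating a larger matrix with the same structure
and should not involve typing in all the entries in the matrix.

(a)
\[ \begin{pmatrix}
0 & 1 & 2 & 3 & 4 \\
1 & 2 & 3 & 4 & 0 \\
2 & 3 & 4 & 0 & 1 \\
3 & 4 & 0 & 1 & 2 \\
4 & 0 & 1 & 2 & 3
\end{pmatrix} \]

(b)
\[ \begin{pmatrix}
0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 \\
1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & 0 \\
\vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \vdots \\
8 & 9 & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 \\
9 & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8
\end{pmatrix} \]

(c)
\[ \begin{pmatrix}
0 & 8 & 7 & 6 & 5 & 4 & 3 & 2 & 1 \\
1 & 0 & 8 & 7 & 6 & 5 & 4 & 3 & 2 \\
2 & 1 & 0 & 8 & 7 & 6 & 5 & 4 & 3 \\
3 & 2 & 1 & 0 & 8 & 7 & 6 & 5 & 4 \\
4 & 3 & 2 & 1 & 0 & 8 & 7 & 6 & 5 \\
5 & 4 & 3 & 2 & 1 & 0 & 8 & 7 & 6 \\
6 & 5 & 4 & 3 & 2 & 1 & 0 & 8 & 7 \\
7 & 6 & 5 & 4 & 3 & 2 & 1 & 0 & 8 \\
8 & 7 & 6 & 5 & 4 & 3 & 2 & 1 & 0
\end{pmatrix} \]
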
10. If $A = (a - b) I_n + b\, 1_n 1_n'$, then $A_{n \times n} = ((a_{ij}))$ where
\[ a_{ij} = \begin{cases} a & \text{if } i = j, \\ b & \text{if } i \neq j. \end{cases} \]

(a) Using the above formula, construct a 5 × 5 matrix A with a = 5 and
b = 2 by defining the identity matrix and vectors.
(b) Find det(A) and trace(A).
(c) Find $A^{-1}$. Also extract the (5, 2)th element of $A^{-1}$ and its minor.
(d) Solve the following system of linear equations:

Ax = b

where b = (5, 2, 4, 1, 5)'.
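
The formula in question 10 can be sketched as follows. This is a minimal sketch with illustrative values n = 3, a = 4, b = 1 (deliberately not those of part (a)); the names `I_n` and `one_n` are assumptions made here. The determinant check relies on the standard closed form $(a - b)^{n-1}(a + (n-1)b)$ for matrices of this structure.

```r
n <- 3; a <- 4; b <- 1                  # illustrative values only

I_n   <- diag(n)                        # n x n identity matrix
one_n <- matrix(1, nrow = n, ncol = 1)  # column vector of n ones

# (a - b) on the diagonal from the identity part, plus b everywhere
# from the rank-one matrix of ones, gives a on the diagonal and b elsewhere.
A <- (a - b) * I_n + b * (one_n %*% t(one_n))

print(A)
print(det(A))                           # closed form: (a - b)^(n - 1) * (a + (n - 1) * b)
```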

11. The spectral decomposition theorem for real symmetric matrices states that:
for a real symmetric matrix A with eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_n$ and
corresponding orthonormal eigenvectors $x_1, x_2, \ldots, x_n$, we have
\[ A = \sum_{i=1}^{n} \lambda_i x_i x_i^T. \]

Using R, verify the spectral decomposition theorem for the following matrix:
\[ \begin{pmatrix} 1 & 2 & 3 \\ 2 & 4 & 5 \\ 3 & 5 & 2 \end{pmatrix} \]
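
The kind of check this question asks for can be sketched on a smaller matrix. The 2 × 2 matrix `S` below is a hypothetical example, not the matrix of the question; the sketch assumes that eigen() returns unit-length, mutually orthogonal eigenvectors for a symmetric input, which is its documented behaviour when symmetric = TRUE.

```r
S <- matrix(c(2, 1,
              1, 2), nrow = 2, byrow = TRUE)   # toy symmetric matrix

e <- eigen(S, symmetric = TRUE)

# Build each rank-one term lambda_i * x_i %*% t(x_i), then add them up.
terms <- lapply(seq_along(e$values), function(i) {
  v <- e$vectors[, i, drop = FALSE]   # i-th eigenvector as a column matrix
  e$values[i] * (v %*% t(v))
})
S_rebuilt <- Reduce(`+`, terms)

print(isTRUE(all.equal(S, S_rebuilt)))
```

Comparing with all.equal() rather than == allows for floating-point rounding in the eigen decomposition.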

12. Create a 6 × 10 matrix of random integers chosen from 1, 2, ..., 10 by
executing the following two lines of code:

set.seed(75)
Mat<-matrix( sample(10, size=60, replace=T), nr=6)

(a) Find the number of entries in each row which are greater than 4.
(b) Which rows contain exactly two occurrences of the number seven?
(c) Find those pairs of columns whose total (over both columns) is greater
than 75. The answer should be a matrix with two columns; so, for
example, the row (1, 2) in the output matrix means that the sum of
columns 1 and 2 in the original matrix is greater than 75.

13. Using the function outer, evaluate the following:

(a) $\sum_{i=1}^{20} \sum_{j=1}^{5} \frac{i^4}{3 + j}$
(b) $\sum_{i=1}^{20} \sum_{j=1}^{5} \frac{i^4}{3 + ij}$
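
A double sum of this kind can be sketched as "build the matrix of terms, then add every entry". The summand below, $i/j$ over $i = 1, \ldots, 3$ and $j = 1, 2$, is a hypothetical stand-in and not one of the sums above.

```r
# Each entry of the matrix is one term of the double sum:
# terms[i, j] equals i / j. The function passed to outer() must be vectorized.
terms <- outer(1:3, 1:2, function(i, j) i / j)

total <- sum(terms)   # adds over both indices at once
print(total)          # (1 + 2 + 3) * (1 + 1/2)
```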

14. (Big Computing) The need for large-scale computation is far more common
in contemporary applications than it was 10 years ago, and fortunately
we can manage most of it in R. Run the following piece of R code:

x= rexp(1100000)

(a) What is the length of this vector? Find the mean and variance of x.
(b) Find the mean of all of the entries in x which are strictly greater than
1.
(c) Plot a histogram of x.
(d) Add vertical lines each with different colors at the three quartiles.
Compare the relative positions of the quartiles and comment on the
skewness of the distribution.
(e) Create a matrix, X, containing the values in x, with 32 rows and
34375 columns.
(f) Calculate the mean of the 371st column of X.
(g) Now, find the means for all 34375 columns of X simultaneously.
(h) Also find the standard deviation for the first 100 columns
simultaneously.
(i) Use this matrix X as the input to the hist() function and save the
result to a variable of your choice. What does this new variable show?
(j) Now, find the means of all the columns of X simultaneously. Plot the
histogram of column means. Explain why its shape does not match
that of the last histogram.
(k) We want to find the eigenvalues and eigenvectors of $X^T X$. Use the
eigen() function to directly perform the eigen analysis of $X^T X$.
What result do you get?
(l) State and use a fact of matrix algebra which helps in computing the
eigenvectors of $X^T X$ using the same eigen() function but with some
extra steps in computation. Why are some of the eigenvalues of $X^T X$
zero?
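
One fact that may be relevant to part (l) is that $X^T X$ and $X X^T$ share the same nonzero eigenvalues, and that if $X X^T u = \lambda u$ with $\lambda > 0$, then $X^T u / \sqrt{\lambda}$ is a unit eigenvector of $X^T X$. The following is only a sketch on a small hypothetical 3 × 5 matrix `M`, not on the X of the question, and is one possible route rather than the intended answer.

```r
set.seed(1)
M <- matrix(rnorm(15), nrow = 3)     # toy 3 x 5 matrix (hypothetical data)

# Solve the small 3 x 3 problem instead of the 5 x 5 one.
small <- eigen(M %*% t(M), symmetric = TRUE)

# Map each eigenvector u of M M^T to t(M) u / sqrt(lambda).
V <- t(M) %*% small$vectors %*% diag(1 / sqrt(small$values))

# Check that the columns of V are eigenvectors of t(M) %*% M
# with the same eigenvalues: (M^T M) V should equal V Lambda.
big   <- t(M) %*% M
resid <- max(abs(big %*% V - V %*% diag(small$values)))
print(resid)
```

Since the rank of a 3 × 5 matrix is at most 3, the 5 × 5 product can have at most 3 nonzero eigenvalues; the remaining ones are zero up to rounding.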

15. The Cars93 data frame in the MASS package contains data on 93 makes
of car sold in the USA.

(a) What are the names of the variables in the data frame?
(b) What are the types of the variables?
(c) The variable Type classifies the type of market the car is aimed at.
In each type, find the cheapest car and the car with the greatest fuel
efficiency.
(d) Also, for each type, compute the mean horsepower and the difference
between each car's horsepower and the mean horsepower for its type.
(e) Create two new data frames for US and non-US cars.
(f) Use write.table() to save the US car data to a file. Read it in and
check if all the factors are correctly set as factors.
(g) Use save() to save the non-US car data to a file.
(h) Search help to learn how to remove existing objects in R. Remove the
non-US data frame and load the non-US car data file using load().
Now check if all the factors are still set.

16. The data set at housing.csv contains information about the housing stock
of California and Pennsylvania, as of 2011. Information is aggregated into
"Census tracts", geographic regions of a few thousand people which are
supposed to be fairly homogeneous economically and socially.

(a) (Import and scrutiny)

i. Load the data in R into a data frame called housing.
ii. What is the dimension of the dataset?
iii. Run this command, and explain, in words, what this does:

colSums(apply(housing,c(1,2),is.na))

iv. The function na.omit() omits any row containing an NA value.
Use it to eliminate rows with incomplete data. How many rows
did this eliminate? Is your answer compatible with the previous
one? Explain.
(b) The vacancy rate is the fraction of housing units which are not
occupied. The data frame contains columns giving the total number of
housing units for each Census tract, and the number of vacant
housing units. Add a new column to the data frame which contains the
vacancy rate. What are the minimum, maximum, mean, and median
vacancy rates?
(c) The column COUNTYFP contains a numerical code for counties
within each state. We are interested in Alameda County (county 1
in California), Santa Clara (county 85 in California), and Allegheny
County (county 3 in Pennsylvania).
i. What were the average percentages of housing built in these
counties since 2005?

ii. Calculate the median of house value for these counties.
iii. What is the correlation between median house value and the
percent of housing built since 2005 in
A. the whole data,
B. all of California,
C. all of Pennsylvania,
D. Alameda County,
E. Santa Clara County and
F. Allegheny County?
(d) i. The variable Built_2005_or_later indicates the percentage
of houses in each Census tract built since 2005. Plot median
house prices against this variable. Change the point size from
the default value.
ii. Make a new plot, or pair of plots, which breaks this out by state.
Note that the state is recorded in the STATEFP variable, with
California being state 6 and Pennsylvania state 42.
(e) The vacancy rate is the fraction of housing units which are not
occupied. The data frame contains columns giving the total number
of housing units for each Census tract, and the number of vacant
housing units.
i. Plot the vacancy rate against median house value.
ii. Plot vacancy rate against median house value separately for
California and for Pennsylvania. Is there a difference?
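
The command in part (a)iii and the effect of na.omit() in part (a)iv can be explored on a small example first. The data frame `toy` below is hypothetical and unrelated to housing.csv; it is only a sketch of how the two counts can differ.

```r
toy <- data.frame(a = c(1, NA, 3),
                  b = c(NA, NA, 6),
                  c = 7:9)            # hypothetical data with 3 missing cells

# apply() over both margins calls is.na() on every cell, giving a
# logical matrix of the same shape; colSums() then counts TRUEs per column.
na_matrix     <- apply(toy, c(1, 2), is.na)
na_per_column <- colSums(na_matrix)
print(na_per_column)

# na.omit() drops whole rows, so a row with two NAs is removed only once.
complete     <- na.omit(toy)
rows_dropped <- nrow(toy) - nrow(complete)
print(rows_dropped)
```

Here three cells are missing but only two rows are dropped, because one row contains two of the missing values.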

17. Consider the following data on the severity of a crash tabulated for the
cases where the passenger had a seat belt or not:

                        Injury
Seat belt    None    Minimal    Minor    Major
Yes         12813        647      359       42
No          65963       4000     2642      303

(a) Create an appropriate barplot showing the differences between those
who had seat belts and those who did not.
(b) Use identify() to interactively insert a legend.

18. (Finding the solution)

(a) Create a sequence of x values of length 100 from -1 to 1.
(b) Use plot() to draw the curve of $y = e^x$ between -1 and 1.
(c) Add the curve of y = sin(x) over the same domain on the previous
plot.
(d) The function text() is used to add some text anywhere in the
plotting area. Use this function to label the two curves appropriately.
The function expression() can be used to insert mathematical
expressions as text. Use this to label the $y = e^x$ curve.
(e) The function arrows() is used to add arrows to an existing plot.
Use arrows() and text() to locate the solution of $\sin(x) = e^x$. You
can take the help of the locator() function, which is used to get the
coordinates of any point in a plot interactively.

19. Run the following piece of R code:

n = 50
set.seed(0)
x = runif(n, min=-1, max=1)
y = x^3 + rnorm(n)

(a) Produce a scatterplot of x and y.

(b) Add the curve $y = x^3$ to the plot and have the curve be drawn in red
with twice the normal thickness.
(c) Add a straight horizontal line at 0 to the plot and have the line be
dashed.
(d) Define two new variables as

upper=x^3+qnorm(0.90)
lower=x^3-qnorm(0.90)

Add two new lines passing through the upper and lower points. These
lines are like the confidence intervals.
(e) Shade the area between the upper and lower bounds in gray.
[Hint: Use polygon(); this function requires that the x coordinates
of the polygon be passed in an appropriate order. You might find
it useful to use c(x, rev(x)) for the x coordinates, but you need to
explain this command if you use it.]
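
The ordering issue mentioned in the hint can be sketched on a different curve. Everything below is hypothetical (the vector `u`, the band `us^2 ± 0.5`, the variable names); it only illustrates that polygon() joins the points in the order given, so the outline should run left to right along one edge and right to left back along the other.

```r
pdf(NULL)                              # null device; drop this line to see the plot

u  <- c(0.4, -0.8, 0.1, 0.9, -0.3)     # hypothetical unsorted x values
us <- u[order(u)]                      # sort first, otherwise the outline zig-zags
lo <- us^2 - 0.5                       # lower edge of the band
hi <- us^2 + 0.5                       # upper edge of the band

plot(us, us^2, type = "l", ylim = range(lo, hi))

# Walk along the lower edge left to right, then back along the
# upper edge right to left, so the outline closes without crossing.
px <- c(us, rev(us))
py <- c(lo, rev(hi))
polygon(px, py, col = "gray", border = NA)
lines(us, us^2)                        # redraw the curve on top of the shading

invisible(dev.off())
```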
