KEMBAR78
Statistics Lecture Note 'Cont | PDF
0% found this document useful (0 votes)
11 views2 pages

Statistics Lecture Note 'Cont

To learn data science, it's essential to be proficient in a programming language, with Python recommended for beginners and R for a statistics-focused approach. Key learning areas include basic syntax, data structures, and libraries specific to each language. The document also outlines fundamental concepts in R programming, such as expressions, variables, data types, comments, functions, case sensitivity, and naming rules.

Uploaded by

daloziee21
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views2 pages

Statistics Lecture Note 'Cont

To learn data science, it's essential to be proficient in a programming language, with Python recommended for beginners and R for a statistics-focused approach. Key learning areas include basic syntax, data structures, and libraries specific to each language. The document also outlines fundamental concepts in R programming, such as expressions, variables, data types, comments, functions, case sensitivity, and naming rules.

Uploaded by

daloziee21
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Step 1: Learn Programming with R or Python

Before you can learn and use statistical methods in data science, you should
be proficient in a programming language, preferably Python or R. They’re
both popular, have a large community of users and an ecosystem of libraries
for specialized tasks.

So which language should you choose: Python or R?

If you want to explore a career in both data and software development in


general, you can learn Python. If you want to double down on a more
statistics-first role, learning R can be helpful. But if you’re new to
programming in general, I recommend starting with Python.

What You Should Learn

When learning R, focus on the following:

 Basic Syntax: Understand variables, data types, loops, and


conditionals.
 Data Structures: Learn to work with built-in Python data
structures like lists, dictionaries, and tuples; Vectors and
data frames in R.
 Libraries: Familiarize yourself with key libraries for data
science such as pandas, NumPy, SciPy, statsmodels, and
Seaborn for Python. If you’re using R, learn to work with
dplyr and ggplot2.

Please go through the Website I sent for Basic syntax only and
others

SYNOPSIS ON R Programming

1. Expressions and Commands:


R can be used interactively at the command prompt or through
scripts. Commands are executed line by line, and results are
displayed in the console.

2. Variables and Assignment:


Variables are used to store data. The assignment operator <- (or =) is
used to assign values to variables.

3. Data Types:
R supports various data types, including numeric (integers,
doubles), character (strings), logical (TRUE/FALSE), and complex.

4. Comments:
Comments are used to explain code and improve readability. They
are ignored by the R interpreter and begin with a # symbol.

5. Functions:
Functions are blocks of organized, reusable code that perform a
specific task. They take input arguments and produce output.

6. Case Sensitivity:
R is case-sensitive, meaning my_variable and My_Variable are treated as
different entities.

7. Object Naming Rules:


Variable and function names can contain letters, numbers, and
underscores, but cannot start with a number or contain special
characters like +, -, *, /. Avoid using reserved keywords.

You might also like