Step 1: Learn Programming with R or Python
Before you can learn and use statistical methods in data science, you should
be proficient in a programming language, preferably Python or R. They’re
both popular, have a large community of users and an ecosystem of libraries
for specialized tasks.
So which language should you choose: Python or R?
If you want to explore a career in both data and software development in
general, you can learn Python. If you want to double down on a more
statistics-first role, learning R can be helpful. But if you’re new to
programming in general, I recommend starting with Python.
What You Should Learn
When learning R, focus on the following:
Basic Syntax: Understand variables, data types, loops, and
conditionals.
Data Structures: Learn to work with built-in Python data
structures like lists, dictionaries, and tuples; Vectors and
data frames in R.
Libraries: Familiarize yourself with key libraries for data
science such as pandas, NumPy, SciPy, statsmodels, and
Seaborn for Python. If you’re using R, learn to work with
dplyr and ggplot2.
Please go through the Website I sent for Basic syntax only and
others
SYNOPSIS ON R Programming
1. Expressions and Commands:
R can be used interactively at the command prompt or through
scripts. Commands are executed line by line, and results are
displayed in the console.
2. Variables and Assignment:
Variables are used to store data. The assignment operator <- (or =) is
used to assign values to variables.
3. Data Types:
R supports various data types, including numeric (integers,
doubles), character (strings), logical (TRUE/FALSE), and complex.
4. Comments:
Comments are used to explain code and improve readability. They
are ignored by the R interpreter and begin with a # symbol.
5. Functions:
Functions are blocks of organized, reusable code that perform a
specific task. They take input arguments and produce output.
6. Case Sensitivity:
R is case-sensitive, meaning my_variable and My_Variable are treated as
different entities.
7. Object Naming Rules:
Variable and function names can contain letters, numbers, and
underscores, but cannot start with a number or contain special
characters like +, -, *, /. Avoid using reserved keywords.