KEMBAR78
Byte Academy: Data Science | PDF | Artificial Neural Network | Apache Hadoop
0% found this document useful (0 votes)
110 views11 pages

Byte Academy: Data Science

Byte Academy offers a 12-week Data Science course that covers Python fundamentals, statistics, data wrangling, visualization, machine learning algorithms, natural language processing, and deep learning. The course includes specializations in MedTech, FinTech, and social sciences. It utilizes tools like Python, NumPy, Pandas, scikit-learn, and TensorFlow. Students learn through projects, exercises, and examining real-world problems.

Uploaded by

Chris Morris
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
110 views11 pages

Byte Academy: Data Science

Byte Academy offers a 12-week Data Science course that covers Python fundamentals, statistics, data wrangling, visualization, machine learning algorithms, natural language processing, and deep learning. The course includes specializations in MedTech, FinTech, and social sciences. It utilizes tools like Python, NumPy, Pandas, scikit-learn, and TensorFlow. Students learn through projects, exercises, and examining real-world problems.

Uploaded by

Chris Morris
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

Byte Academy

Data Science

06/30/2017
Introduction

Byte Academy pioneered industry-focused programs beginning with the


launch of our FinTech course, the first of its type. Our educational programs
bridge the gap between “business” and technology. Offerings include
Python Fullstack Software Development, Data Science, FinTech,
Quant-Algos, Product Management and Blockchain.

We work with companies to develop and adapt our curriculum to teach


skill-sets that match real hiring needs. We also utilize the expertise of
leading business executives, industry professionals, and those passionate
about education and software development to craft each course. Many of
these individuals teach at Byte.

Our courses are open to individuals spanning from beginner to more


advanced programming backgrounds. Some may have already had industry
expertise or work experience while others are brand new to the domain that
they want to study. We provide scholarships to women and offer a tuition
deferral program in which eligible students pay us back after they get a job.

Course formats include onsite full-time, part-time and remote in addition


to customized corporate training.

For more information please see www.byteacademy.co

2
Data Science - Specialties

Our data science curriculum is designed with our students in mind. For those
interested in a specific domain of tech, we offer three specialities which
students can also declare during phase 2 of the program

Code to save lives. Recent years have brought significant


MedTech changes in the healthcare industry especially as health and
medical technology has developed. Areas of medical
technology include: Biomedical databases. Bioinformatics,
including statistics, mathematics and other data sciences.
Medical devices. Genomics.

FinTech Code for interesting problems. FinTech is a rapidly growing


industry that’s requiring more engineers to fit the industry’s
needs. In response to this transition of talent, we train
individuals how to program for financial domain. Students
graduate with the ability to analyze financial data.

Areas of FinTech Include: Peer to Peer Lending,


Cryptocurrency, Mobile Payments, Analytics, Developer APIs,
Cybersecurity, Cloud Security.

Social Code to make society better. This specialty explores


Sciences important questions about how data science will transform
society as a whole. The computational social sciences
generate interdisciplinary projects that utilize data science
tools to tackle policy problems.

Computational social sciences raise questions about the


politics and ethics of data science research, particularly when
it focuses on socio-political problems with applications in
government and the private sector.
3
Week 1 Week 1

Python Fundamentals Github Workflow & Integration


Data Types and Operators
Control Flow
Week 1 begins with
acquiring an in-depth Functions
knowledge of the Python Lambda Functions
programming language. By Decorator Functions
the end of the week,
students will be expected Object Oriented Programming
Classes & Inheritance
to program intermediate
level scripts in Python. Modules
Input/Output
Tools Utilized:
Exercises & Examination
Python

Week 2
Week 2
Statistics & Linear Algebra
Descriptive Statistics
Distributions & Histograms
Week 2 is dedicated to
creating a deep Cumulative Distribution Functions
understanding of Skewness
mathematical concepts
we’ll later see in topics like Conditional Probability
Bayes Theorem
machine learning and
Estimation
statistical analysis.
Contrary to the traditional Hypothesis Testing
mathematics course, Correlation
students will learn
statistics and linear algebra Vectors & Matrices
Matrix Operations
through a computational
lens.

Tools Utilized:

Numpy, SciPy
4
Week 3
Data Wrangling Week 3

Week 3 begins the true CSV, json, and zip files


start of industry focused APIs
tools, such as scrapy,
pandas, beautifulsoup, and HTML & CSS basics
Web Scraping
more. We’ll learn the
different tools in which we Data Frames
can acquire, prepare, clean, Data Merging
and manipulate data to fit a Data Normalization
problem’s needs.
Missing Values
Outlier Detection
Tools Utilized:
Exercises & Examination
Json, BeautifulSoup,
Scrapy, Requests, Pandas

Week 4 Week 4
Line and Scatter Plots
Data Visualization & Histograms
Exploratory Analysis Visualization Customization

On week 4 we’ll begin Seaborn & ggplot


Exploratory Computing
curriculum focused on
Box Whisker Plots
various data visualization Heatmaps
techniques and how they
can help us engage and Factorplots & FacetGrids
learn from our data. Advanced Graphs

Tools Utilized: Geospatial Visualizations


Map Visualizations
Matplotlib, Seaborn,
Ggplot, Geojsonio, Shapely, Exercises & Examination
Descartes

5
Week 5
Week 5 - Regression Analysis
Regression Analysis
Intro to Machine Learning
Types of Learning & Data
Week 5 begins the official
start of the statistical Maximum Likelihood
analysis and prediction Linear Regression
portion of this course. We’ll Multiple Linear Regression
spend week 5 engaging
Non-Linear Regression
with the basics of machine
Logistic Regression
learning and work our way
towards learning and Time Series Analysis
implementing several Stepwise Regression
regression models.
Ridge & Lasso Regression
Exercises & Examination

Tools Utilized:

scikit-learn

6
Weeks 6-7

Review of Bayes
Week 6-7 Naive Bayes & Joint Models

Machine Learning Classification


Support Vector Machines
Working off of week 5’s Mini Project Day with Kaggle
curriculums, we’ll enter
the sphere of machine K-Means Clustering
K Nearest Neighbors
learning algorithms,
including topics in both ROC Curves
supervised and Cross Validation
unsupervised learning.
These two weeks will be Decision Trees
Random Forests
heavily project and
exercise based, but with Optimization
the mathematical Regularization
implications of this work
heavily in mind. Project Day

Tools Utilized: Principle Component Analysis


Dimension Reduction
scikit-learn, caret
Boosting & Bagging
Examination

7
Weeks 8-9

Regular Expressions
Week 8-9 Components of Speech
Text Normalization
Natural Language Word Tagging
Processing & Deep
Sentiment Analysis
Learning Information Extraction
Named Entity Extraction
Once again expanding on
the knowledge gained from Topic Modeling
weeks 5-7, we will enter the Summarization
realm of machine learning
Neural Networks
involving textual analyses
BackProp & Gradient Descent
and artificial neural
networks. Because the two Mini Project
can be complementary, we
will also engage with topics Feedforward Neural Networks
like word2vec and more. Recurrent Neural Networks

Autoencoders & Embeddings


Vector Space Models
Word2Vec
Tools Utilized:
Mini Project
re & nltk, gensim,
Tensorflow, Theano, Keras Convolution Neural Networks
LSTM Networks

GPUs & Hardware


Examination

8
Week 10
Week 10 - Databases
Databases
Intro to Databases
Week 10 dives into SQL Basics
database systems headfirst. Database Modeling
During this week,
Advanced SQL
becoming fluent with SQL,
Database Design
NoSQL, and MySQL is a
must that will carry over NoSQL
into the rest of this course. MongoDB

Tools Utilized: MySQL


PostgreSQL
SQLite, mongoDB,
PostgreSQL Exercises & Examination

Week 11
Week 11 - Big Data
Big Data
MapReduce
With the consistent Hadoop
growing of data every day, Spark
engineers are forced to Hadoop Ecosystem
become equipped to
handle, prepare, and Kafka
process this data in a Storm
computationally efficient
Amazon Web Services
manner. This week reviews Cloud Computing
the different big data
architecture tools available Project Day
in the data science industry
today.

Tools Utilized:

AWS, hadoop, docker,


Kafka, Spark, Storm
9
Week 12-14 We emphasize projects and pair-programming so
that you graduate with a portfolio to showcase to
potential employers, as well as to prove to them
that you can develop in a team environment similar
Final Project to the real world.

Students are required to use at least three of the


technologies we used throughout the course.
Projects for this program last about three weeks
that will ultimately result in a demo for others to see.

As part of the final project, students will be guided


by instructors at Byte Academy, as well as working
professionals in the field. We believe this heavy
mentorship will provide students with the proper
support for success, while simultaneously ramping
the skillset developed throughout the fourteen
weeks of our course.

10
More information?

info@byteacademy.co

www.byteacademy.co
Licensed by NY State Dept of Education

11

You might also like