Simplilearn
Program: Caltech Post Graduate Program in AI and
Machine Learning
Capstone
1. EdTech
Simplilearn would like to assess the quality of
E-Learning videos freely available on YouTube. This
would give them ideas on preparing video content
that is more engaging for students. They have
chosen handpicked playlists corresponding to
various Computer Science subjects from an NPTEL
channel as a pilot study. Videos will be assessed on
various fronts, such as instructor presence in the
video, body language, use of blackboard, and use of
slides.
2. Healthcare
ICMR wants to analyze different types of cancer,
such as breast cancer, renal cancer, colon cancer,
lung cancer, and prostate cancer, which have
become a cause of worry in recent years. The input
dataset contains 802 samples for the corresponding
802 people who have been detected with different
types of cancer. Each sample contains expression
values of more than 20K genes. Each sample has
one of the following tumor types: BRCA, KIRC,
COAD, LUAD, or PRAD.
3. Cyber Security
Book-My-Show will enable ads on their website, but
they are also very cautious about their user privacy
and information about who visits their website. Some
ad URLs could contain a malicious link that can trick
a recipient and lead to malware installation, freezing
the system as part of a ransomware attack, or
revealing sensitive information.
Course End Projects
1. Feature Engineering
While searching for the dream house, the buyer
looks at various factors, not just at the height of the
basement ceiling or the proximity to an east-west
railroad.
Using the dataset, find the factors that influence
price negotiations while buying a house.
There are 79 explanatory variables describing every
aspect of residential homes in Ames, Iowa.
2. Customer Service Requests Analysis
You've been asked to perform data analysis of
service request (311) calls from New York City.
You've also been asked to utilize data wrangling
techniques to understand the pattern in the data and
visualize the major types of complaints.
3. Mercedes-Benz Greener Manufacturing
Since the first automobile, the Benz Patent Motor
Car in 1886, Mercedes-Benz has stood for important
automotive innovations. These include the
passenger safety cell with a crumple zone, the
airbag, and intelligent assistance systems.
Mercedes-Benz applies for nearly 2000 patents per
year, making the brand the European leader among
premium carmakers. Mercedes-Benz is the leader in
the premium car industry. With a huge selection of
features and options, customers can choose the
customized Mercedes-Benz of their dreams.
To ensure the safety and reliability of every unique
car configuration before they hit the road, the
company’s engineers have developed a robust
testing system. As one of the world’s biggest
manufacturers of premium cars, safety and
efficiency are paramount on Mercedes-Benz’s
production lines. However, optimizing the speed of
their testing system for many possible feature
combinations is complex and time-consuming
without a powerful algorithmic approach.
You are required to reduce the time that cars spend
on the test bench. You will work with a dataset
representing different permutations of features in a
Mercedes-Benz car to predict the time it takes to
pass testing. Optimal algorithms will contribute to
faster testing, resulting in lower carbon dioxide
emissions without reducing Mercedes-Benz’s
standards.
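The feature permutations mentioned above are categorical, so they usually need a numeric encoding before a regression model can use them. The following is a minimal sketch of one-hot encoding in plain Python. The feature names (`paint`, `gearbox`) and their values are hypothetical and are not taken from the actual dataset.

```python
def fit_one_hot(rows):
    """Collect the sorted (feature, value) pairs seen in the given rows."""
    return sorted({(name, value) for row in rows for name, value in row.items()})


def one_hot(row, columns):
    """Encode one configuration as a 0/1 vector aligned with `columns`."""
    return [1 if row.get(name) == value else 0 for name, value in columns]


# Hypothetical car configurations; the real dataset uses its own feature names.
configs = [
    {"paint": "red", "gearbox": "manual"},
    {"paint": "blue", "gearbox": "automatic"},
    {"paint": "red", "gearbox": "automatic"},
]

columns = fit_one_hot(configs)
encoded = [one_hot(row, columns) for row in configs]
```

Each encoded row can then serve as the input vector for whichever regression model is chosen to predict the testing time.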
4. Income Qualification
Many social programs have a hard time ensuring
that the right people are given enough aid. It’s tricky
when a program focuses on the poorest segment of
the population. This segment of the population can’t
provide the necessary income and expense records
to prove that they qualify.
In Latin America, a popular method called Proxy
Means Test (PMT) uses an algorithm to verify
income qualification. With PMT, agencies use a
model that considers a family’s observable
household attributes like the material of their walls
and ceiling or the assets found in their homes to
classify them and predict their level of need.
While this is an improvement, accuracy remains a
problem as the region’s population grows and
poverty declines.
The Inter-American Development Bank
(IDB) believes that new methods beyond traditional
econometrics, based on a dataset of Costa Rican
household characteristics, might help improve
PMT’s performance.
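To illustrate the idea behind a proxy means test, the sketch below scores a household from observable attributes and maps the score to a need level. The attribute names, weights, and thresholds are all hypothetical and chosen only for illustration. A real PMT model would estimate them from survey data.

```python
# Hypothetical weights: larger values indicate better living conditions.
WEIGHTS = {
    "wall_material_brick": 2.0,
    "has_ceiling": 1.0,
    "owns_refrigerator": 1.5,
    "owns_computer": 2.5,
}

# Hypothetical cut-offs, checked from the lowest score upward.
THRESHOLDS = [(2.0, "extreme need"), (4.0, "moderate need")]


def proxy_score(household):
    """Sum the weights of the attributes that the household has."""
    return sum(weight for name, weight in WEIGHTS.items() if household.get(name))


def need_level(household):
    """Map the proxy score to a need category."""
    score = proxy_score(household)
    for cutoff, label in THRESHOLDS:
        if score < cutoff:
            return label
    return "lower need"
```

For example, `need_level({"has_ceiling": True})` falls below the first cut-off, while a household with every listed attribute scores above both.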
5. Healthcare
Cardiovascular diseases are the leading cause of
death globally. It is therefore necessary to identify
the causes and develop a system to predict heart
attacks in an effective manner. The project dataset
contains information about the factors that might
affect cardiovascular health.
6. Book Rental Recommendation
Book Rent is the largest online and offline book
rental chain in India. They provide books of various
genres, such as thrillers, mysteries, romances, and
science fiction. The company charges a fixed rental
fee for a book per month. Lately, the company has
been losing its user base. The main reason for this
is that users are not able to choose the right books
for themselves. The company wants to solve this
problem and increase its revenue and profit.
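One common way to help users choose books is collaborative filtering. The sketch below is a minimal user-based variant with cosine similarity, assuming ratings on a 1 to 5 scale. The user names, book titles, and ratings are hypothetical, and the project itself may call for a different technique.

```python
from math import sqrt

# Hypothetical ratings: user -> {book title: rating on a 1-5 scale}.
ratings = {
    "asha": {"Thriller A": 5, "Mystery B": 4, "Romance C": 1},
    "ben": {"Thriller A": 4, "Mystery B": 5, "Sci-Fi D": 5},
    "chen": {"Romance C": 5, "Sci-Fi D": 2},
}


def cosine(a, b):
    """Cosine similarity between two sparse rating dictionaries."""
    shared = set(a) & set(b)
    dot = sum(a[title] * b[title] for title in shared)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(user, ratings, top_n=1):
    """Rank books the user has not rated by similarity-weighted ratings."""
    scores = {}
    for other, their_ratings in ratings.items():
        if other == user:
            continue
        similarity = cosine(ratings[user], their_ratings)
        for title, rating in their_ratings.items():
            if title not in ratings[user]:
                scores[title] = scores.get(title, 0.0) + similarity * rating
    return sorted(scores, key=scores.get, reverse=True)[:top_n]
```

Calling `recommend("asha", ratings)` returns the one book that user has not yet rated, ranked by how similar the other users' tastes are.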
7. Lending Club Loan Data Analysis
For companies like Lending Club, correctly predicting
whether a loan will default is very
important. In this project, using the historical data
from 2007 to 2015, you have to build a deep
learning model to predict the chance of default for
future loans. As you will see later, this dataset is
highly imbalanced and includes a lot of features that
make this problem more challenging.
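One common way to handle an imbalanced dataset is to weight each class inversely to its frequency during training. The sketch below uses the heuristic n_samples / (n_classes * class_count). The 90/10 label split is a hypothetical illustration, not the actual class ratio in the loan data.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical labels: 0 = repaid, 1 = default (90/10 split for illustration).
labels = [0] * 90 + [1] * 10
weights = balanced_class_weights(labels)
```

A dictionary like `weights` can be passed as the `class_weight` argument of Keras `Model.fit`, so that errors on the rare default class count more in the loss.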
8. House Loan Data Analysis
For a safe and secure lending experience, it's
important to analyze past data. In this project,
you have to build a deep learning model to predict
the chance of default for future loans using the
historical data. As you will see, this dataset is highly
imbalanced and includes a lot of features that make
this problem more challenging.
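On a highly imbalanced dataset, plain accuracy can look good even when a model misses every default, so precision and recall on the minority class are usually more informative. The following is a minimal sketch in plain Python. The 95/5 label split and the always-negative predictions are hypothetical.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall, and F1 for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical data: 5% defaults, and a model that always predicts "no default".
y_true = [0] * 95 + [1] * 5
y_pred = [0] * 100

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
metrics = precision_recall_f1(y_true, y_pred)
```

Here the accuracy is high while precision, recall, and F1 for the default class are all zero, which is why accuracy alone is a poor guide on this kind of data.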
9. Pet Classification Model Using CNN
Build a CNN model that classifies the given pet
images correctly into dog and cat images.
The project scope document specifies the
requirements for the project “Pet Classification
Model Using CNN.” Apart from specifying the
functional and non-functional requirements for the
project, it also serves as an input for project
scoping.
10. Perform Facial Recognition with Deep
Learning in Keras Using CNN
Facial recognition is a biometric alternative that
measures unique characteristics of a human
face. Applications available today include flight
check-in, tagging friends and family members in
photos, and “tailored” advertising. You are a
computer vision engineer who needs to develop a
face recognition program with deep convolutional
neural networks.
11. Train and Deploy a CNN Model Using
TensorFlow Serving
You’re a Computer Vision Engineer at health.ai.
Your company is developing a deep learning
application to automate the detection of diabetic
retinopathy. The company is sourcing
high-resolution retina image data from various
clinical partners, but the dataset is expected to be
huge and cannot be stored on a central system.
You’re asked to build a proof of concept using the
Kaggle retinopathy dataset to train a CNN model
with the Mirrored Strategy and deploy it with
TensorFlow Serving.
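Once a model is deployed, TensorFlow Serving exposes a REST endpoint of the form `/v1/models/<name>:predict` that accepts a JSON body with an `instances` list. The sketch below only builds such a request with the standard library and does not send it. The model name `retinopathy_cnn`, the host, and the tiny placeholder image are assumptions for illustration. Port 8501 is the usual REST port but depends on how the server is started.

```python
import json
import urllib.request


def build_predict_request(instances, model_name, host="localhost", port=8501):
    """Build (without sending) a TensorFlow Serving REST predict request."""
    url = f"http://{host}:{port}/v1/models/{model_name}:predict"
    body = json.dumps({"instances": instances}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical 2x2 single-channel "image" standing in for a retina photo.
image = [[[0.0], [0.5]], [[1.0], [0.25]]]
request = build_predict_request([image], "retinopathy_cnn")
```

Passing `request` to `urllib.request.urlopen` against a running server would return a JSON response with a `predictions` field, one entry per submitted instance.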
12. Emotion Recognition
Future customizations built on understanding
human emotions could lead to a range of
advancements, such as determining whether a
person likes a specific statement, item, product, or
food, or how they are feeling in a particular
circumstance.
13. Detection of Lung Infection
Artificial Intelligence has evolved a lot and is
currently able to solve problems that are very
complex and require human specialization. One
such area is healthcare.
A lot of research happens every day on using deep
learning for the betterment of humanity, and
healthcare is one such application.