A REPORT
On
INTERNSHIP
Name of the Student: Gayathri Karri
Name of the College: Vignan’s Institute of Information Technology (A)
Registration Number: 21L31A0598
Period of Internship: 25-04-2024 – 25-05-2024
Year: III B.Tech
Name and Address of the Intern Organization: CodSoft, Kolkata West
Bengal
An Internship Report
on
DATA SCIENCE
Submitted in partial fulfilment of the requirements for the award of the Summer Internship of
Bachelor of Technology
In
Department
COMPUTER SCIENCE & ENGINEERING
By
Name of Student
(Roll No. 21L31A0598)
Under the Faculty Guidance of
Faculty
Designation
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
VIGNAN’S INSTITUTE OF INFORMATION TECHNOLOGY (A)
DUVVADA, VISAKHAPATNAM
JUNE, 2024
Vignan’s Institute of Information Technology (A)
Department of Computer Science and Engineering
Student Declaration
I, Gayathri Karri, am 3rd year student of Bachelor of Technology with Reg. No.:
21L31A0598 in the department of Computer Science and Engineering, Vignan’s Institute of
Information Technology (A), Duvvada, Visakhapatnam. I hereby declare that the presented
report of the internship titled " Data Science” is uniquely prepared by me after successful
completion of a summer internship from 25th may 2024 to 25th June 2024 in “CodSoft” under
the faculty guidance of ________________________________, Assistant Professor,
department of Electronics and communication Engineering, Vignan’s Institute of Information
Technology (A), Duvvada, Visakhapatnam, during the academic year 2023-2024.
I also confirm that the report is only prepared for my academic requirement, not for
any other purpose. It might not be used in the interest of the opposite party of the corporation.
-----------------------------
Name of Student: Gayathri Karri
Regd. No.: 21L31A0598
Department of Computer Science and Engineering
CERTIFICATE
This is to certify that Gayathri Karri bearing Regd. No 21L31A0598 has completed the
internship at CodSoft on Data Science under the faculty guidance of
_____________________________, Assistant Professor. He is also submitted the internship
report to the department during the academic year 2023-2024, in partial fulfilment of the
requirements for the award of the Summer Internship of Bachelor of Technology in the
department of Computer Science and Engineering, Vignan’s Institute of Information
Technology (A), Duvvada, Visakhapatnam.
-------------------------------------- --------------------------------------
Faculty Supervisor Dept. Internship coordinator
Name: Name:
-------------------------------------- --------------------------------------
Head of the Department Head, Internships
Name: Name:
ABSTRACT
This report outlines the comprehensive learning experience and professional development
gained during a data science internship at CodSoft Organization. Over the course of the
internship, a blend of theoretical knowledge and practical skills were cultivated through
hands-on projects, collaborative tasks, and mentorship. Key responsibilities included data
cleaning, analysis, visualization, and the application of machine learning algorithms to
real-world datasets. The internship provided exposure to advanced tools and technologies
such as Python, R, SQL, and various data visualization libraries. Additionally, the
collaborative environment fostered effective communication skills and the ability to work
within a team setting. The experience was instrumental in bridging the gap between
academic knowledge and industry practices, significantly enhancing both technical
competencies and professional acumen. This report provides a detailed account of the
tasks undertaken, the skills developed, and the overall impact of the internship on my
career trajectory in data science.
Gayathri Karri
Name: GAYATHRI KARRI
Register No.: 21L31A0598
Semester: 7
Branch: CSE
Section : 2
Date:
Acknowledgements
My heartful thanks to our internship mentor _____________________, who took the
responsibility to monitor all my daily attendance and Weekly report patiently.
My heartful thanks to _____________________, Internship Incharge, CEO at
_____________________, who guided me by taking class and let me carefully visit the
practical sessions.
My heartful thanks to _____________________, Associate Professor Department Internship
Coordinator, Vignan’s Institute of Information Technology who helped me in every aspect of
gathering information about internship and guide me every day on proper submission of
reports.
My heartful thanks to _____________________, Associate Professor, Head of the Department
Electronics and Communication engineering department, for providing me with all the
Information and advising about different companies and analyzing them in the better way.
My best regard to _____________________, Professor, Head interns, Dean of T&P Cell for
providing me this internship opportunity towards better placements in different companies.
My special thanks to our Principal _____________________, Professor for following me to
participate in the summer internship programme on behalf of our college to gain industrial
knowledge and experience.
Contents
Sl.No Content PageNo
1 Title Page 1
2 Certification 2
3 Certificate From Intern Organization 3
4 Acknowledgment 4
5 Abstract 5
6 Contents 6
7 Activity Log for Week-1 7
8 Weekly Report for Week-1 7
9 Activity Log for Week-2 8
10 Weekly Report for Week-2 8-9
11 Activity Log for Week-3 9
12 Weekly Report for Week-3 10-11
13 Activity Log for Week-4 11
14 Weekly Report for Week-4 12
15 Output Photos for tasks given 16-21
16 Chapter-1: Executive Summary 22
17 Chapter-2: Overview of the Organization 23
18 Chapter-3: Internship Part 24
19 Chapter-4: Future Scope 25
20 References 26
CHAPTER 1: EXECUTIVE SUMMARY
This executive summary encapsulates the key experiences and accomplishments during my
one-month online data science internship at CodSoft Organization. The internship focused on
developing practical data science skills through various projects, including Titanic survival
prediction, Iris flower classification, and sales prediction.
Learning Objectives and Outcomes
1.Enhance Programming Skills
2. Master Data Cleaning and Preprocessing
3. Develop Machine Learning Models
4.Improve Data Visualization Skills
5. Gain Remote Work Experience
Sector of Business and Intern Organization
CodSoft Organization operates in the technology sector, specializing in providing data-driven
solutions and software development services. The company focuses on leveraging data
science and machine learning to solve complex business problems, enhance decision-making
processes, and drive innovation across various industries.
Summary of Activities
During the internship, the following activities were undertaken:
1. Titanic Survival Prediction:
Task: Data cleaning, feature engineering, model building (logistic regression, decision
trees), and evaluation.
Outcome: Developed models to predict passenger survival with high accuracy, gaining
insights into model performance metrics.
2.Iris Flower Classification:
Tasks: Data preprocessing, model training using KNN and SVM, and performance
assessment.
Outcome: Successfully classified iris species, enhancing understanding of classification
techniques and their applications.
3. Sales Prediction:
Tasks: Data exploration, linear regression model development, and prediction accuracy
evaluation.
Outcome: Created a predictive model for sales forecasting, applying regression analysis to
real-world data.
CHAPTER 2: OVERVIEW OF THE ORGANIZATION
Introduction of the Organization
CodSoft Organization, founded in 2015, is a leading technology company specializing in
data-driven solutions and software development. It leverages cutting-edge technologies to
solve complex business challenges and drive innovation across various sectors, focusing on
data science,Web development machine learning, AI, and cloud computing.
B. Vision, Mission, and Values of the Organization
Vision: To be a global leader in technology innovation, driving digital transformation and
delivering impactful solutions.
Mission: Empower businesses with advanced technological solutions for data-driven
decision-making and sustainable growth.
C. Policy of the Organization in Relation to the Intern Role
CodSoft's internship policy emphasizes structured learning, mentorship, regular feedback,
and hands-on experience. Interns engage in meaningful projects, guided by experienced
mentors to enhance their professional development.
D. Organizational Structure
CodSoft’s structure includes:
Executive Leadership: CEO, CTO, and senior executives.
Departments: R&D, Data Science and Analytics, Software Development, Sales and
Marketing, Human Resources, and Finance and Administration.
E. Future Plans of the Organization
CodSoft plans to:
- Establish innovation labs for AI and blockchain.
- Expand into Latin America and Africa.
- Launch new cybersecurity and automation products.
- Invest in employee training and talent development.
- Implement sustainability initiatives for environmental impact.
CHAPTER 3: INTERNSHIP PART
During my one-month tenure as a Data Science intern at CodSoft, I was immersed in a rich
learning environment facilitated by the online platform. Weekly sessions spanning several
hours delved into various aspects of data science, from fundamental concepts to advanced
techniques, providing me with a comprehensive understanding of the field.
Working alongside experienced mentors, I engaged in practical exercises aimed at honing my
skills in data manipulation, analysis, and visualization. These sessions were highly
interactive, allowing me to apply Python libraries and SQL queries to handle datasets
effectively. Moreover, the emphasis on Exploratory Data Analysis (EDA) equipped me with
the necessary tools to extract meaningful insights from raw data, laying a solid foundation for
subsequent model building.
The latter half of the internship was dedicated to project submissions, where I had the
opportunity to apply my newfound knowledge to real-world scenarios. From predicting
Titanic survival rates to classifying iris flower species, each project challenged me to
leverage machine learning algorithms and techniques to solve complex problems.
One of the most rewarding aspects of the internship was the collaborative learning
environment fostered by CodSoft. Regular discussions with peers and mentors not only
enhanced my technical skills but also exposed me to diverse perspectives and approaches in
data science. The supportive guidance provided by CodSoft's team ensured that I was able to
navigate through challenges effectively and grow professionally.
Beyond technical skills, the internship at CodSoft provided me with a deeper appreciation for
the importance of data quality and preprocessing in the model development process. By
gaining hands-on experience in model evaluation and performance metrics, I developed a
holistic understanding of the data science lifecycle.
In conclusion, my experience as a Data Science intern at CodSoft was transformative,
equipping me with the skills and confidence to pursue a career in this dynamic field. The
structured curriculum, hands-on projects, and collaborative learning environment have
undoubtedly accelerated my growth as a budding data scientist.
ACTIVITY LOG FOR WEEK-1
Person In-
Brief description of Learning
Day Date Charge
the daily activity Outcome
Signature
25-04-2024 Understanding the Understood the
requirements of the task skills required to
allocated and work with the task
establishment of
environment for working
Day-1
with the tasks.
26-04-2024 Applying preprocessing Performed
techniques to work with datacleaning and
the dataset given removed all the
missing data and
outliers from the
Day-2
data
27-04-2024 Perform Exploratory Data Performed EDA
Analysis on the data and replaced some
of the missing
values with
Day-3 mode,mean,medi
a
29-04-2024 Data Visualization Visualized the data
set given using
different plots
available in
matplot lib and
Day-4
seaborn
30-04-2024 Model Selection Selected a
particular model
and fitted for he
data set available
and performd
Day-5
predictions
Day-6 1-05-2024 Accuracy Metrics Evaluated the
model and
predicted the
accuracy
WEEKLY REPORT
WEEK – 1 (From Date: 25-04-2024 to Date: 01-05-2024)
Objective of the Activity Done:
Develop a Titanic survival prediction model, covering data understanding,
preprocessing, analysis, visualization, model selection, and GUI development.
Detailed Report:
Day 1 (25-04-2024):
Activity: Understanding task requirements and setting up the working
environment.
Outcome: Understood necessary skills and prepared the setup.
Day 2 (26-04-2024):
Activity: Data preprocessing.
Outcome:Cleaned data by removing missing values and outliers.
Day 3 (27-04-2024):
Activity: Exploratory Data Analysis (EDA).
Outcome: Analyzed data, handled missing values with mode, mean, and median.
Day 4 (29-04-2024):
Activity: Data visualization.
Outcome: Visualized data using Matplotlib and Seaborn.
Day 5 (30-04-2024):
Activity: Model selection.
Outcome: Selected and evaluated a model, performed prediction.
Day 6 (01-05-2024):
Activity: Accuracy metrics
Outcome: Used different accuracy methods to check the rate of true positives in
the data
ACTIVITY LOG FOR WEEK-2
Day Date Brief description of Learning Person In-Charge
the daily activity Outcome Signature
02-05-2024 Understanding the Understood the
requirements of the skills required to
task allocated and work with the task
establishment of
environment for
Day-7
working with the
tasks.
03-05-2024 Applying Performed
preprocessing datacleaning and
techniques to work removed all the
with the dataset given missing data and
outliers from the
Day-8
data
04-05-2024 Perform Exploratory Performed EDA
Data Analysis on the and replaced some
data of the missing
values with
Day-9 mode,mean,medi
a
06-05-2024 Data Visualization Visualized the
data set given
using different
plots available in
matplot lib and
Day-10
seaborn
07-05-2024 Model Selection Selected a
particular model
and fitted for he
data set available
and performd
Day-11
predictions and
calculated
accuracy
08-05-2024 Accuracy Metrics Evaluated the
model and
predicted the
accuracy
Day-12
WEEKLY REPORT
WEEK – 2 (From Date: 02-05-2024to Date: 08-05-2024)
Objective of the Activity Done:
Develop a Iris Flower Classification, covering data understanding, preprocessing,
analysis, visualization, model selection, and GUI development.
Detailed Report:
Day 1 (02-05-2024):
Activity: Understanding task requirements and setting up the working
environment.
Outcome: Understood necessary skills and prepared the setup.
Day 2 (03-052024):
Activity: Data preprocessing.
Outcome:Cleaned data by removing missing values and outliers.
Day 3 (04-05-2024):
Activity: Exploratory Data Analysis (EDA).
Outcome: Analyzed data, handled missing values with mode, mean, and median.
Day 4 (06-05-2024):
Activity: Data visualization.
Outcome: Visualized data using Matplotlib and Seaborn.
Day 5 (07-05-2024):
Activity: Model selection.
Outcome: Selected and evaluated a model, performed predictions, and calculated
accuracy.
Day 6 (08-05-2024):
Activity: Accuracy metrics
Outcome: Used different accuracy methods to check the rate of true positives in
the data
ACTIVITY LOG FOR WEEK-3
Day Date Brief description of the Learning Person In-Charge
daily activity Outcome Signature
09-05-2024 Understanding the Understood
requirements of the task the skills
allocated and required to
establishment of work with the
environment for working task
Day-13
with the tasks.
10-05-2024 Applying preprocessing Performed
techniques to work with datacleaning
the dataset given and removed
all the missing
data and
Day-14
outliers from
the data
11-05-204 Perform Exploratory Data Performed
Analysis on the data EDA and
replaced some
of the missing
values with
Day-15
mode,mean,m
edia
13-05-2024 Data Visualization Visualized the
data set given
using different
plots available
in matplot lib
Day-16
and seaborn
14-05-2024 Model Selection Selected a
particular
model and
fitted for he
data set
Day-17
available and
performd
predictions
15-05-2024 Accuracy Metrics Evaluated the
model and
Day-18 predicted the
accuracy
WEEKLY REPORT
WEEK – 2 (From Date: 02-05-2024to Date: 08-05-2024)
Objective of the Activity Done:
Develop a Iris Flower Classification, covering data understanding, preprocessing,
analysis, visualization, model selection, and GUI development.
Detailed Report:
Day 1 (02-05-2024):
Activity: Understanding task requirements and setting up the working
environment.
Outcome: Understood necessary skills and prepared the setup.
Day 2 (03-052024):
Activity: Data preprocessing.
Outcome:Cleaned data by removing missing values and outliers.
Day 3 (04-05-2024):
Activity: Exploratory Data Analysis (EDA).
Outcome: Analyzed data, handled missing values with mode, mean, and median.
Day 4 (06-05-2024):
Activity: Data visualization.
Outcome: Visualized data using Matplotlib and Seaborn.
Day 5 (07-05-2024):
Activity: Model selection.
Outcome: Selected and evaluated a model, performed predictions, and calculated
accuracy.
Day 6 (08-05-2024):
Activity: Accuracy metrics
Outcome: Used different accuracy methods to check the rate of true positives in
the data
ACTIVITY LOG FOR WEEK-4
Brief description of Learning Person In-Charge
Day Date
the daily activity Outcome Signature
16-05-2024
Day-19
18-05-2024
Day-20
20-05-2024
Day-21
21-05-2024
Day-22
23-05-2024
Day-23
24-05-2024
Day-24
WEEKLY REPORT
WEEK – 4 (From Date: __________ toDate: __________)
Objective of the Activity Done:
Detailed Report:
CHAPTER 4: OUTCOMES DESCRIPTION
Describe the work environment you have experienced (in terms of people interactions,
facilities available and maintenance, clarity of job roles, protocols, procedures, processes,
discipline, time management, harmonious relationships, socialization, mutual support and
teamwork, motivation, space and ventilation, etc.)
Describe the real time technical skills you have acquired (in terms of the job-related skills
and hands on experience)
Describe the managerial skills you have acquired (in terms of planning, leadership, team
work, behavior, workmanship, productive use of time, weekly improvement in competencies,
goal setting, decision making, performance analysis, etc.
Describe how you could improve your communication skills (in terms of improvement in
oral communication, written communication, conversational abilities, confidence levels
while communicating, anxiety management, understanding others, getting understood by
others, extempore speech, ability to articulate the key points, closing the conversation,
maintaining niceties and protocols, greeting, thanking and appreciating others, etc.,)
Describe how you could enhance your abilities in group discussions, participation in teams,
contribution as a team member, leading a team/activity.
Describe the technological developments you have observed and relevant to the subject
area of training (focus on digital technologies relevant to your job role)
Internship Completion Certificate, Photo with geo tag and Video
Links
A.(in this page insert One Photo with external
mentor and photo with internal mentor at
company location (GPS)) remove red color
text after adding photo …….
B. Certificate
C. Video link (if available)