KEMBAR78
ICT550 Lab Assignment 2 | PDF | Data | Information Science
0% found this document useful (0 votes)
87 views2 pages

ICT550 Lab Assignment 2

Uploaded by

2023197621
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
87 views2 pages

ICT550 Lab Assignment 2

Uploaded by

2023197621
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

ICT550-Principles of Data Management

LAB ASSIGNMENT 2 – SENTIMENT ANALYSIS


(10%)
Week 7

Customers usually talk about products on social media and customer feedback forums. This data
can be collected and analysed to gauge overall customer response. Taking this a step further,
trends in the data can also be examined. For example, customers of a certain age group and
demographic may respond more favourably to a certain product than others. Based on the
information collected, companies can then position the product differently or change their target
audience.

Tasks:
1. Identify a large dataset which consist of public opinions or sentiments on any
topics/issues such as online products reviews (e.g, movies, mobile phones), hotel
reviews, political and financial analysis.

2. Analyse the public opinions about the topics/issues. Use phyton to classify the
sentiment. Submit a report that discusses your findings on sentiment analysis.

The analysis must consist of:


a) Data Definition

b) Data Quality Assessment


 Profiling
 Parsing and standardization
 Generalized "cleansing“
 Matching
 Monitoring

c) Problem resolution
The file you downloaded may contain extra or useless fields of data. It is your job to
analyze what you want to represent. Just like the curator of a museum makes decision
about what to show and how to display it, the data you choose to display directly impacts
the viewer’s experience of the information. The process may include:
 format checks
 completeness checks
 reasonableness checks
 limit checks
 review of the data to identify outliers (geographic, statistical, temporal or
environmental) or other errors,
 assessment of data by subject area experts (e.g. taxonomic specialists)
 missing values
 smooth noisy data
 identify or remove outliers, and
 resolve inconsistencies

d) Data Analysis and visualize your findings.


Show the analysis result in a form of visual representation.
3. Suggests solutions to the kind of text analysis problem that you might encounter in the
chosen dataset.

Submit these 2 documents on Week 8

1. ipynb file (python file)


2. (manual) pdf file which explain about the steps in 1, 2 and 3. Also must include coding & output.

You might also like