DEMOGRAPHICS SEGMENTATION USING MACHINE LEARNING
Submitted by-
Alok Kumar Singh(1803310034)
                                       Submitted to-
Harshit Kumar (1803310090)             Prof. Akash Luthra Sir
Aditya Singh    (1803310020)           (Department of CSE)
                                       RKGIT,GHAZIABAD.
                                      INTRODUCTION
• This project “Demograpihics Segmentation using Machine Learning” is designed for
  observation of diversity of population based on demographics.
• The main aim of this project is to create a machine learning model which will categorize
  data set of population into groups for better understanding of the traits shown by them.
• Demographic segmentation is understanding how different demographic groups perceive
  your brand. Segmentation is different than targeting, as segmentation involves identifying
  how groups of people are different.
                             PROBLEM STATEMENT
As we know that India is one of the largest growing economies in the world ,and with high
population. India is known for its rich diversity with More than 400 cities have populations
of 100,000 or more with different religions and ethnicity etc.
Suppose a marketing firm wants to launch a promotional campaign across different regions
of country so in order to do that the firm wants to understand the diversity in population
demography so that it can plan the promotional campaign accordingly.
The objective is to segregate the regions into different groups so that marketing team can
plan their resources accordingly
                                SOLUTION
• Machine Learning based “Demographics Segmentation is helpful in finding the
  pattern for the diversity of population and thus grouping people who shows same
  traits.
•    Demographic segmentation will help to actually combine different variables to get to
    define a demographic profile. This profile provides specific info regarding a typical
    member of a given group of people.
• Thus allowing marketing analysts and demographics experts to paint a picture of what
  the average consumer of that particular segment is like.
                      LITERATURE SURVEY
• It is well said that data is the new oil ,so with the presence of dataset , we
  can better analyse and build model for more efficiency and better growth.
• Marketing firms are using segmentation study to divide large,
  heterogeneous groups into smaller markets that may be more responsive to
  a targeted product or message.
•    Breaking markets down into smaller groups is more efficient for both the
    brand and the consumer because brands can know whom to ignore and
    consumers are more likely to receive only messaging about products they
    are likely
                            PROPOSED MODULES
• CLUSTERING
Clustering is the task of dividing the population or data points into a number of groups
 such that data points in the same groups are more similar to other data points in the same
group than those in other groups.
• K-MEANS
 K-means algorithm identifies k number of centroids, and then allocates every data point
to the nearest cluster, while keeping the centroids as small as possible.
• EUCLIDEAN DISTANCE
Euclidean Distance represents the shortest distance between two points.
                         SOFTWARES USED
• Jupyter Notebook
  for data preprocessing ,applying model and visualising the results.
• Github and Heroku
  for model deployment.
• VsCode IDE
  to create frontend.
Thank You