KEMBAR78
Google Insights and public data | PDF
Insights and Data Tools
Nicola Arnold
Industry Analyst – Education, Non-Profits and Government




                                           Google Confidential and Proprietary
Organize the world's
information and make it
 universally accessible
      and useful.
Insights for Search
www.google.com/insights/search




                                 Google Confidential and Proprietary
Google Confidential and Proprietary
Qui
                               z!


What was the fastest
rising search in the UK, in
the last 7 days?

                       Google Confidential and Proprietary
Qui
                               z!


Which city has shown the
most interest in floods in
the last 90 days?

                       Google Confidential and Proprietary
Flu Trends




             Google Confidential and Proprietary
Google Confidential and Proprietary
Google Confidential and Proprietary
Google Public Data




                 Google Confidential and Proprietary
Organize the world's public data and make it
     universally accessible and useful.
Users interest in public data (Demand)
1  education statistics by school         12    oil price
2  unemployment                           13    last names
3  population                             14    poverty statistics
    population, cities                    15    mortality
    population, density                            mortality, swine flu
    population, growth                             mortality, infant
4 sales tax                                        mortality, teen suicides
5 salaries                                16    election results
6 exchange rates                          17    consumer price index/inflation
7 crime statistics                        18    cost of living
    crime statistics, human trafficking   19    accident statistics
    crime statistics, homicides                    accident statistics, car/traffic
    crime statistics, hate crime                   accident statistics, drunk driving
8 prevalence                                       accident statistics, distracted driving
    aids                                  20   gas price
    alcohol abuse                         21   prison statistics
    drug abuse                            22   earthquake statistics
9 GDP                                     23   obesity statistics
    GDP, nominal                          24   solar energy
    GDP, real                                      solar energy, production
10 minimum wage                                    solar energy, costs
11 disaster statistics                    25    baby names
    disaster statistics, hurricanes
    disaster statistics, floods                   Statistical topics of interest, based on Google.com queries
    disaster statistics, storms
One box




          Google Confidential and Proprietary
Knowledge Panel




                  Google Confidential and Proprietary
Easy-to-Explore
International database - Fertility rate




Map visualization
Embedding (chart), Embedding (application), Sharing
Some examples
•    International:
      o  Aids (PDE)
      o  Life expectancy (1box)
      o  Military expenditures (1box)
      o  Gender equality - representatives in parliament (PDE)
      o  Fastest internet (PDE)

•    USA:
      o  Federal Government Spending
          §  Outlays for statistical offices
      o  Energy mix (production)
      o  Sexually transmitted diseases
      o  Average house prices
Re-usable
Full page embed (UNDP)
And lots more!




                 Google Confidential and Proprietary
Appendix




           Google Confidential and Proprietary
Organize         (1)


Dataset Publishing Language                Metadata contents
  (DSPL)                                    1. Dataset info (name, description, URL,
 • Designed for interactive exploration      etc.)
   and visualization                        2. Provider info (name, description, URL,
 • Released under BSD, open source            etc.)
   license                                  3. Concepts
 • Combines data tables (CSV) with              o Dimensions (e.g., "time", "country",
                                                  "gender", "state") --> canonical
   metadata (XML)                                 concepts
 • Works best with categorical, time            o Metrics (e.g., "population",
  series data ... ... but can represent           "unemployment")
  generic collections of tables too         4. Slices
                                            5. Tables
                                            6. Topics

                                                                       Google Confidential and Proprietary
Organize (2)
Organize (3)
  • Bundle (zip) xml + csv files
  • Upload on http://www.google.com/publicdata/admin
  • --> Instant visualization!
See http://code.google.com/apis/publicdata for more details



Typically data transformation steps are in between published content and
DSPL .csv slices.

Google Insights and public data

  • 1.
    Insights and DataTools Nicola Arnold Industry Analyst – Education, Non-Profits and Government Google Confidential and Proprietary
  • 2.
    Organize the world's informationand make it universally accessible and useful.
  • 3.
    Insights for Search www.google.com/insights/search Google Confidential and Proprietary
  • 4.
  • 5.
    Qui z! What was the fastest rising search in the UK, in the last 7 days? Google Confidential and Proprietary
  • 6.
    Qui z! Which city has shown the most interest in floods in the last 90 days? Google Confidential and Proprietary
  • 10.
    Flu Trends Google Confidential and Proprietary
  • 11.
  • 12.
  • 13.
    Google Public Data Google Confidential and Proprietary
  • 14.
    Organize the world'spublic data and make it universally accessible and useful.
  • 15.
    Users interest inpublic data (Demand) 1 education statistics by school 12 oil price 2 unemployment 13 last names 3 population 14 poverty statistics population, cities 15 mortality population, density mortality, swine flu population, growth mortality, infant 4 sales tax mortality, teen suicides 5 salaries 16 election results 6 exchange rates 17 consumer price index/inflation 7 crime statistics 18 cost of living crime statistics, human trafficking 19 accident statistics crime statistics, homicides accident statistics, car/traffic crime statistics, hate crime accident statistics, drunk driving 8 prevalence accident statistics, distracted driving aids 20 gas price alcohol abuse 21 prison statistics drug abuse 22 earthquake statistics 9 GDP 23 obesity statistics GDP, nominal 24 solar energy GDP, real solar energy, production 10 minimum wage solar energy, costs 11 disaster statistics 25 baby names disaster statistics, hurricanes disaster statistics, floods Statistical topics of interest, based on Google.com queries disaster statistics, storms
  • 16.
    One box Google Confidential and Proprietary
  • 17.
    Knowledge Panel Google Confidential and Proprietary
  • 18.
    Easy-to-Explore International database -Fertility rate Map visualization Embedding (chart), Embedding (application), Sharing
  • 19.
    Some examples •  International: o  Aids (PDE) o  Life expectancy (1box) o  Military expenditures (1box) o  Gender equality - representatives in parliament (PDE) o  Fastest internet (PDE) •  USA: o  Federal Government Spending §  Outlays for statistical offices o  Energy mix (production) o  Sexually transmitted diseases o  Average house prices
  • 20.
  • 21.
    And lots more! Google Confidential and Proprietary
  • 27.
    Appendix Google Confidential and Proprietary
  • 28.
    Organize (1) Dataset Publishing Language Metadata contents (DSPL) 1. Dataset info (name, description, URL, • Designed for interactive exploration etc.) and visualization 2. Provider info (name, description, URL, • Released under BSD, open source etc.) license 3. Concepts • Combines data tables (CSV) with o Dimensions (e.g., "time", "country", "gender", "state") --> canonical metadata (XML) concepts • Works best with categorical, time o Metrics (e.g., "population", series data ... ... but can represent "unemployment") generic collections of tables too 4. Slices 5. Tables 6. Topics Google Confidential and Proprietary
  • 29.
  • 30.
    Organize (3) • Bundle (zip) xml + csv files • Upload on http://www.google.com/publicdata/admin • --> Instant visualization! See http://code.google.com/apis/publicdata for more details Typically data transformation steps are in between published content and DSPL .csv slices.