KEMBAR78
DATA VISUALIZATION.pptx
DATA VISUALIZATION
Introduction
 Visualization is a pictorial or visual representation technique.
 Anything which is represented in pictorial or graphical form with the help of
diagrams, charts, pictures, and flowcharts are known as visualization.
 Data visualization is a pictorial or visual representation of data with the help
of visual aids such as graphs, bars, histograms, tables, pie charts, mind maps
etc.
 Depending upon the complexity of the data and the aspects from which it is
analysed, visuals can vary in terms of their dimensions (1-D,2-
D/Multidimensional) or types such as a temporal, hierarchical, network.
 All these visuals are used for presenting different types of datasets.
 Different types of tools are available in the market for visualizing data.
 One of the most commonly used data visualization tools is Tableau, which is
available for free as Tableau Public.
 We can use Tableau products for both individual as well as corporate purposes
creating several types of 1D,2D and 3D visualizations.
Ways for representing Visual Data
 The data is first analyzed and then the result of that analysis is visualized as:
 Infographics
 Data visualization
 Infographics are the visual representation of information or data rapidly or
accurately.
 The use of colorful graphics in drawing charts and graphs helps in improving
the interpretation of the given data.
Infographics
 Infographics tell a premeditated story to guide the audience to conclusions (subjective).
 Infographics are visual representations of facts, events, or numbers that reflect patterns and align
with a story.
 They not only represent specific data points but information as well. They are instruments for
reasoning about qualitative information.
 Infographics are:
 Emphasizes a structured narrative rather than patterns in data
 Typically shows simple, aggregated data points to support the narrative
 Are typically static images
 Best for telling a premeditated story and offer subjectivity
 Best for guiding the audience to conclusions and points out relationships
 Created manually for one specific dataset
Data visualization
 It is the study of representing the data or information in visual form.
 With the advancement of digital technologies, the scope of multimedia has
increased.
 Visuals in the form of graphs, images, diagrams or animations have
completely proliferated the media industry and the Internet.
 Human mind can comprehend information more easily if it is presented in the
form of visuals.
 Instructional designers focus on abstract and model-based scientific
visualizations to make the learning content more interesting and easy to
understand.
 Scientific data is also represented through digitally constructed images,
created with the help of software.
 Data visualizations are visual representation of data abstracted into a schematic form so that the
audience can more easily process the information and get a clear idea about the data at a glance.
 They help you understand trends, patterns and to make correlations. They are instruments for
reasoning quantitative information.
 Data visualizations are:
 Highlights patterns in the data for you to draw your own conclusions
 Dives deep into data and uses visual representation to surface trends, relationships and patterns.
 May use dynamic interactions to help you explore the data
 Best for allowing the audience to draw their own conclusions, and offer objectivity
 Ideal for understanding data at a glance
 Automatically generated for arbitrary datasets
 Visualization is an excellent medium to analyse, comprehend and share
information.
 Visual images help to transmit a huge amount of information to the human brain at
a glance.
 They help in establishing relationships and distinction between different patterns
or processes easily.
 They help in exploring data from different angles which help gain indights.
 They help in identifying problems and understanding trends and outliers.
 They point out key or interesting break throughs in a large dataset.
 Data can be classified on the basis of the following three criteria irrespective
of whether it is presented as data visualization or infographics.
 Method of creation- It refers to the type of content used while creating any
graphical representation.
 Quantity of data displayed- it refers to the amount of data which is
represented. For eg: geographical maps, companies’ financial data, etc.
 Degree of creativity applied- It refers to the extent to which the data is
created graphically or designed in a colourful way or it is just showing some
important data in black and white diagrams.
 On the basis of above evaluation, we can understand which is the correct
form of representation for a given data type.
 Graphs
 Diagram
 Timeline
 Template
 Checklist
 Flowchart
 Mindmap
Graph
A Graph is also a visual representation
tool, used in Data Visualization.
It presents information as a series of
coordinates displayed on a multi-
dimensional axis.
Each value in a coordinate is related to the
other(s) through some mathematical, or
time-based, relationship.
A representation in which X and Y axes are
used to depict the meaning of the
information.
Diagram
A 2-D representation of
information to show how
something works
Timeline
A timeline is a chart that depicts how a set of
resources are used over time.
If you're managing a software project and want
to illustrate who is doing what and when, or if
you're organizing a conference and need to
schedule meeting rooms, a timeline is often a
reasonable visualization choice.
One popular type of timeline is the Gantt chart.
A representation of important events in a
sequence with the help of self-explanatory
visual material.
Template
A layout is a design for
presenting
information.
Checklist
A list of items for
comparison and
verification
Flowchart
A representation of
instructions that
shows how something
works or a step-by-
step procedure to
perform a task.
Mind Map
A mind map is a diagram used to
visually organise information.
A mind map is often created
around a single concept, drawn as
an image in the center of a blank
landscape page, to which
associated representations of ideas
such as images, words and parts of
words are added.
Techniques used for visual data
representation
 Data can be presented in various visual forms which include simple line diagrams, bar graphs, tables,
matrices etc.
 Some techniques used for a visual presentation of data are as follows:
 Isoline
 Iso surface
 Direct volume rendering
 Stream line
 Map
 Parallel coordinate plot
 Venn diagram
 Time line
 Euler diagram
 Hyperbolic trees
 Cluster diagram
 Ordinogram
Isoline
It is a 2-D data representation of a curved line that
moves constantly on the surface of a graph.
The plotting of an isoline is based on data
arrangement rather than data visualization.
Isolines are lines drawn to link different places
that share a common value. The prefix 'iso' is a
greek word meaning equal, so an isoline must be
a line joining equal points.
For example, a line drawn on a map to join up all
the places that are the same height above sea
level
Isoline: It is a line on a map, chart or a graph
connecting points of equal value.
Eg: A line drawn on a map to join all the places
that are the same height above the sea level.
They are graphical tools used to denote
geographic lines of equal value. When we study
weather and climate, we will use several kinds of
isolines, such as isotherms to show temperatures
and isobars to show atmospheric pressure.
Isosurface
An isosurface is a three-dimensional analog of an isoline.
It is a surface that represents points of a constant value
(e.g. pressure, temperature, velocity, density) within a
volume of space; in other words, it is a level set of a
continuous function whose domain is 3D-space
They are normally displayed using computer graphics and
are used as data visualization methods in computational
fluid dynamics, allowing engineers to study features of a
fluid flow around the objects, such as aircraft wings.
An isosurface may represent a shock wave in supersonic
flight. In medical imaging isosurfaces may be used to
represent regions of a particular density in a 3-D CT scan,
allowing the visualization of internal organ, bones etc.
DVR
In scientific visualization and computer graphics, volume
rendering is a set of techniques used to display a 2D projection
of a 3D discretely sampled data set, typically a 3D scalar field.
... Direct volume rendering is a computationally intensive task
that may be performed in several ways.
Volume rendering is a method for rendering light as it passes
through media, within a 3D region.
A typical 3D dataset is a group of 2D images acquired by a CT,
MRI or micro CT scanner. The opacity is defined using colours.
Eg: Volume rendered CT scan of a fore arm with different color
schemes for muscle, fat, bone and blood.
Stream lines
A handy way of visualizing the flow
of fluid is through stream lines.
A stream line shows the fluid flow.
It shows the direction of flow.
It’s a 2D flow visualization used in
fluid mechanics and aerodynamics
etc.
Map
It is a visual representation of locations
within specific area.
It is depicted on a planar surface.
A map is a visual representation of an entire
area or a part of an area, typically represented
on a flat surface.
The work of a map is to illustrate specific and
detailed features of a particular area, most
frequently used to illustrate geography.
There are many kinds of maps; static, two-
dimensional, three-dimensional, dynamic and
even interactive.
Maps attempt to represent various things, like
political boundaries, physical features, roads,
topography, population, climates, natural
resources and economic activities.
Parallel coordinate plot
Used for plotting multivariate numerical data.
It is a visualization technique of representing
multidimensional data.
Ideal for comparing many variables together
and seeing the relationships between them.
Eg: To compare an array of products with the
same attributes such as car specifications
across different models.
Time line
Timeline diagrams present events during specific
intervals shown chronologically along a line.
These events may be historic, related to a specific
criminal case or business development, or they
may be important milestones in a project.
Timelines are designed to provide a broad
overview of a sequence of events in time.
They don't go into detail, but links to events,
information and images may be added as needed.
A timeline diagram consists of a horizontal bar or
line representing time progressing from left to right.
This bar is marked with events or steps to indicate
when they should or did happen.
In project management, timelines are most useful
for showing important milestones and deadlines
Hyperbolic trees
Often called a hyper tree is an information
visualization and graph drawing method
inspired by hyperbola geometry.
This method can be used on hierarchies.
They are often used to display web
information.
Eg: Visualization of Roget Thesaurus using
XML tool kit in a 2D hyperbolic tree
visualization
Venn diagram
A Venn diagram or set diagram is a diagram that shows all
possible logical relations between a finite collection of sets.
...
They are used to teach elementary set theory, as well as
illustrate simple set relationships in probability, logic,
statistics, linguistics and computer science.
Contained within each set is a collection of objects or
entities that all have something in common.
It enables one to organize information visually to see the
relationships between two or three sets of items.
Benefits and Purpose:
1. To visually organize information
2. To compare 2 or more choices
3. To solve complex mathematical problems
4. To compare data sets
5. To reason through the logic
Euler diagram
An Euler diagram is a diagrammatic
means of representing sets and their
relationships.
Typically they involve overlapping shapes,
and may be scaled, such that the area of
the shape is proportional to the number of
elements it contains.
Euler diagram only shows relationships
that exist in real world.
Circles, Ovals, or other shapes can be
used for Euler diagram
Cluster diagram
A cluster in general is a group or a bunch of
several discrete items that are close to each
other.
A network diagram can be seen as a special
orderly arranged kind of cluster diagram.
A cluster diagram is a mesh kind of network
diagram.
In computer science more
complex diagrams of computer networks,
computer architecture, file systems and internet
can be considered cluster diagrams.
In information visualization specific visual
representation of large-scale collections of
non-numerical information are sometimes
drawn in the shape of a cluster diagram.
Types of Data visualization
 Linear
 Planar
 Volumetric
 Temporal
 Multidimensional
 Tree
 Network
Linear Data Visualization
 Linear Data Visualization- In this visualization technique data always
represented in list format.
 Basically we can’t consider it as a visualization technique rather than it is
consider as a data organization technique.
 Hence in this process no tool is used to visualize the data.
 It is also called as 1D data visualization
Planar data visualization
 In this type of visualization data generally take in the form of images or
charts over a plane surface.
 The best example of this type of data visualization is Cartogram and dot
distribution map.
 Some tools used to build planar data visualization are Geocommons, Google
fusion tables, Google Maps API, Tableau Public, Poly maps etc.
Volumetric Data visualization
 In this approach the presentation of data generally involves with three
dimensions to present the simulations, surface and volume rendering and
commonly used scientific studies.
 Today many organizations use 3D computer modelling and volume rendering in
advertisements to provide users a better feel of their products.
 Basic tools used for it are AC3D, Auto3D, True-space etc.
Temporal Data visualization
 In some approach visualizations are time dependent in nature so to visualize
on the analyses of time the temporal data visualization is used which consist
of Gantt chart, Time series and sanky diagram etc.
 Now-a-days it is widely used to visualize the real time data.
 TimeFlow, Timeline JS, Excel, TimePlot, TimeSearcher, Google Charts,
Tableau Public, Google Fusion tables are the tools for creating temporal data
visualization.
Multidimensional Data Visualization
 In this approach numerous dimensions are generally used to represent the
data.
 Generally pie charts, histograms, bar charts etc are generally used to
multidimensional data visualization.
 Many Eyes, Tableau, Google charts tool is used to create multidimensional
data visualization
Tree Data visualization
 Data relationships need to be shown in the form of tree hierarchies.
 To represent such kind of relationships, we use tree or hierarchical data
visualizations.
 Hyperbolic trees, wedge stack graph are some of the examples.
 Tools: d3, Google Charts, Network bench and Sci2
Network Data visualization
 This approach is generally used to represent the relations that are too
complex in the form of hierarchies.
 Some of the basic tools used for network data visualization are hive plot,
Pajek, Gephi, NodeXL, Google Fusion tables etc.
Applications of data visualization
 Education
 It is applied to teach a topic that requires simulation or modeling of any object or process.
 It is suitable in case where we need to explain any organ or organ system with the help of diagrams
or animation
 Information- it is applied to transform abstract data into visual forms for easy interpretation
and further exploration
 Production- it is used to create 3D models of products for better viewing and manipulation.
Eg: real estate, automobile industry
 Science- every field of science including dynamics, astrophysics and medicine use visuals for
representing information. Isosurfaces and DVR.
 Systems visualization: it is a new concept that integrates visual techniques to describe
complex systems.
 Visual communication: multimedia and entertainment industry use visuals to communicate
ideas and information.
 Visual analytics: it refers to the science of analytical reasoning supported by interactive
visual surface. The data generated by social media interaction is interpreted using visual
analytics techniques.
Visualizing Big Data
 Almost every organization today is struggling to tackle the huge amount of
data pouring in every day.
 Data visualization is a great way to reduce the turn around time consumed in
interpreting Big Data.
 Traditional visualization techniques are not efficient enough to capture or
interpret the information that Big Data possesses.
 Such techniques are not able to interpret videos, audios and complex
sentences.
 The volume and speed with which it is generating pose a great challenge.
 Most of the traditional analytics techniques are unable to cater to any of
these problems.
 Big data comprises both structured and unstructured forms of data collected
from various sources.
 Heterogeneity of data sources, data streaming and real time data are also
difficult to handle by using traditional tools.
 Traditional tools are developed by using relational models that work best on
static interaction.
 Big data is highly dynamic in function and therefore most traditional tools are
not able to generate quality results.
 The response time of traditional tools is quite high making it unfit for quality
interaction.
Deriving business solution
 The most common notation used for Big Data is 3Vs.
 Big data generated through social media sites is a valuable source of information to
understand consumer sentiments and demographics.
 Challenges faced:
 Data is in unstructured form
 Data is not analysed in real time
 The amount of data generated is huge.
 There is a lack of efficient tools and techniques.
 IT companies are focusing more on research and development of robust algorithms, software
and tools to analyze the data that is scattered in the internet space.
 Tools such as Hadoop are providing the state of the art technology to store and process Big
data.
 Analytical tools are now able to produce interpretations on smart phones and tablets.
Turning data into information
 Visualization facilitates identification of patterns in the form of graphs or
charts which in turn help to derive useful information.
 Visual data mining also works on the same principle as data mining; however
it involves the integration of information, visualization and HCI.
 Data Visualization produces cluttered images that are filtered with the help
of clutter reduction techniques.
 Unform sampling and dimension reduction are two commonly used
techniques.
 Visual data reduction process involves automated data analysis to measure
density, outliers and their differences. These measures are then used as
quality metrics to evaluate data reduction activity.
 Visual quality metrics can be categorized as:
 Size metrics
 Visual effectiveness metrics
 Feature perseveration metrics.
 Visual analytics tool should be:
 Simple enough so that even nontechnical users can operate it.
 Interactive to connect with different sources.
 Competent to create appropriate visuals for interpretations.
 Able to interpret big data and share information.
 Visualization tool must be able to establish links between different data
values, the missing data and polish data for further analysis.`
TOOLS USED IN DATA VISUALIZATION
 Excel
 It is a new tool that is used for data analysis. It helps you to track and visualize
data for deriving better insights.
 The tool provides ways to share data and analytical conclusions within across
organization.
 Last.forward
 It is an open-source software provided by last.fm for analysing and visualizing
social music networks.
 Digg.com
 It provides some of the best web based visualization tool
 Pics
 It is used to track activity of images on the website.
 Arc
 It is used to display the topics and stories in a spherical form. Sphere is used to
display stories and topics, and bunches of stories are aligned at the outer
circumference of sphere.
 Google Charts API
 This tool allows user to create dynamic charts to be embedded in a web page. A
chart obtained from data and formatting parameters supplied in a HTTP request is
converted to PNG image by Google to simplify embedding process.
 TwittEarth
 This tool is capable of showing live tweets from all over the world on a 3D globe, it
is an effort to improve social media visualization and provide a global image
mapping in tweets.
 Tag Galaxy
 It provides a stunning way of finding a collection of Flickr images.
 It is a usual site which provides search tool which makes the online combing
process a memorable visual experience.
 If you want to search a picture, you have to enter a tag of your choice and it will
find the picture.
 The central star contains all images directly relating to the initial tag and revolving
planets consist of similar or corresponding tag.
 Click on the planet and additional sub-categories will appear.
 Click on the central star and flickr images gather and land on a gigantic 3D sphere.
 D3
 It allows to bind of arbitrary data to DOM and then applies data-driven
transformations to the document.
 D3 is used to generate an HTML table from the array of numbers or use the same
data to create an interactive SVG bar chart with smooth transitions and
interactions.
 Rootzmap Mapping the Internet
 It is a tool to generate a series of maps on the basis of data sets provided by
National Aeronautics and Space Administration.
Open Source DVT
 Big Data analytics requires the implementation of advanced tools and
technologies.
 Due to economic and infrastructural limitations, every organization cannot
purchase all the applications required for analyzing data.
 To full fill their requirement of advanced tools and technologies, organizations
often turn to open source libraries.
 These libraries can be defined as pools of freely available applications and
analytical tools.
 Eg: VTK, Cave5D, ELKI, Gephi, Tableau public
 Open source tools are easy to use, consistent and resuable. They deliver high
quality performance and are complaint with web as well as web security.
 They provide multichannel analytics for modelling as well as customized business
solutions that can be altered with changing business demands.

DATA VISUALIZATION.pptx

  • 1.
  • 2.
    Introduction  Visualization isa pictorial or visual representation technique.  Anything which is represented in pictorial or graphical form with the help of diagrams, charts, pictures, and flowcharts are known as visualization.  Data visualization is a pictorial or visual representation of data with the help of visual aids such as graphs, bars, histograms, tables, pie charts, mind maps etc.  Depending upon the complexity of the data and the aspects from which it is analysed, visuals can vary in terms of their dimensions (1-D,2- D/Multidimensional) or types such as a temporal, hierarchical, network.  All these visuals are used for presenting different types of datasets.  Different types of tools are available in the market for visualizing data.
  • 3.
     One ofthe most commonly used data visualization tools is Tableau, which is available for free as Tableau Public.  We can use Tableau products for both individual as well as corporate purposes creating several types of 1D,2D and 3D visualizations.
  • 4.
    Ways for representingVisual Data  The data is first analyzed and then the result of that analysis is visualized as:  Infographics  Data visualization  Infographics are the visual representation of information or data rapidly or accurately.  The use of colorful graphics in drawing charts and graphs helps in improving the interpretation of the given data.
  • 5.
  • 6.
     Infographics tella premeditated story to guide the audience to conclusions (subjective).  Infographics are visual representations of facts, events, or numbers that reflect patterns and align with a story.  They not only represent specific data points but information as well. They are instruments for reasoning about qualitative information.  Infographics are:  Emphasizes a structured narrative rather than patterns in data  Typically shows simple, aggregated data points to support the narrative  Are typically static images  Best for telling a premeditated story and offer subjectivity  Best for guiding the audience to conclusions and points out relationships  Created manually for one specific dataset
  • 7.
    Data visualization  Itis the study of representing the data or information in visual form.  With the advancement of digital technologies, the scope of multimedia has increased.  Visuals in the form of graphs, images, diagrams or animations have completely proliferated the media industry and the Internet.  Human mind can comprehend information more easily if it is presented in the form of visuals.  Instructional designers focus on abstract and model-based scientific visualizations to make the learning content more interesting and easy to understand.  Scientific data is also represented through digitally constructed images, created with the help of software.
  • 8.
     Data visualizationsare visual representation of data abstracted into a schematic form so that the audience can more easily process the information and get a clear idea about the data at a glance.  They help you understand trends, patterns and to make correlations. They are instruments for reasoning quantitative information.  Data visualizations are:  Highlights patterns in the data for you to draw your own conclusions  Dives deep into data and uses visual representation to surface trends, relationships and patterns.  May use dynamic interactions to help you explore the data  Best for allowing the audience to draw their own conclusions, and offer objectivity  Ideal for understanding data at a glance  Automatically generated for arbitrary datasets
  • 9.
     Visualization isan excellent medium to analyse, comprehend and share information.  Visual images help to transmit a huge amount of information to the human brain at a glance.  They help in establishing relationships and distinction between different patterns or processes easily.  They help in exploring data from different angles which help gain indights.  They help in identifying problems and understanding trends and outliers.  They point out key or interesting break throughs in a large dataset.
  • 10.
     Data canbe classified on the basis of the following three criteria irrespective of whether it is presented as data visualization or infographics.  Method of creation- It refers to the type of content used while creating any graphical representation.  Quantity of data displayed- it refers to the amount of data which is represented. For eg: geographical maps, companies’ financial data, etc.  Degree of creativity applied- It refers to the extent to which the data is created graphically or designed in a colourful way or it is just showing some important data in black and white diagrams.
  • 11.
     On thebasis of above evaluation, we can understand which is the correct form of representation for a given data type.  Graphs  Diagram  Timeline  Template  Checklist  Flowchart  Mindmap
  • 12.
    Graph A Graph isalso a visual representation tool, used in Data Visualization. It presents information as a series of coordinates displayed on a multi- dimensional axis. Each value in a coordinate is related to the other(s) through some mathematical, or time-based, relationship. A representation in which X and Y axes are used to depict the meaning of the information.
  • 13.
    Diagram A 2-D representationof information to show how something works
  • 14.
    Timeline A timeline isa chart that depicts how a set of resources are used over time. If you're managing a software project and want to illustrate who is doing what and when, or if you're organizing a conference and need to schedule meeting rooms, a timeline is often a reasonable visualization choice. One popular type of timeline is the Gantt chart. A representation of important events in a sequence with the help of self-explanatory visual material.
  • 15.
    Template A layout isa design for presenting information.
  • 16.
    Checklist A list ofitems for comparison and verification
  • 17.
    Flowchart A representation of instructionsthat shows how something works or a step-by- step procedure to perform a task.
  • 18.
    Mind Map A mindmap is a diagram used to visually organise information. A mind map is often created around a single concept, drawn as an image in the center of a blank landscape page, to which associated representations of ideas such as images, words and parts of words are added.
  • 19.
    Techniques used forvisual data representation  Data can be presented in various visual forms which include simple line diagrams, bar graphs, tables, matrices etc.  Some techniques used for a visual presentation of data are as follows:  Isoline  Iso surface  Direct volume rendering  Stream line  Map  Parallel coordinate plot  Venn diagram  Time line  Euler diagram  Hyperbolic trees  Cluster diagram  Ordinogram
  • 20.
    Isoline It is a2-D data representation of a curved line that moves constantly on the surface of a graph. The plotting of an isoline is based on data arrangement rather than data visualization. Isolines are lines drawn to link different places that share a common value. The prefix 'iso' is a greek word meaning equal, so an isoline must be a line joining equal points. For example, a line drawn on a map to join up all the places that are the same height above sea level Isoline: It is a line on a map, chart or a graph connecting points of equal value. Eg: A line drawn on a map to join all the places that are the same height above the sea level. They are graphical tools used to denote geographic lines of equal value. When we study weather and climate, we will use several kinds of isolines, such as isotherms to show temperatures and isobars to show atmospheric pressure.
  • 21.
    Isosurface An isosurface isa three-dimensional analog of an isoline. It is a surface that represents points of a constant value (e.g. pressure, temperature, velocity, density) within a volume of space; in other words, it is a level set of a continuous function whose domain is 3D-space They are normally displayed using computer graphics and are used as data visualization methods in computational fluid dynamics, allowing engineers to study features of a fluid flow around the objects, such as aircraft wings. An isosurface may represent a shock wave in supersonic flight. In medical imaging isosurfaces may be used to represent regions of a particular density in a 3-D CT scan, allowing the visualization of internal organ, bones etc.
  • 22.
    DVR In scientific visualizationand computer graphics, volume rendering is a set of techniques used to display a 2D projection of a 3D discretely sampled data set, typically a 3D scalar field. ... Direct volume rendering is a computationally intensive task that may be performed in several ways. Volume rendering is a method for rendering light as it passes through media, within a 3D region. A typical 3D dataset is a group of 2D images acquired by a CT, MRI or micro CT scanner. The opacity is defined using colours. Eg: Volume rendered CT scan of a fore arm with different color schemes for muscle, fat, bone and blood.
  • 23.
    Stream lines A handyway of visualizing the flow of fluid is through stream lines. A stream line shows the fluid flow. It shows the direction of flow. It’s a 2D flow visualization used in fluid mechanics and aerodynamics etc.
  • 24.
    Map It is avisual representation of locations within specific area. It is depicted on a planar surface. A map is a visual representation of an entire area or a part of an area, typically represented on a flat surface. The work of a map is to illustrate specific and detailed features of a particular area, most frequently used to illustrate geography. There are many kinds of maps; static, two- dimensional, three-dimensional, dynamic and even interactive. Maps attempt to represent various things, like political boundaries, physical features, roads, topography, population, climates, natural resources and economic activities.
  • 25.
    Parallel coordinate plot Usedfor plotting multivariate numerical data. It is a visualization technique of representing multidimensional data. Ideal for comparing many variables together and seeing the relationships between them. Eg: To compare an array of products with the same attributes such as car specifications across different models.
  • 26.
    Time line Timeline diagramspresent events during specific intervals shown chronologically along a line. These events may be historic, related to a specific criminal case or business development, or they may be important milestones in a project. Timelines are designed to provide a broad overview of a sequence of events in time. They don't go into detail, but links to events, information and images may be added as needed. A timeline diagram consists of a horizontal bar or line representing time progressing from left to right. This bar is marked with events or steps to indicate when they should or did happen. In project management, timelines are most useful for showing important milestones and deadlines
  • 27.
    Hyperbolic trees Often calleda hyper tree is an information visualization and graph drawing method inspired by hyperbola geometry. This method can be used on hierarchies. They are often used to display web information. Eg: Visualization of Roget Thesaurus using XML tool kit in a 2D hyperbolic tree visualization
  • 28.
    Venn diagram A Venndiagram or set diagram is a diagram that shows all possible logical relations between a finite collection of sets. ... They are used to teach elementary set theory, as well as illustrate simple set relationships in probability, logic, statistics, linguistics and computer science. Contained within each set is a collection of objects or entities that all have something in common. It enables one to organize information visually to see the relationships between two or three sets of items. Benefits and Purpose: 1. To visually organize information 2. To compare 2 or more choices 3. To solve complex mathematical problems 4. To compare data sets 5. To reason through the logic
  • 29.
    Euler diagram An Eulerdiagram is a diagrammatic means of representing sets and their relationships. Typically they involve overlapping shapes, and may be scaled, such that the area of the shape is proportional to the number of elements it contains. Euler diagram only shows relationships that exist in real world. Circles, Ovals, or other shapes can be used for Euler diagram
  • 30.
    Cluster diagram A clusterin general is a group or a bunch of several discrete items that are close to each other. A network diagram can be seen as a special orderly arranged kind of cluster diagram. A cluster diagram is a mesh kind of network diagram. In computer science more complex diagrams of computer networks, computer architecture, file systems and internet can be considered cluster diagrams. In information visualization specific visual representation of large-scale collections of non-numerical information are sometimes drawn in the shape of a cluster diagram.
  • 31.
    Types of Datavisualization  Linear  Planar  Volumetric  Temporal  Multidimensional  Tree  Network
  • 32.
    Linear Data Visualization Linear Data Visualization- In this visualization technique data always represented in list format.  Basically we can’t consider it as a visualization technique rather than it is consider as a data organization technique.  Hence in this process no tool is used to visualize the data.  It is also called as 1D data visualization
  • 33.
    Planar data visualization In this type of visualization data generally take in the form of images or charts over a plane surface.  The best example of this type of data visualization is Cartogram and dot distribution map.  Some tools used to build planar data visualization are Geocommons, Google fusion tables, Google Maps API, Tableau Public, Poly maps etc.
  • 34.
    Volumetric Data visualization In this approach the presentation of data generally involves with three dimensions to present the simulations, surface and volume rendering and commonly used scientific studies.  Today many organizations use 3D computer modelling and volume rendering in advertisements to provide users a better feel of their products.  Basic tools used for it are AC3D, Auto3D, True-space etc.
  • 35.
    Temporal Data visualization In some approach visualizations are time dependent in nature so to visualize on the analyses of time the temporal data visualization is used which consist of Gantt chart, Time series and sanky diagram etc.  Now-a-days it is widely used to visualize the real time data.  TimeFlow, Timeline JS, Excel, TimePlot, TimeSearcher, Google Charts, Tableau Public, Google Fusion tables are the tools for creating temporal data visualization.
  • 36.
    Multidimensional Data Visualization In this approach numerous dimensions are generally used to represent the data.  Generally pie charts, histograms, bar charts etc are generally used to multidimensional data visualization.  Many Eyes, Tableau, Google charts tool is used to create multidimensional data visualization
  • 37.
    Tree Data visualization Data relationships need to be shown in the form of tree hierarchies.  To represent such kind of relationships, we use tree or hierarchical data visualizations.  Hyperbolic trees, wedge stack graph are some of the examples.  Tools: d3, Google Charts, Network bench and Sci2
  • 38.
    Network Data visualization This approach is generally used to represent the relations that are too complex in the form of hierarchies.  Some of the basic tools used for network data visualization are hive plot, Pajek, Gephi, NodeXL, Google Fusion tables etc.
  • 39.
    Applications of datavisualization  Education  It is applied to teach a topic that requires simulation or modeling of any object or process.  It is suitable in case where we need to explain any organ or organ system with the help of diagrams or animation  Information- it is applied to transform abstract data into visual forms for easy interpretation and further exploration  Production- it is used to create 3D models of products for better viewing and manipulation. Eg: real estate, automobile industry  Science- every field of science including dynamics, astrophysics and medicine use visuals for representing information. Isosurfaces and DVR.  Systems visualization: it is a new concept that integrates visual techniques to describe complex systems.  Visual communication: multimedia and entertainment industry use visuals to communicate ideas and information.  Visual analytics: it refers to the science of analytical reasoning supported by interactive visual surface. The data generated by social media interaction is interpreted using visual analytics techniques.
  • 40.
    Visualizing Big Data Almost every organization today is struggling to tackle the huge amount of data pouring in every day.  Data visualization is a great way to reduce the turn around time consumed in interpreting Big Data.  Traditional visualization techniques are not efficient enough to capture or interpret the information that Big Data possesses.  Such techniques are not able to interpret videos, audios and complex sentences.  The volume and speed with which it is generating pose a great challenge.  Most of the traditional analytics techniques are unable to cater to any of these problems.
  • 41.
     Big datacomprises both structured and unstructured forms of data collected from various sources.  Heterogeneity of data sources, data streaming and real time data are also difficult to handle by using traditional tools.  Traditional tools are developed by using relational models that work best on static interaction.  Big data is highly dynamic in function and therefore most traditional tools are not able to generate quality results.  The response time of traditional tools is quite high making it unfit for quality interaction.
  • 42.
    Deriving business solution The most common notation used for Big Data is 3Vs.  Big data generated through social media sites is a valuable source of information to understand consumer sentiments and demographics.  Challenges faced:  Data is in unstructured form  Data is not analysed in real time  The amount of data generated is huge.  There is a lack of efficient tools and techniques.  IT companies are focusing more on research and development of robust algorithms, software and tools to analyze the data that is scattered in the internet space.  Tools such as Hadoop are providing the state of the art technology to store and process Big data.  Analytical tools are now able to produce interpretations on smart phones and tablets.
  • 43.
    Turning data intoinformation  Visualization facilitates identification of patterns in the form of graphs or charts which in turn help to derive useful information.  Visual data mining also works on the same principle as data mining; however it involves the integration of information, visualization and HCI.  Data Visualization produces cluttered images that are filtered with the help of clutter reduction techniques.  Unform sampling and dimension reduction are two commonly used techniques.  Visual data reduction process involves automated data analysis to measure density, outliers and their differences. These measures are then used as quality metrics to evaluate data reduction activity.
  • 44.
     Visual qualitymetrics can be categorized as:  Size metrics  Visual effectiveness metrics  Feature perseveration metrics.  Visual analytics tool should be:  Simple enough so that even nontechnical users can operate it.  Interactive to connect with different sources.  Competent to create appropriate visuals for interpretations.  Able to interpret big data and share information.  Visualization tool must be able to establish links between different data values, the missing data and polish data for further analysis.`
  • 45.
    TOOLS USED INDATA VISUALIZATION  Excel  It is a new tool that is used for data analysis. It helps you to track and visualize data for deriving better insights.  The tool provides ways to share data and analytical conclusions within across organization.  Last.forward  It is an open-source software provided by last.fm for analysing and visualizing social music networks.  Digg.com  It provides some of the best web based visualization tool  Pics  It is used to track activity of images on the website.
  • 46.
     Arc  Itis used to display the topics and stories in a spherical form. Sphere is used to display stories and topics, and bunches of stories are aligned at the outer circumference of sphere.  Google Charts API  This tool allows user to create dynamic charts to be embedded in a web page. A chart obtained from data and formatting parameters supplied in a HTTP request is converted to PNG image by Google to simplify embedding process.  TwittEarth  This tool is capable of showing live tweets from all over the world on a 3D globe, it is an effort to improve social media visualization and provide a global image mapping in tweets.
  • 47.
     Tag Galaxy It provides a stunning way of finding a collection of Flickr images.  It is a usual site which provides search tool which makes the online combing process a memorable visual experience.  If you want to search a picture, you have to enter a tag of your choice and it will find the picture.  The central star contains all images directly relating to the initial tag and revolving planets consist of similar or corresponding tag.  Click on the planet and additional sub-categories will appear.  Click on the central star and flickr images gather and land on a gigantic 3D sphere.
  • 48.
     D3  Itallows to bind of arbitrary data to DOM and then applies data-driven transformations to the document.  D3 is used to generate an HTML table from the array of numbers or use the same data to create an interactive SVG bar chart with smooth transitions and interactions.  Rootzmap Mapping the Internet  It is a tool to generate a series of maps on the basis of data sets provided by National Aeronautics and Space Administration.
  • 49.
    Open Source DVT Big Data analytics requires the implementation of advanced tools and technologies.  Due to economic and infrastructural limitations, every organization cannot purchase all the applications required for analyzing data.  To full fill their requirement of advanced tools and technologies, organizations often turn to open source libraries.  These libraries can be defined as pools of freely available applications and analytical tools.  Eg: VTK, Cave5D, ELKI, Gephi, Tableau public  Open source tools are easy to use, consistent and resuable. They deliver high quality performance and are complaint with web as well as web security.  They provide multichannel analytics for modelling as well as customized business solutions that can be altered with changing business demands.