Introduction to
Cloud Computing
                        Prepared by: Sushil Sah
By-Sushil Sah                                     1
  Cloud and dynamic Infrastructure
                          Asset
Service management
                        management
 Virtualization and      Information
   consolidation        infrastructure
  Energy Efficiency      Security
     Resilience
    Service-Management
                     visibility
                      Automation
Service-management
                                   Business and IT Firms
  (saas,paas,iaas)
                      Control
   Cloud dynamic
    infrastructure
           Service -management
   This type of special facility or a functionality is provided
    to the cloud IT services by the cloud service providers.
   This facility includes visibility, automation and control to
    delivering the first class IT services.
      Asset-Management
                      Asset values are
                       are maximizing
                      Server
    Platform
                      services            Assets Of cloud
                                              service
                                             providers
               hardware
    Cloud dynamic                    Asset-management
     infrastructure
ASSET-MANAGEMENT
 In this actually the assets or the property which is
involved in providing the cloud services are getting
managed.
 They are getting managed in such a way so that their
value will got maximized.
    Virtualization and consolidation
          Operating
           costs                    platform
          Going downward              service
               or
            Reducing
                            Effective utilization
     Cloud dynamic
      infrastructure
     Virtualization and consolidation
   Here in the above fig it is clearly stated that ”resources
    are getting utilized more and more efficiently.
   Also the operating cost of the systems is getting down.
Information-Infrastructure
                             compliance
        Information
                             Availability
 Business Firms
                              retention
                              Security
    Information-Infrastructure
   It helps the business organizations to achieve the
    following :
   Information compliance,availibility of resources retention
    and security objectives.
            Energy-Efficiency
   Here the IT infrastructure or organization sustainable.
   It means it is not likely to damage or effect any other
    thing.
              Security
Platform          services
                             Managing- risk
 Infrastructure
      Customizing the               Cloud
       governance
                                    dynamic
                                 infrastructure
                       Security
   This cloud infrastructure is responsible for the risk management,
    customizing the governance.
   Risk management Refers to the risks involves in the services which
    are being provided by the cloud-service providers.
   Customization of governance implies that the features of the
    governing body or admin body can be changed but these changes
    are totally depends on the providers wish.
                 Resilience
   This infrastructure provides the feature of resilience
    means the services are resilient.
   It means the infrastructure is safe from all side.
   The IT operations will not be easily get affected.
DEFINITION: Grid computing
   Grid computing is the federation of computer resources
    from multiple administrative domains to reach a common
    goal.
   “computational of grid is a hardware and software
    infrastructure that provides independent pervasive
    and inexpensive access to high end computational
    capabilities”.
What is Grid Computing?
   Computational Grids
       Homogeneous (e.g., Clusters)
       Heterogeneous (e.g., with one-of-a-kind instruments)
   Cousins of Grid Computing
   Methods of Grid Computing
Computational Grids
   Each user A network of geographically distributed
    resources including computers, peripherals, switches,
    instruments, and data.
   should have a single login account to access all
    resources.
   Resources may be owned by diverse organizations.
Computational Grids
   Grids are typically managed by grid ware.
   Grid ware can be viewed as a special type of middleware
    that enable sharing and manage grid components based
    on user requirements and resource attributes (e.g.,
    capacity, performance, availability…)
Cousins of Grid Computing
   Parallel Computing
   Distributed Computing
   Peer-to-Peer Computing
   Many others: Cluster Computing, Network Computing,
    Client/Server Computing, Internet Computing, etc...
Distributed Computing
   People often ask: Is Grid Computing a fancy new name
    for the concept of distributed computing?
   In general, the answer is “no.” Distributed Computing is
    most often concerned with distributing the load of a
    program across two or more processes.
PEER2PEER Computing
   Sharing of computer resources and services by direct
    exchange between systems.
   Computers can act as clients or servers depending on
    what role is most efficient for the network.
Methods of Grid Computing
   Distributed Supercomputing
   High-Throughput Computing
   On-Demand Computing
   Data-Intensive Computing
   Collaborative Computing
   Logistical Networking
Distributed Supercomputing
   Combining multiple high-capacity resources on a
    computational grid into a single, virtual distributed
    supercomputer.
   Tackle problems that cannot be solved on a single
    system.
High-Throughput Computing
   Uses the grid to schedule large numbers of loosely
    coupled or independent tasks, with the goal of putting
    unused processor cycles to work.
On-Demand Computing
   Uses grid capabilities to meet short-term requirements
    for resources that are not locally accessible.
   Models real-time computing demands.
Collaborative Computing
   Concerned primarily with enabling and enhancing
    human-to-human interactions.
   Applications are often structured in terms of a virtual
    shared space.
Logistical Networking
   Global scheduling and optimization of data movement.
   Contrasts with traditional networking, which does not
    explicitly model storage resources in the network.
   Called "logistical" because of the analogy it bears with
    the systems of warehouses, depots, and distribution
    channels.
Who Needs Grid Computing?
   A chemist may utilize hundreds of processors to screen
    thousands of compounds per hour.
   Teams of engineers worldwide pool resources to analyze
    terabytes of structural data.
   Meteorologists seek to visualize and analyze petabytes
    of climate data with enormous computational demands.
An Illustrative Example
   Tiffany Moisan, a NASA research scientist, collected
    microbiological samples in the tidewaters around
    Wallops Island, Virginia.
   She needed the high-performance microscope located at
    the National Center for Microscopy and Imaging
    Research (NCMIR), University of California, San Diego.
   She sent the samples to San Diego and used NPACI’s
    Telescience Grid and NASA’s Information Power Grid
    (IPG) to view and control the output of the microscope
    from her desk on Wallops Island. Thus, in addition to
    viewing the samples, she could move the platform
    holding them and make adjustments to the microscope.
CONT…..
   The microscope produced a huge dataset of images.
   This dataset was stored using a storage resource broker
    on NASA’s IPG.
   Moisan was able to run algorithms on this very dataset
    while watching the results in real time.
                  Grid Users
   Grid developers
   Tool developers
   Application developers
   End Users
   System Administrators
Grid Developers
   Very small group.
   Implementers of a grid “protocol” who provides the basic
    services required to construct a grid.
Tool Developers
   Implement the programming models used by application
    developers.
   Implement basic services similar to conventional
    computing services:
       User authentication/authorization
       Process management
       Data access and communication
Application Developers
   Construct grid-enabled applications for end-users who
    should be able to use these applications without concern
    for the underlying grid.
   Provide programming models that are appropriate for
    grid environments and services that programmers can
    rely on when developing (higher-level) applications.
System Administrators
   Balance local and global concerns.
   Manage grid components and infrastructure.
   Some tasks still not well delineated due to the high
    degree of sharing required.
ADVANTAGE
   Can solve larger, more complex problems in a shorter
    time
   Easier to collaborate with other organizations
   Make better use of existing hardware
DISADVANTAGE
   Grid software and standards are still evolving
   Learning curve to get started
   Non-interactive job submission
The grid- Present, Past, Future
    Number of derivatives in grid computing. Share resources and different
     architecture.
1.    Compute Grids
2.    Data Grids
3.    Science Grids
4.    Access Grids
5.    Knowledge Grids
6.    Cluster Grids
7.    Terra Grids
8.    Commodity Grids
1.   Compute Grids vendors:
         Grid Gain - Professional Open Source
         JPPF - Open Source
2.   Data Grids vendors:
        Oracle Coherence- Commercial
        GemStone- Commercial
        GigaSpaces – Commercial
        JBossCache - Professional Open Source
        EhCache- Open Source
Data
  Functional data requirements for Grid Computing applications are:
       •To integrate multiple distributed, heterogeneous, and independently
       managed data sources.
       •Data transfer mechanisms
       •Data caching and/or replication mechanisms to minimize network
       traffic.
       •Data discovery mechanisms
       •Data encryption and integrity
       •Backup/restore mechanisms and policies
Computation
Functional computational requirements for grid applications are:
      •Independent management of computing resources.
      •Intelligently and transparently select computing resources.
      •Availability, dynamic resource configuration,
      •Failure detection and failover mechanisms.
      •Secure resource management, access, and integrity.
Computational and Data Grids
Data requirements in the early grid solutions:
   Discover data.
   Databases, utilizing meta-data and other attributes of the data.
   The provisioning of computing facilities for high-speed data movement.
   Flexible data access and data filtering capabilities.
Current Grid Activities
    Sharing of resources can be different in present grid.
1.    Computing power
2.    Data
3.    Hardware
4.    Software
5.    Network services
Dynamic benefits of coordinated resource sharing in a virtual
organization.
      The usage patterns found within each of the virtual organizations.
   A virtual organization for weather prediction. For example, this
    virtual organization requires resources such as weather prediction software
    applications to perform the mandatory environmental simulations
    associated with predicting weather.
   A virtual organization for financial modeling. For example, this
    virtual organization requires resources such as software modeling tools for
    performing a multitude of financial analytics, virtualized blades to run the
    above software, and access to data storage facilities for storing and
    accessing data.
Number of requirements for Grid Computing architecture
Three categories
1.   Resource categories
2.   Virtual organization
3.   Users/Applications
Providing facilities for the following scenarios:
   Dynamic discovery of computing resources, based on their capabilities        and
    functions.
   Immediate allocation and provisioning of these resources, based on          their
    availability and the user demands or requirements.
   The management of these resources to meet the required service level agreements
    (SLAs).
   The provisioning of multiple autonomic features for the resources, such as self-
    diagnosis, self-healing, self-configuring, and self-management.
   The provisioning of secure access methods to the resources, and bindings with the
    local security mechanisms based upon the autonomic control policies.
Virtual organization must be capable of providing facilities for:
   Virtual task forces, or groups, to solve specific problems associated with the virtual
    organization.
   Dynamic collection of resources from heterogeneous providers based upon users' needs and
    the sophistication levels of the problems.
   Dynamic identification and automatic problem resolution of a wide variety of troubles, with
    automation of event correlation, linking the specific problems to the required resource and
    service providers.
   The dynamic provisioning and management capabilities of the resources required meeting the
    SLAs.
   The formation of a secured federation (or governance model) and common management
    model for all of the resources respective to the virtual organization.
   The secure delegation of user credentials and identity mapping to the local domain(s).
   The management of resources, including utilization and allocation, to meet a budget and other
    economic criteria.
Users/applications typically found in Grid Computing environments must
    be able to perform the following characteristics:
   The clear and unambiguous identification of the problem
   The identification and mapping of the resources
   The ability to sustain the required levels of QoS, while adhering to the
    anticipated and necessary SLAs.
   The capability to collect feedback regarding resource status, including
    updates for the environment's respective applications.
GRID INFRASTRUCTURE
   GRID infrastructure forms the core foundation for the
    successful grid applications
        Grid computing infrastructure component must
     address several potentially complicated areas in many
     stages of implementation , they are
    1.   Security
    2.   Resource management
    3.   Information services
    4.   Data management
Diagram:
              GRID APPLICATIONS
   G                                G
   R                                R
   I                                I
   D            RES   INF           D
                             DAT    M
   M            OUR   OR
                              A     I
   I             CE   MATI
   D   SEC            ON            D
       URIT                  MAN    D
   D            MAN          AGE
   L    Y              SER          L
                AGE          MEN
   E            MEN    VIC          E
                              T     W
   W             T      ES
   A                                A
   R                                R
   E                                E
              HOSTING ENVIRONMENT
    Security
   Heterogeneous nature of resources – complicated
    polices - complex security schemes
   These computing resources are hosted in differing
    security domains and Heterogeneous
     platforms
   Security requirements – data integrity , confidentiality
    and information privacy
Contd…
   The grid computing data exchange must be protected
    using secure communication channels including
    SSL/TLS
   Secure message exchange mechanisms such as WS-
    Security
   Security infrastructure – grid security infrastructure (GSI)
Resource management
    Resource management scenarios are
    1.   Resource discovery
    2.   Resource monitoring
    3.   Fault isolation
    4.   Resource provisioning
    5.   Resource monitoring
    6.   Autonomic capabilities
    7.   Service level management activities
Contd…
   Resource management area is the selection of correct
    resource from grid resource pool
   Fully based on SLA
    Information services
   Providing valuable information respective to grid
    computing infrastructure resources
   Service are entirely depends on resource availability,
    capacity and utilization
   The information is valuable and mandatory feedback
    respective to resource managers
   Grid solutions are constructed to reflect portals
   Metrics are helpful in SLA
Data management
   Data forms the single most important asset in a grid
    computing system
   Data maybe – input to the resource – output from the
    resource
   Data must be near to the computation where it is used
   Data storage mechanisms – Storage Area Network
    (SAN) ,network file system, virtual database
contd…
  Developers and providers must factor into decision are
   related to selecting the most appropriate data
   management mechanism for grid computing
   infrastructure
This includes size of –
1. data repositories
2.   resource geographical distribution
3. security requirements
4. schemes for replication
5.   caching facilities