Web Application Architectures
What is an Architecture?
Architecture describes structure: According to (Bass et al. 1998), the architecture of a software system
consists of its structures, the decomposition into components, and their interfaces and relationships. It
describes both the static and the dynamic aspects of that software system, so that it can be considered a
building design and flow chart for a software product.
Architecture forms the transition from analysis to implementation: When we create architecture we
try to break the functional requirements and quality requirements down into software components and
their relationships and interfaces in an iterative approach. This process is supported by a number of
approaches, such as the Unified Process.
Architecture can be looked at from different viewpoints:
the conceptual view, which identifies entities of the application domain and their relationships;
the runtime view, which describes the components at system runtime, e.g., servers, or
communication connections;
the process view, which maps processes at system runtime, while looking at aspects like
synchronization and concurrency;
The implementation view, which describes the system’s software artifacts, e.g., subsystems,
components, or source code.
This differentiation into different viewpoints is also supported by modeling languages, e.g., the Unified
Modeling Language – UML.
Architecture makes a system understandable: Structuring software systems and breaking them down
into different perspectives allows us to better manage the complexity of software systems, and the
systems become easier to understand. In addition, the abstraction of system aspects facilitates the
communication of important architectural issues.
Patterns
Patterns describe recurring design problems, which arise in a specific design context, and propose
solutions. A solution describes the participating components, their responsibilities, the relationship
between these components, and the interplay of these components within the specific problem. This
means that patterns enable us to reuse proven and consolidated design knowledge, supporting the
development of high-quality software systems.
Buschmann et al. (1996) identifies patterns on three different abstraction levels:
Architecture patterns: These patterns map fundamental structuring mechanisms for software
systems. They describe architectural subsystems, their responsibilities, relationships, and
interplay. One example of this type of pattern is the Model-View-Controller (MVC) pattern.
Design patterns: These patterns describe the structure, the relationships, and the interplay
between components to solve a design problem within a defined context.
Idioms: describe patterns that refer to a specific implementation in a programming language,
such as, for example, the Counted-Pointer idiom for storage management in C++.
Frameworks
Frameworks represent another option to reuse existing architectural knowledge. A framework is a
reusable software system with general functionality already implemented. The framework can be
specialized into a ready-to-use application. ). The framework serves as a blueprint for the basic
architecture and basic functionalities for a specific field of application.
Categorizing of Architecture
Anastopoulos and Romberg describe architectures for Web application environments, taking the
layering aspect of architectures, or the support of different data and data formats – the data aspect of
architectures – into account:
Layering aspect: Layering means that software systems are structured in several tiers to
implement the principle of “separation of concerns” within a software system.
Data aspect: Data can be structured or non-structured. Structured data follow a defined
scheme like tables in a relational database or XML structures in a document. Non-structured
data are multimedia contents, e.g., images, audio, and video, which typically do not follow an
explicit scheme. This makes their automatic processing difficult.
The increasing distribution of software systems has led to the development of architectures and
infrastructures addressing the distribution of data and messages:
Distributed Object Middleware (DOM): This type of infrastructure allows to access remote
objects transparently. It is based on the Remote Procedure Call (RPC) mechanism. Some DOM
systems also enable objects on different platforms to interact.
Virtual Shared Memory (VSM): The VSM model lets distributed processes access common data.
The processes themselves access a shared memory.
Message Oriented Middleware (MOM): MOM systems offer functionalities for asynchronous
transmission of messages. Asynchronous communication differs from synchronous
communication in that messages are sent to the receiver regardless of its status, e.g., the
receiver may not be available when the message is sent, and i.e., he or she may be offline. MOM
ensures that messages are delivered nevertheless.
Peer to Peer (P2P): P2P stands for direct communication between two devices – the peers – in a
system without using a server, i.e., they communicate over a point-to-point connection.
Components of a Generic Web Application Architecture
Communication between these components is generally based on the request– response principle, i.e.,
one component (e.g., a Web browser) sends a request to another component (e.g., a Web server), and
the response to this request is sent back over the same communication channel (synchronous
communication).
The following list briefly describes each of these components:
Client: Generally a browser (user agent) is controlled by a user to operate the Web application.
Firewall: A piece of software regulating the communication between insecure networks (e.g.,
the Internet) and secure networks (e.g., corporate LANs). This communication is filtered by
access rules.
Proxy: A proxy is typically used to temporarily store Web pages in a cache. However, proxies can
also assume other functionalities, e.g., adapting the contents for users (customization), or user
tracking.
Web server: A Web server is a piece of software that supports various Web protocols like HTTP,
and HTTPS, etc., to process client requests.
Database server: This server normally supplies an organization’s production data in structured
form, e.g., in tables.
Media server: This component is primarily used for content streaming of non-structured bulk
data.
Content management server: Similar to a database server, a content management server holds
contents to serve an application. These contents are normally available in the form of semi-
structured data, e.g., XML documents.
Application server: An application server holds the functionality required by several
applications, e.g., workflow or customization.
Legacy application: A legacy application is an older system that should be integrated as an
internal or external component.
Layered Architectures
2-Layer Architectures: A 2-layer architecture according to also called client/server
architecture, uses a Web server to provide services to a client.
This architecture is suitable particularly for simple Web applications. In contrast, a multi-layer
architectural approach is required for more demanding applications which are accessed by a
large number of concurrent clients or which provide complex business processes requiring the
access to legacy systems, amongst others.
N-Layer Architectures:
N-layer architectures typically consist of three layers, the data layer, providing access to the
application data, the business layer, hosting the business logic of the application in an
application server, and finally the presentation layer, which renders the result of the request in
the desired output format. Additionally, security mechanisms like firewalls, or caching
mechanisms like proxies, can be integrated into the request-response flow upon demand.
JSP-Model
JSP-Model (Java Server Pages) architecture, implements the MVC pattern for Web applications,
thus laying the foundation for the integration of navigation aspects, internationalization, and
multi-platform delivery in Web applications.
The JSP-Model architecture is deployed on a Web server, i.e., view, controller, and parts of the
model functionalities of this pattern are available over a Web server extension – a servlet
container.
Struts
The JSP-Model architecture is enhanced by the Struts open-source project of the Apache
Software Foundation. Struts offers useful additions for Web applications, such as error handling
and internationalization. In addition, Struts uses an XML configuration file which allows the
control of the processing flow within the MVC pattern to facilitate the processing of client
requests.
OOHDM
Object Oriented Hypermedia Design Method is a method for the development of Web
applications which consists of five activities, requirements gathering, conceptual design,
navigational design, abstract interface design and implementation. The Object-Oriented
Hypermedia Design Method is a mature approach for building hypermedia and Web applications
by describing different design models which are then mapped onto a running application. Its
implementation is based on the MVC pattern. In contrast to JSP-Model and Struts, this approach
introduces an explicit navigation component.
In this figure, the execution sequence is indicated by the numbered edges: (1) An HTTP request
is issued to the HTTP Request Parser, which forwards a message to the Dispatcher (2). Similar to
Struts, this parser runs the allocated application objects (3). Subsequently, the selected
application object or other information (e.g., user agent) is used to identify the user interface
(4). Next, the user interface is enriched by navigation aspects (5). And finally, the result is put in
an appropriate layout and transmitted to the client.
Integration Architectures
External or internal systems, e.g., existing applications, existing databases and interfaces to
external business partners, can be integrated into Web applications on three levels: the
presentation level, the application logic level, and the content level. Integration architectures
address integration aspects on the content level and the application logic level and are
commonly summarized under the term Enterprise Application Integration (EAI) architectures. ,
EAI focuses on the integration of legacy systems.
This integration can be implemented in any of the following ways:
Point to point: The applications exchange the content to be integrated via external
communication mechanisms, e.g., file transfer or batch transactions.
Data delivery: A system supports access to internal data directly, e.g., over a database
access.
Data integration: Several applications use the same data storage, e.g., via a jointly used
database.
Delivery of functionalities: A legacy system allows to access functionalities, e.g., over an
API.
Reconstruction of functionalities: Access to functionalities is transparent to the client,
i.e., a functionality can, but does not have to, be implemented directly in the legacy
system.
Porting: Legacy systems are migrated onto Web platforms, replacing the original
system.
Drawback
The major drawback of most current integration approaches is that the application logic and
data in legacy systems can be accessed only in their entirety. This has a negative impact on
the adaptability and reusability of both the data and the application logic. Another problem
is due to the different development paradigms used by legacy systems and Web
applications.
Data-aspect Architectures
Data can be grouped into either of three architectural categories:
structured data of the kind held in databases;
Documents of the kind used in document management systems;
Multimedia data of the kind held in media servers.
(1) Database-centric Architectures
A number of tools and approaches is available to integrate databases into Web applications.
These databases are accessed either directly from within Web server extensions (in the case of
2-layer architectures), or over application servers (in the case of n-layer architectures). Since
database technologies (and particularly relational databases) are highly mature, they are easy to
integrate. APIs are available for different platforms.
(2) Architectures for Web Document Management
In addition to structured data held in databases and multimedia data held on media servers,
contents of Web applications are often processed in the form of documents. Content
management architectures support the integration of documents from different sources,
representing a mechanism to integrate these contents into Web applications.
(3) Architectures for Multimedia Data
The ability to handle large data volumes plays a decisive role when designing systems that use
multimedia contents. While the data volume is normally not decisive in database-centric Web
applications, it influences the architecture and the design of multimedia Web applications
considerably.
Basically, multimedia data, i.e., audio and video, can be transmitted over standard Internet
protocols like HTTP or FTP, just like any other data used in Web applications. This approach is
used by a large number of current Web applications, because it has the major benefit that no
additional components are needed on the server.