0% found this document useful (0 votes)

93 views10 pages

Exasol: Tuning-Free Database Guide

Exasol is a tuning-free database that automatically manages resources and data distribution across nodes through intelligent algorithms. It uses a column-based storage model and massively parallel processing (MPP) architecture to optimize query performance. The query optimizer analyzes data and queries to determine optimal execution plans and transparently manages indexing without user intervention.

Uploaded by

Peter

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

93 views10 pages

Exasol: Tuning-Free Database Guide

Uploaded by

Peter

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

ACADEMY

Exasol - Overview and Concepts

1
Base idea: Creating a tuning free database

– Strong cost-based query optimizer

– Automatic and fast table analyzer
(e.g. column selectivity)
– Automatic & transparent index creation and maintenance

– Automatic resource management

– Optimized for mixed workload
– Throughput orientated
– Usage of priorities possible to influence the resource management

– Automatic data distribution, compression, …

2 Exasol – Overview and Concepts ACADEMY

Base idea: Creating a tuning free database

Due to intelligent algorithms, Exasol gets even faster with its usage and not slower, as
conventional DBMS do. At the same time, the amount of administrative work will be
strongly reduced, because the on-going tuning is performed by Exasol itself. Exasol's
performance is independent from chosen schema type. Data distribution is automatic on
the basis of the usage profile. The optimizer analyzes both data and queries and matches
them without the intervention of the DBA. The system automatically creates and
administers the indexes on the fly on the basis of the query analysis.
Strong cost-based query optimizer
From the beginning, the query optimizer was designed to meet the needs of massively
parallel data processing. Its goal is to ensure that every single node can process as much
data locally as possible. This significantly reduces the communication overhead and
contributes greatly to the excellent scalability of Exasol.
The optimizer figures out the optimal join order on the basis of table statistics and
therefore allows optimal processing of multiple table joins. Due to these sophisticated
mechanisms, Exasol achieves optimal performance with any database model.
Indexing
Indexes are also automatically generated, reutilized and discarded by the system as
necessary. The user can not directly influence this, the executed queries serve as
foundation for the choice of index.
Data distribution across nodes
The system distributes each table automatically according its distribution attributes across
all active nodes in the cluster (shared nothing architecture). Tables below a certain size
(typically < 100K rows) will be replicated. This ensures, that table joins over their
distribution attributes can mostly be processed locally.
The user can affect data distribution by setting distribution attributes.
Shared nothing architecture (MPP processing)
SELECT s.SALES_DATE, s.MARKET_ID, sp.ARTICLE_ID
FROM RETAIL.SALES s JOIN RETAIL.SALES_POSITIONS sp
ON s.SALES_ID = sp.SALES_ID WHERE s.MARKET_ID IN (661, 534, 678, 1990);

2014-09-17
2014-09-17 661
661 94346
94346 2014-08-02 534 96673 2014-08-09
2014-08-09 678
678 94447
94447 2014-11-01 1990 96378

2014-09-17 661 93086 2014-04-08 678 94826 2014-12-22 1990 93803 2014-11-01 1990 93447

2014-11-28 534 93000 … … … … … … 2014-06-21 661 94447

… … … … … …

id sales_date market id sales_date market

id sales_date market
7 2014-09-17 661 4 2014-08-02 534 id sales_date market
8 2014-11-01 1990
10 2014-11-28 534 1 2014-04-08 678 5 2014-08-09 678
2 2014-06-21 661
… … id article 1 2014-12-22 1990
id article
1 …
7 93086 id article id article
7 94346
…
4 96673 8 93803 8 96378
10 93000
11 93803 2 94447 1 94826 5 94447
… … … … … … … …

3 Exasol – Overview and Concepts ACADEMY

Massively Parallel Data Processing

Exasol was developed as a parallel system and is constructed according to the shared-
nothing principle. Data is distributed across all nodes in a cluster. When responding to
queries, all nodes co-operate and special parallel algorithms ensure that most data is
processed locally in each individual node's main memory.
When a query is sent to the system, it is first accepted by the node the client is connected
to. The query is then distributed to all nodes. Intelligent algorithms optimize the query,
determine the best plan of action and generate needed indexes on the fly. The system
then processes the partial results based the local datasets. This processing paradigm is also
known as SPMD (single program multiple data). All cluster nodes operate on an equal basis,
there is no Master Node. The global query result is delivered back to the user through the
original connection.
Column-based storage

– Table values are stored column wise

SALES
SALES_ID SALES_DATE PRICE MARKET_ID

Row 1 1 Becker
2014-04-08 Hans49.91 678
23000

3 2 Weber
2014-06-21 54.65
Peter 661
730000

4 4 Huber
2014-08-02 49.08
Klaus 534
39600

5 5 Schmidt
2014-08-09 80.01
Maria 678
124000

6 7 Schneider
2014-09-17 43.14
Thomas 661
93600

22 10 Fischer
2014-11-28 63.04
Stefan 534
368200

4 Exasol – Overview and Concepts ACADEMY

Column-based storage

Due to Exasol’s specialization on the data warehousing it benefits from the column-based
data storage, reducing the number of IO operations and the overall amount of processed
data. Typically, queries in a data warehouse access only few columns (eg. when joining
tables).
In order to optimize access to the hard disk, columns are partitioned into blocks. This
facilitates maximum throughput and prevents unnecessary data from being imported. Gaps
can occur in the blocks as a result of various operations (e.g. deletions or updates);
however, the system automatically conducts a defragmentation if certain limits are
exceeded.
Compression

– Compression
▪ Faster hard disc access
▪ Less RAM required
SALES
SALES_ID SALES_DATE PRICE MARKET_ID

1 1 …
2014-04-08 49.91
49.91 678678

2 2 …
2014-06-21 54.65
54.65 661661

4 4 …
2014-08-02 49.08
49.08 534534

5 5 …
2014-08-09 80.01
80.01 678678

7 7 …
2014-09-17 43.14
43.14 661661

10 10 …
2014-11-28 63.04
63.04 534534

5 Exasol – Overview and Concepts ACADEMY

Data compression
To optimize RAM utilization, table data is compressed element by element already in main
memory; on basis of the data types and content of the columns, the system automatically
selects a sufficiently effective compression algorithm. Compression is fully transparent to
the user.
Data blocks

– Several values of one column are collected within a block

SALES
SALES_ID SALES_DATE PRICE MARKET_ID

1 2014-04-08 49.91 678

2 2014-06-21 54.65 661

Block 1
4 2014-08-02 49.08 534
Block x
5 2014-08-09 80.01 678

7 2014-09-17 43.14 661

Block 2 2014-11-28 63.04 534
10

6 Exasol – Overview and Concepts ACADEMY

Data blocks

All columns are devided into data blocks to minimize the amount of data loaded or written
to disc. Data that is not needed for a query is not loaded into the RAM.
Blocks may include different numbers of elements, depending on the column type and the
compression algorithm. Due to delete operations holes may occur within blocks.
These holes are automatically refilled by the system.
Data block types

– Three different types of data blocks:

1. Persistent (Data for persistent tables)
2. Temporary (created during query execution)
3. Indexes

– All block types are transparently loaded into RAM on demand

– All block types are treated the same way

7 Exasol – Overview and Concepts ACADEMY

Data block types

There are three different types of data stored blocks:

Persistent
Temporary
Indexes

Persistent: Data for persistent tables

Temporary: Data created during the query execution (aggregates, sorting, …)
Indexes: Data for internal indexes

All these block types are transparently loaded into the main memory on demand.
All the block types are handled in the same way.
In-Memory processing

Query 1 Query 2 Query 3

DB RAM

1 Smith

Virtual Storage

8 Exasol – Overview and Concepts ACADEMY

In-Memory processing

Exasol achieves its high performance as a result of innovative main memory algorithms.
Unlike the hard-disk-based algorithms of traditional solutions, Exasol can specifically
access any single value within nanoseconds. The algorithms that process the queries take
advantage of these characteristics of the main memory and thus enable optimum
performance. Exasol further enhances performance by automatically adjusting the
contents of main memory according to the respective usage profile.
Upon completion of a write operation, data is commited to the local hard disks. Built-in
redundant data distribution also guarantees high database availability.
This method of data processing is fully transparent for the user.
Hardware

– Utilization of commodity hardware - Industry-Standard 19'' Server

– Clustering of a (large) number of low-cost
components
– Free vendor choice:
Dell, HP, IBM, FSC, Oracle (Sun) …

– 2 Hexa/Ten/Twelve Core CPUs

– 16 – 786 GB RAM
– 2 – 24 SAS/SATA HDD
– GBit Ethernet (1GiB, 10GiB)

9 Exasol – Overview and Concepts ACADEMY

Hardware

Exasol is implemented to work with low-cost commodity hardware. Exasol typically

operates on a cluster of powerful 19'' Intel servers.
Typically each server will be configured as follows:
•2 Intel Xeon CPUs each with 8 up to 12 cores,
•16 to 786 GB RAM and
•2 to 24 SATA or SAS hard disks.
Network connectivity is typically based on standard GBit Ethernet.
Such servers can be delivered by nearly every hardware vendor.
Exasol Logical Limits

– Maximum number of schema objects within a database (tables, views, functions, scripts):
– 250,000
– Maximum number of columns per table:
– 10,000
– Identifier length:
– 128 Characters
– Supported Character Sets:
– UTF8
– ASCII

10 Exasol – Overview and Concepts ACADEMY

Exasol Logical Limits

The listed limits may change in future versions of Exasol.

What Is Sap Abap Data Dictionary (SE11)
No ratings yet
What Is Sap Abap Data Dictionary (SE11)
37 pages
ABAP Technical
100% (1)
ABAP Technical
32 pages
Data Dictionary Assignment
No ratings yet
Data Dictionary Assignment
9 pages
EXASOL User Manual 6.1.0 en
No ratings yet
EXASOL User Manual 6.1.0 en
514 pages
Abap Latest Interviews
No ratings yet
Abap Latest Interviews
32 pages
Dbms Viva Questions
No ratings yet
Dbms Viva Questions
14 pages
EXASOL User Manual 6.0.9 en
No ratings yet
EXASOL User Manual 6.0.9 en
512 pages
(Database Management System (DBMS) PDF
No ratings yet
(Database Management System (DBMS) PDF
136 pages
02 Modul Exasol SQL - en
No ratings yet
02 Modul Exasol SQL - en
41 pages
DBMS Interview Questions and Answers
No ratings yet
DBMS Interview Questions and Answers
5 pages
Dbms Interview Questions
No ratings yet
Dbms Interview Questions
11 pages
Abap Test
No ratings yet
Abap Test
7 pages
Dbms Class Viii
No ratings yet
Dbms Class Viii
6 pages
DBMS Viva Questions MCA Idol
100% (10)
DBMS Viva Questions MCA Idol
14 pages
Q1. What Are The Advantages of Database System? Explain Them Briefly. Jan - Feb 2005, Jul 2007
No ratings yet
Q1. What Are The Advantages of Database System? Explain Them Briefly. Jan - Feb 2005, Jul 2007
8 pages
DBMS OS CN OOPs MostFrequentlyAskedQuestions
No ratings yet
DBMS OS CN OOPs MostFrequentlyAskedQuestions
91 pages
Data Dictionary: Surendra Nadh
No ratings yet
Data Dictionary: Surendra Nadh
45 pages
Document 1
No ratings yet
Document 1
6 pages
SAP ABAP DDIC Interview Guide
No ratings yet
SAP ABAP DDIC Interview Guide
26 pages
DBMS Viva Questions Guide
No ratings yet
DBMS Viva Questions Guide
18 pages
DBMS Solved Paper
100% (1)
DBMS Solved Paper
39 pages
DBMS Viva
No ratings yet
DBMS Viva
52 pages
Krishna Raut (2068884) - Assignment 2
No ratings yet
Krishna Raut (2068884) - Assignment 2
30 pages
ABAP Data Dictionary
No ratings yet
ABAP Data Dictionary
37 pages
Dbms Cheat Sheet
100% (5)
Dbms Cheat Sheet
5 pages
DBMS Interview Prep Guide
No ratings yet
DBMS Interview Prep Guide
19 pages
What Is Kernal Badi? What Is The Difference Between Classic Badi and Kernal Badi ?
No ratings yet
What Is Kernal Badi? What Is The Difference Between Classic Badi and Kernal Badi ?
9 pages
DBMS Interview Prep Guide
No ratings yet
DBMS Interview Prep Guide
22 pages
Advanced DBMS Viva :: New Edition
No ratings yet
Advanced DBMS Viva :: New Edition
33 pages
Oracle 11g Murali Naresh Technology
83% (12)
Oracle 11g Murali Naresh Technology
127 pages
DBMS Sem Imp C
No ratings yet
DBMS Sem Imp C
11 pages
DBMS Interview Questions
No ratings yet
DBMS Interview Questions
16 pages
Education Presentation ABAP Week-8
No ratings yet
Education Presentation ABAP Week-8
13 pages
Answers To Some ABAP Interview Questions
No ratings yet
Answers To Some ABAP Interview Questions
10 pages
DBMS Viva Questions
No ratings yet
DBMS Viva Questions
16 pages
Dbms Scheme End Exam
No ratings yet
Dbms Scheme End Exam
52 pages
C Language
No ratings yet
C Language
8 pages
DBMS Interview Questions
No ratings yet
DBMS Interview Questions
12 pages
SAP R/3 Architecture: Domain and Data Elements in SAP
No ratings yet
SAP R/3 Architecture: Domain and Data Elements in SAP
20 pages
DBMS Oral Questions
No ratings yet
DBMS Oral Questions
15 pages
Sap Abap Basics
No ratings yet
Sap Abap Basics
28 pages
ABAP Dictionary Interview Questions With Answers / Dictionary FAQ
No ratings yet
ABAP Dictionary Interview Questions With Answers / Dictionary FAQ
12 pages
DBMS
No ratings yet
DBMS
38 pages
DBMS Viva Q&a
No ratings yet
DBMS Viva Q&a
5 pages
DBMS Interview Questions
No ratings yet
DBMS Interview Questions
19 pages
Level 1 Fundamentals of SAP АВАР: ABAP Control Structures
No ratings yet
Level 1 Fundamentals of SAP АВАР: ABAP Control Structures
61 pages
EXASOL User Manual 6.0.0 en
No ratings yet
EXASOL User Manual 6.0.0 en
504 pages
ELNing 4
No ratings yet
ELNing 4
14 pages
DBMS Disadvantages and Characteristics
No ratings yet
DBMS Disadvantages and Characteristics
57 pages
DBMS Concepts and Advantages
No ratings yet
DBMS Concepts and Advantages
38 pages
Top 52 DBMS Interview Questions 2023 Javatpoint
No ratings yet
Top 52 DBMS Interview Questions 2023 Javatpoint
2 pages
DBMS Basics: Concepts, Advantages, and Key Features
No ratings yet
DBMS Basics: Concepts, Advantages, and Key Features
17 pages
Mid Term Exam DBMS, May 2023-1
No ratings yet
Mid Term Exam DBMS, May 2023-1
17 pages
SAP & ERP Essentials for Businesses
No ratings yet
SAP & ERP Essentials for Businesses
49 pages
1 New Unit I Date
No ratings yet
1 New Unit I Date
15 pages
1-Wildfire Architecture Overview PDF
No ratings yet
1-Wildfire Architecture Overview PDF
22 pages
Fibre Optics
No ratings yet
Fibre Optics
57 pages
C Programming Basics Quiz
No ratings yet
C Programming Basics Quiz
6 pages
Complexity Theory and Big O Notation
No ratings yet
Complexity Theory and Big O Notation
21 pages
LAB05 SCOR - Configure Cisco Firepower NGFW Discovery and IPS Policy
No ratings yet
LAB05 SCOR - Configure Cisco Firepower NGFW Discovery and IPS Policy
31 pages
Ball - Animation Slides
No ratings yet
Ball - Animation Slides
36 pages
BFS and DFS Algorithms Explained
No ratings yet
BFS and DFS Algorithms Explained
29 pages
Controller Design
No ratings yet
Controller Design
253 pages
Codebook Swo3
No ratings yet
Codebook Swo3
144 pages
SGP 22-v3 1
No ratings yet
SGP 22-v3 1
501 pages
A. Introduction Handouts
No ratings yet
A. Introduction Handouts
6 pages
Computer System Architecture Guide
No ratings yet
Computer System Architecture Guide
30 pages
Newest Repair Tools and Spare Parts-Tiff2024
No ratings yet
Newest Repair Tools and Spare Parts-Tiff2024
42 pages
SATEC Catalog
No ratings yet
SATEC Catalog
28 pages
Marshall Manual
No ratings yet
Marshall Manual
9 pages
ReleaseNote - FileList of G532LWS - 2009 - X64 - V2.01
No ratings yet
ReleaseNote - FileList of G532LWS - 2009 - X64 - V2.01
6 pages
Entry-Level Web Developer Profile
No ratings yet
Entry-Level Web Developer Profile
2 pages
Definition and Types of Modeling and Simulation FINAL
No ratings yet
Definition and Types of Modeling and Simulation FINAL
15 pages
Days of Innocence and Wonder Lucy Treloar Official Test Bank
No ratings yet
Days of Innocence and Wonder Lucy Treloar Official Test Bank
406 pages
Transportation Analytics
No ratings yet
Transportation Analytics
47 pages
AdityaRai Task3
No ratings yet
AdityaRai Task3
56 pages
SchoolBus Web Studyguide 2019
100% (1)
SchoolBus Web Studyguide 2019
44 pages
Linux Certification Essentials
No ratings yet
Linux Certification Essentials
150 pages
B.Tech CSE Provisional Grade Sheet
No ratings yet
B.Tech CSE Provisional Grade Sheet
4 pages
02SOP-Outlook Android
No ratings yet
02SOP-Outlook Android
8 pages
Powerpoint Dissertation Proposal
100% (2)
Powerpoint Dissertation Proposal
5 pages
Starting From SCRATCH: An Introduction To Computing Science - Scratching The Surface
No ratings yet
Starting From SCRATCH: An Introduction To Computing Science - Scratching The Surface
9 pages
Resume: Personal Information
No ratings yet
Resume: Personal Information
3 pages
Cisco Live Introduction To SRv6 uSID Technology-2
No ratings yet
Cisco Live Introduction To SRv6 uSID Technology-2
129 pages
USB Dongle Setup for RES2DINV/3DINV
No ratings yet
USB Dongle Setup for RES2DINV/3DINV
1 page

Exasol: Tuning-Free Database Guide

Uploaded by

Exasol: Tuning-Free Database Guide

Uploaded by

ACADEMY

Exasol - Overview and Concepts

– Strong cost-based query optimizer

– Automatic resource management

– Automatic data distribution, compression, …

2 Exasol – Overview and Concepts ACADEMY

Base idea: Creating a tuning free database

2014-11-28 534 93000 … … … … … … 2014-06-21 661 94447

id sales_date market id sales_date market

3 Exasol – Overview and Concepts ACADEMY

Massively Parallel Data Processing

– Table values are stored column wise

4 Exasol – Overview and Concepts ACADEMY

5 Exasol – Overview and Concepts ACADEMY

– Several values of one column are collected within a block

1 2014-04-08 49.91 678

2 2014-06-21 54.65 661

7 2014-09-17 43.14 661

6 Exasol – Overview and Concepts ACADEMY

– Three different types of data blocks:

– All block types are transparently loaded into RAM on demand

7 Exasol – Overview and Concepts ACADEMY

Data block types

There are three different types of data stored blocks:

Persistent: Data for persistent tables

Query 1 Query 2 Query 3

8 Exasol – Overview and Concepts ACADEMY

– Utilization of commodity hardware - Industry-Standard 19'' Server

– 2 Hexa/Ten/Twelve Core CPUs

9 Exasol – Overview and Concepts ACADEMY

Exasol is implemented to work with low-cost commodity hardware. Exasol typically

10 Exasol – Overview and Concepts ACADEMY

Exasol Logical Limits

The listed limits may change in future versions of Exasol.

You might also like