0% found this document useful (0 votes)

139 views32 pages

MySQL Cluster - Voxxed Days Belgrade 2015

- MySQL Cluster is a distributed database that provides high availability, horizontal scalability and in-memory performance. It powers many large-scale web and cloud applications. - It uses a shared-nothing architecture where data is split into fragments and replicated across nodes in a cluster. This allows it to scale horizontally by adding more nodes. - Industry leaders across sectors such as social media, content delivery and cloud rely on MySQL Cluster for its ability to handle massive amounts of dynamically generated data at low latency.

Uploaded by

arhismece

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

139 views32 pages

MySQL Cluster - Voxxed Days Belgrade 2015

Uploaded by

arhismece

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

MySQL Cluster

tips & tricks

Bogdan Kecman
MySQL Principal Technical Engineer
Bogdan.Kecman@oracle.com
1
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for information purposes only, and may
not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should
not be relied upon in making purchasing decisions. The development, release, and timing of any features or
functionality described for Oracle’s products remains at the sole discretion of Oracle.

2
Industry Leaders Rely on MySQL

Web & Enterprise OEM & ISVs

Cloud
3
MySQL Powers The Web

Over 50 million Tweets/day. 143,200 Tweets/sec in Aug 2013

”Many petabytes” of data. 11.2 Million Row changes & 2.5 billion
rows read /sec handled in MySQL

6 billion hours of video watched each month

Globally-distributed database with 100 terabytes of user-related

data based on MySQL Cluster

4
The #1 Database in the Cloud

SaaS

Hosting IaaS, PaaS

5
Architectural overview

MySQL Cluster Data Nodes InnoDB MyISAM SE s1 SE s2

6
Who’s Using MySQL Cluster?
MySQL Cluster Architecture

8
MySQL Cluster NODES (1/2)

• Data nodes (ndbd or ndbmtd)

• Stores data and indexes
• In memory
• Non-indexed data can be stored on disk
• Stores schema definition
• Check pointed to disk
• Transaction coordination
• Online backup
• All connect to each other
• Management nodes (ndb_mgmd)
• Distributing configuration
• Logging
• Monitoring
• Act as Arbitrator to prevent split-brain scenarios
• Not crucial for Cluster operation (if not working cluster still works properly,
only limit is that no new nodes can start if no mgm nodes are around)
• One is enough, two are perfect, three are too much
9
MySQL Cluster NODES (2/2)

• SQL nodes (mysqld)

• The NDBCLUSTER Storage Engine is actually API node
• Transparent for most applications
• Used to create tables
• Used for geo-replication
• Can act as arbitrator
• Connects to all Data Nodes
• API nodes (your own executable)
• Application written using NDB API
• C
• C++
• Java
• FAST
• No SQL parsing
• No Optimizer
• Examples
• Ndbcluster storage engine, ndb_restore, memcached-ndbcluster plugin, ldap-ndbcluster…
10
High Availability

• Fragmentation
• Table data is split among data nodes

• Synchronous replication
• Each fragment is stored NoOfReplicas times

• Heartbeating

• Automatic failover

• Online backups

• Online Updates

11
High Performance

• Performance boost
• In-memory
• Shared IO load
• As many SQL/API nodes as you like (up to 200)
• Direct access trough NDB API

• Performance killers
• Network (latency creates big problem)
• With 1gbit nic up to 10 data nodes works without problem
• With modern 10gbit nic up to 40 data nodes can run ok
• Joins
• Huge improvements in 7.4 but still joins on distributed data are always
going to suffer performance, especially on slower network
• Blobs
12
Scaling

• Up to 254 nodes

• Add data nodes on-line

• Note that adding data nodes does not only increase storage capacity
but also increase your IO capacity too.

• Geographical Replication
• Multi channel replication (note it is always idempotent)

13
Accessing NDBCLUSTER data
• SQL (via MySQL connector/php,java,ruby,python..., odbc, MySQL C-API..)
• MEMCACHED (add ndbcluster driver to memcached server)
• ndbAPI (C/C++ API)
• ClusterJ, JPA, ClusterJPA, LDAP
Apps Apps Apps Apps Apps Apps Apps Apps Apps Apps Apps Apps

JPA
ClusterJPA

PHP PERL Python Ruby JDBC ClusterJ JSON Apache Memcached

MySQL JNI Node.js mod-ndb ndb-eng

NDB API (C++)

MySQL Cluster Data Nodes

14
ClusterJ/JPA

• Domain Object Model Persistence API (ClusterJ):

• Java API
• High performance, low latency
• Feature rich
• JPA interface built upon this new Java layer:
• Java Persistence API compliant
• Implemented as an OpenJPA plugin
• Uses ClusterJ where possible, reverts to JDBC for some
operations
• Higher performance than JDBC
• More natural for most Java designers
• Easier Cluster adoption for web applications
Memcached

• Memcached is a distributed memory based hash-

key/value store with no persistence to disk
• NoSQL, simple API, popular with developers
• MySQL Cluster already provides scalable, in-memory
performance with NoSQL (hashed) access as well as
persistence
• Provide the Memcached API but map to NDB API calls
• Writes-in-place, so no need to invalidate cache
• Simplifies architecture as caching & database integrated
into 1 tier
• Access data from existing relational tables
Traditional Memcached Architecture

httpd memcached

hash key
PHP/Perl memcached
to find data
Memcache
friends:12389 memcached
memcache key
NDB & Memcache Architecture: Memcache protocol + NDB storage

MySQL
Cluster
Application
memcached Data Node

Memcache NDB Engine MySQL

Client Cluster
Data Node
Memcached/MySQL Cluster latency

memcachetest -t 2 -M 7000 -c 25000

Cluster & Memcached – Configured Schema

key value
<town:maidenhead,SL6>
Application view
SQL view prefix key value

<town:maidenhead,SL6>

Prefix Table Key-col Val-col policy town ... code ...

town: map.zip town code cluster maidenhead ... SL6 ...

Config tables map.zip

Node.js NoSQL API
• Native JavaScript access to MySQL Cluster
–End-to-End JavaScript: browser to the app and database
Clients –Storing and retrieving JavaScript objects directly in
MySQL Cluster
–Eliminate SQL transformation
• Implemented as a module for node.js
V8 JavaScript Engine –Integrates full Cluster API library within the web app
• Couple high performance, distributed apps, with high
MySQL Cluster Node.js Module performance distributed database

MySQL Cluster Data Nodes

MySQL Cluster NoSQL API for Node.js
Application Code
// Constructor // Create a tweet
function Tweet(user, message) { function newTweet(
this.id = UUID.generate(); err, dbSession, httpReq){
this.timestamp = Date.now(); var tweet = new Tweet(
this.user = user; httpReq.user,
this.message = message; httpReq.message);
} dbSession.persist(tweet);
}
// Server Startup
var nosql = require('mysql-js'); function onNewTweetRequest(
err, httpReq){
var sessionFactory =
nosql.connectSync('ndb'); essionFactory.openSession(
null, newTweet, httpReq);
nosql.mapClass(Tweet, 'tweets');
}
Need for Speed?
NDB API
•C++ programming interface
•Provides direct access to data nodes
•No MySQL server needed (still it is recommended to use
MySQL to manipulate schema)
•No SQL layer (no parser, no optimizer, no …)
•Query batching
•Async transactions
•NDB Events
Can’t all be that good?

•Portability (all requests are hardcoded into your application)

•Less flexibility
–schema changes need to be hardcoded into your app
–to change a simple query you have to change and recompile your c++ code
•No privileges (everyone have access to everything)
•No security (if you can telnet to a data node port you have
access to everything)
•No triggers, views, stored procedures
•No auditting
NDB API
Life of a transaction
1. Start transaction
2. Define operations
3. Execute operations
4. Commit / Abort transaction
NDB API
Starting a Transaction
•A transaction is started by getting an NdbTransaction object
•An Ndb object can have maximum of 1024 parallel transactions

NdbTransaction * t = ndb->startTransaction();
if (t == NULL){
printerr(“could not start transaction\n”);
return (-1);
}
NDB API
Getting an NdbOperation Object
•An NdbOperation object is created with the getNdbOperation
•A table name or a NdbDictionary::Table* needs to be provided

NdbOperation * op = t->getNdbOperation(“tab1”);
if (op == NULL){
//handle error
}
NDB API
Defining the operation type
•insertTuple()
•readTuple()
•writeTuple()
•updateTuple()
•deleteTuple()

op->readTuple();
NDB API EXAMPLE (pk access)
NdbTransaction * trans = ndb->startTransaction();
if (trans == NULL){ printerr(“could not start transaction\n”); return (-1); }
NdbOperation * op = trans->getNdbOperation(“City”);
if (op == NULL) return -1;
op->readTuple();

int idvalue = 3236;

op->equal(“ID”, idvalue);

int population = 0;
op->getValue(“Population”, (char*)&population);

char name[35];
op->getValue(“Name”, name);

if (trans->execute(NdbTransaction::Commit, NdbOperation::AbortOnError, 1)
== -1){
printerr(“transaction was not successful\n”);
return (-1);
}
trans->close();
printf(“The City %s has the population of %d\n”, name, population);
NDB API
Joining tables
•Joining tables with NDB API is way more complex then with SQL
•The basic principle is easy – nested FOR loops
•The method for retrieving the rows depends on the tables
involved and possible indexes
•It is very difficult to do dynamic optimization
•In principle the join method has to be decided when creating
the program (coding time)
SQL vs NDB API speed comparison (reads/second – note logarithmic Y axes)
Thank You!

Questions?

Bogdan Kecman
MySQL Principal Technical Engineer
Bogdan.Kecman@oracle.com 32

MySQL Cluster Sometimes SQL UC2011
No ratings yet
MySQL Cluster Sometimes SQL UC2011
31 pages
Case Studies
No ratings yet
Case Studies
2 pages
Database-Lecture 1
No ratings yet
Database-Lecture 1
39 pages
Geert Vanderkelen MySQL Cluster
No ratings yet
Geert Vanderkelen MySQL Cluster
52 pages
Nosql + SQL Mysql: Mysql Document Store Architecture
No ratings yet
Nosql + SQL Mysql: Mysql Document Store Architecture
4 pages
Mysql Cluster Datasheet
No ratings yet
Mysql Cluster Datasheet
5 pages
41 NoSQL Introduction
No ratings yet
41 NoSQL Introduction
18 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
43 pages
Considerations For Using NoSQL Technology On Your Next IT Project
No ratings yet
Considerations For Using NoSQL Technology On Your Next IT Project
398 pages
Dad Assignment
No ratings yet
Dad Assignment
20 pages
Distributed Database Systems: Vera Goebel
No ratings yet
Distributed Database Systems: Vera Goebel
58 pages
Distributed Dbs
No ratings yet
Distributed Dbs
58 pages
No SQL
No ratings yet
No SQL
32 pages
Hbase in Practice
No ratings yet
Hbase in Practice
46 pages
NoSQL vs RDBMS: A Modern Shift
100% (1)
NoSQL vs RDBMS: A Modern Shift
142 pages
BIG Data Analytics 21CSH-471: Computer Science & Engineering
No ratings yet
BIG Data Analytics 21CSH-471: Computer Science & Engineering
32 pages
Overview of Database
No ratings yet
Overview of Database
25 pages
DBMS Architectures for Students
No ratings yet
DBMS Architectures for Students
30 pages
Unit III FSWD Mongodb
No ratings yet
Unit III FSWD Mongodb
40 pages
Introduction To Mysql Cluster: Architecture and Use: (Based On An Original Paper by Stewart Smith, Mysql Ab)
No ratings yet
Introduction To Mysql Cluster: Architecture and Use: (Based On An Original Paper by Stewart Smith, Mysql Ab)
7 pages
BIG - DATA - Unit 4
No ratings yet
BIG - DATA - Unit 4
99 pages
Mongodb Introductioninstalaltion and Basic Crud Operations
No ratings yet
Mongodb Introductioninstalaltion and Basic Crud Operations
53 pages
DBMS PPT 1 Eng
No ratings yet
DBMS PPT 1 Eng
74 pages
NoSQL Database Overview Lecture
No ratings yet
NoSQL Database Overview Lecture
22 pages
Fdocuments - in Nosql-Seminar
No ratings yet
Fdocuments - in Nosql-Seminar
40 pages
SQL Vs NoSQL Industry Differences
No ratings yet
SQL Vs NoSQL Industry Differences
2 pages
DB Assignment
No ratings yet
DB Assignment
20 pages
No SQL
No ratings yet
No SQL
109 pages
MySQL Cluster Deployment Guide
No ratings yet
MySQL Cluster Deployment Guide
39 pages
Unit 6
No ratings yet
Unit 6
143 pages
S-Advance Database Management System 1
No ratings yet
S-Advance Database Management System 1
68 pages
Technical Presentation - MySQL
No ratings yet
Technical Presentation - MySQL
17 pages
Dbms 1
No ratings yet
Dbms 1
23 pages
Database Architecture Basics
No ratings yet
Database Architecture Basics
32 pages
Nosql Notes
No ratings yet
Nosql Notes
9 pages
2.1.SummerSOC2015 Tutorial NoSQL
No ratings yet
2.1.SummerSOC2015 Tutorial NoSQL
62 pages
LDAP For MySQL Cluster Backndb
No ratings yet
LDAP For MySQL Cluster Backndb
34 pages
Nosql Tricks
No ratings yet
Nosql Tricks
34 pages
Cloud Data Storage
No ratings yet
Cloud Data Storage
47 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
43 pages
Lecture 1
No ratings yet
Lecture 1
31 pages
David Baba
No ratings yet
David Baba
9 pages
NoSQL PDF
No ratings yet
NoSQL PDF
21 pages
4.1 Intro Nosql-Converted-133751863122661863
No ratings yet
4.1 Intro Nosql-Converted-133751863122661863
43 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
45 pages
Module 2
No ratings yet
Module 2
42 pages
Distributed Database
No ratings yet
Distributed Database
12 pages
Mysql 5.1 Reference Manual: Including Mysql Cluster NDB 6.X/7.X Reference Guide
No ratings yet
Mysql 5.1 Reference Manual: Including Mysql Cluster NDB 6.X/7.X Reference Guide
0 pages
Introducing Relational Database Products-2
No ratings yet
Introducing Relational Database Products-2
43 pages
Distributed Databases: Daniel Marcous
No ratings yet
Distributed Databases: Daniel Marcous
41 pages
Lecture 1 - NoSQL
No ratings yet
Lecture 1 - NoSQL
31 pages
Lec 6 - Big Data Storage Technologies II - NoSQL
No ratings yet
Lec 6 - Big Data Storage Technologies II - NoSQL
20 pages
PPT
100% (1)
PPT
36 pages
DBMS
No ratings yet
DBMS
43 pages
JDBC
No ratings yet
JDBC
11 pages
Bcse302l Dbms Module-7 Nosql
No ratings yet
Bcse302l Dbms Module-7 Nosql
30 pages
Database System Architecture
No ratings yet
Database System Architecture
21 pages
NoSQL for Tech Professionals
No ratings yet
NoSQL for Tech Professionals
29 pages
Tuples Relational Calculus Guide
No ratings yet
Tuples Relational Calculus Guide
14 pages
Equifax SQL Injection
No ratings yet
Equifax SQL Injection
6 pages
Hospital Information Systems Overview
No ratings yet
Hospital Information Systems Overview
11 pages
PeopleSoft Training Guide
No ratings yet
PeopleSoft Training Guide
249 pages
DBMS III Exxp
No ratings yet
DBMS III Exxp
9 pages
PL-300 Exam Prep for Power BI Analysts
No ratings yet
PL-300 Exam Prep for Power BI Analysts
35 pages
Item ID
No ratings yet
Item ID
75 pages
Advanced DBMS Course Overview
No ratings yet
Advanced DBMS Course Overview
129 pages
Big Data Unit-3
No ratings yet
Big Data Unit-3
46 pages
Python Programming Exercises
No ratings yet
Python Programming Exercises
60 pages
Lab Assignment 4 - 7
No ratings yet
Lab Assignment 4 - 7
7 pages
How To Manually Remove A Tape Completely From NetBackup-Media Manager
No ratings yet
How To Manually Remove A Tape Completely From NetBackup-Media Manager
3 pages
Upgrade Procedure - ReadSoft - PDAP 7.10 - v1.0
No ratings yet
Upgrade Procedure - ReadSoft - PDAP 7.10 - v1.0
12 pages
Django Models for Beginners
No ratings yet
Django Models for Beginners
3 pages
Database Systems Study Guide
No ratings yet
Database Systems Study Guide
8 pages
A Study On Data Deduplication Techniques For Optimized Storage
No ratings yet
A Study On Data Deduplication Techniques For Optimized Storage
7 pages
Dbms Lab Manual 10csl57
100% (1)
Dbms Lab Manual 10csl57
38 pages
MSSQL Server Backup and Recovery
No ratings yet
MSSQL Server Backup and Recovery
9 pages
Wms Patches
No ratings yet
Wms Patches
8 pages
What Is A CDS View?: 1. Advanced Data Modelling Features
No ratings yet
What Is A CDS View?: 1. Advanced Data Modelling Features
12 pages
Full Roadmap - Data Analyst
No ratings yet
Full Roadmap - Data Analyst
12 pages
Physical Files
No ratings yet
Physical Files
18 pages
Isilon - Understanding PowerScale OneFS Locking, Deadlocks, and Hangdumps - Dell India
No ratings yet
Isilon - Understanding PowerScale OneFS Locking, Deadlocks, and Hangdumps - Dell India
3 pages
Dicom and Pacs
No ratings yet
Dicom and Pacs
28 pages
Android System Log Analysis
No ratings yet
Android System Log Analysis
812 pages
Spectrum Archive VM
No ratings yet
Spectrum Archive VM
83 pages
Create Table Employee
100% (1)
Create Table Employee
2 pages
DBMS Lab: Car Database Assignment
No ratings yet
DBMS Lab: Car Database Assignment
2 pages
Data Warehousing: Online Analytical Processing (OLAP)
No ratings yet
Data Warehousing: Online Analytical Processing (OLAP)
44 pages
CyberArk Issues
No ratings yet
CyberArk Issues
3 pages

MySQL Cluster - Voxxed Days Belgrade 2015

Uploaded by

MySQL Cluster - Voxxed Days Belgrade 2015

Uploaded by

MySQL Cluster

tips & tricks

Web & Enterprise OEM & ISVs

Over 50 million Tweets/day. 143,200 Tweets/sec in Aug 2013

6 billion hours of video watched each month

Globally-distributed database with 100 terabytes of user-related

Hosting IaaS, PaaS

MySQL Cluster Data Nodes InnoDB MyISAM SE s1 SE s2

• Data nodes (ndbd or ndbmtd)

• SQL nodes (mysqld)

• Add data nodes on-line

PHP PERL Python Ruby JDBC ClusterJ JSON Apache Memcached

MySQL JNI Node.js mod-ndb ndb-eng

NDB API (C++)

MySQL Cluster Data Nodes

• Domain Object Model Persistence API (ClusterJ):

• Memcached is a distributed memory based hash-

Memcache NDB Engine MySQL

memcachetest -t 2 -M 7000 -c 25000

Prefix Table Key-col Val-col policy town ... code ...

Config tables map.zip

MySQL Cluster Data Nodes

•Portability (all requests are hardcoded into your application)

int idvalue = 3236;

You might also like