0% found this document useful (0 votes)

115 views46 pages

Hbase in Practice

The document summarizes a presentation on HBase in practice. It begins with introducing the speaker and their experience with HBase and Apache projects. The bulk of the document then covers core HBase concepts like tables, regions, column families and data modeling best practices. It also discusses the different APIs and access options for HBase as well as techniques for performance tuning a HBase cluster.

Uploaded by

Diego Fernandes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

115 views46 pages

Hbase in Practice

Uploaded by

Diego Fernandes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 46

DataWorks Summit 2017 - Munich

HBase in Practice
NoSQL is no SQL is SQL?

Website: www.opencore.com
Agenda
• Brief Intro To Core Concepts
• Access Options
• Data Modelling
• Performance Tuning
• Use-Cases
• Summary
Introduction To Core Concepts
HBase Tables
• From user perspective, HBase is similar to a database, or spreadsheet
• There are rows and columns, storing values
• By default asking for a specific row/column combination returns the
current value (that is, that last value stored there)
HBase Tables
• HBase can have a
different schema
per row
• Could be called
schema-less
• Primary access by
the user given row
key and column
name
• Sorting of rows and
columns by their
key (aka names)
HBase Tables
• Each row/column coordinate is tagged with a version number, allowing
multi-versioned values
• Version is usually
the current time
(as epoch)
• API lets user ask
for versions
(specific, by count,
or by ranges)
• Up to 2B versions
HBase Tables
• Table data is cut into pieces to distribute over cluster
• Regions split table into
shards at size boundaries
• Families split within
regions to group
sets of columns
together
• At least one of
each is needed
Scalability – Regions as Shards
• A region is served by exactly
one region server
• Every region server serves
many regions
• Table data is spread over servers
• Distribution of I/O
• Assignment is based on
configurable logic
• Balancing cluster load
• Clients talk directly to region
servers
Column Family-Oriented

• Group multiple columns into

physically separated locations
• Apply different properties to each
family
• TTL, compression, versions, …
• Useful to separate distinct data
sets that are related
• Also useful to separate larger blob
from meta data
Data Management
• What is available is tracked in three
locations
• System catalog table hbase:meta
• Files in HDFS directories
• Open region instances on servers
• System aligns these locations
• Sometimes (very rarely) a repair may
be needed using HBase Fsck
• Redundant information is useful to
repair corrupt tables
HBase really is….
• A distributed Hash Map
• Imagine a complex, concatenated key including the user given row key and
column name, the timestamp (version)
• Complex key points to actual value, that is, the cell
Fold, Store, and Shift
• Logical rows in tables are
really stored as flat key-value
pairs
• Each carries full coordinates
• Pertinent information can be
freely placed in cell to
improve lookup

• HBase is a column-family
grouped key-value store
HFile Format Information
• All data is stored in a custom (open-source) format, called HFile
• Data is stored in blocks (64KB default)
• Trade-off between lookups and I/O throughput
• Compression, encoding applied _after_ limit check
• Index, filter and meta data is stored in separate blocks
• Fixed trailer allows traversal of file structure
• Newer versions introduce multilayered index and filter structures
• Only load master index and load partial index blocks on demand
• Reading data requires deserialization of block into cells
• Kind of Amdahl’s Law applies
HBase Architecture
• One Master and many Worker servers
• Clients mostly communicate with workers
• Workers store actual data
• Memstore for accruing
• HFile for persistence
• WAL for fail-safety
• Data provided as regions
• HDFS is backing store
• But could be another
HBase Architecture (cont.)
HBase Architecture (cont.)
• Based on Log-Structured Merge-Trees (LSM-Trees)
• Inserts are done in write-ahead log first
• Data is stored in memory and flushed to disk on regular intervals or based
on size
• Small flushes are merged in the background to keep number of files small
• Reads read memory stores first and then disk based files second
• Deletes are handled with “tombstone”
markers
• Atomicity on row level no matter how
many columns
• Keeps locking model easy
Merge Reads
• Read Memstore & StoreFiles
using separate scanners
• Merge matching cells into
single row “view”
• Delete’s mask existing data
• Bloom filters help skip
StoreFiles

• Reads may have to span

many files
APIs and Access Options
HBase Clients
• Native Java Client/API
• Non-Java Clients
• REST server
• Thrift server
• Jython, Groovy DSL
• Spark
• TableInputFormat/TableOutputFormat for MapReduce
• HBase as MapReduce source and/or target
• Also available for table snapshots
• HBase Shell
• JRuby shell adding get, put, scan etc. and admin calls
• Phoenix, Impala, Hive, …
Java API
From Wikipedia:
• CRUD: “In computer programming, create, read, update, and delete are the
four basic functions of persistent storage.”
• Other variations of CRUD include
• BREAD (Browse, Read, Edit, Add, Delete)
• MADS (Modify, Add, Delete, Show)
• DAVE (Delete, Add, View, Edit)
• CRAP (Create, Retrieve, Alter, Purge)
Java API (cont.)
• CRUD
• put: Create and update a row (CU)
• get: Retrieve an entire, or partial row (R)
• delete: Delete a cell, column, columns, or row (D)
• CRUD+SI
• scan: Scan any number of rows (S)
• increment: Increment a column value (I)
• CRUD+SI+CAS
• Atomic compare-and-swap (CAS)
• Combined get, check, and put operation
• Helps to overcome lack of full transactions
Java API (cont.)
• Batch Operations
• Support Get, Put, and Delete
• Reduce network round-trips
• If possible, batch operation to the server to gain better overall throughput
• Filters
• Can be used with Get and Scan operations
• Server side hinting
• Reduce data transferred to client
• Filters are no guarantee for fast scans
• Still full table scan in worst-case scenario
• Might have to implement your own
• Filters can hint next row key
Data Modeling
Where’s your data at?
Key Cardinality

• The best performance is gained from using row keys

• Time range bound reads can skip store files
• So can Bloom Filters
• Selecting column families
reduces the amount of data
to be scanned
• Pure value based access
is a full table scan
• Filters often are too, but
reduce network traffic
Key/Table Design
• Crucial to gain best performance
• Why do I need to know? Well, you also need to know that RDBMS is only working
well when columns are indexed and query plan is OK
• Absence of secondary indexes forces use of row key or column name
sorting
• Transfer multiple indexes into one
• Generate large table -> Good since fits architecture and spreads across cluster
• DDI
• Stands for Denormalization, Duplication and Intelligent Keys
• Needed to overcome trade-offs of architecture
• Denormalization -> Replacement for JOINs
• Duplication -> Design for reads
• Intelligent Keys -> Implement indexing and sorting, optimize reads
Pre-materialize Everything
• Achieve one read per customer request if possible
• Otherwise keep at lowest number
• Reads between 10ms (cache miss) and 1ms (cache hit)
• Use MapReduce or Spark to compute exacts in batch
• Store and merge updates live
• Use increment() methods

Motto: “Design for Reads”

Tall-Narrow vs. Flat-Wide Tables
• Rows do not split
• Might end up with one row per region
• Same storage footprint
• Put more details into the row key
• Sometimes dummy column only
• Make use of partial key scans
• Tall with Scans, Wide with Gets
• Atomicity only on row level
• Examples
• Large graphs, stored as adjacency matrix (narrow)
• Message inbox (wide)
Sequential Keys
<timestamp><more key>: {CF: {CQ: {TS : Val}}}

• Hotspotting on regions is bad!

• Instead do one of the following:
• Salting
• Prefix <timestamp> with distributed value
• Binning or bucketing rows across regions
• Key field swap/promotion
• Move <more key> before the timestamp (see OpenTSDB)
• Randomization
• Move <timestamp> out of key or prefix with MD5 hash
• Might also be mitigated by overall spread of workloads
Key Design Choices

• Based on access pattern, either use

sequential or random keys
• Often a combination of both is needed
• Overcome architectural limitations
• Neither is necessarily bad
• Use bulk import for sequential keys and
reads
• Random keys are good for random access
patterns
Checklist
• Design for Use-Case
• Read, Write, or Both?
• Avoid Hotspotting
• Hash leading key part, or use salting/bucketing
• Use bulk loading where possible
• Monitor your servers!
• Presplit tables
• Try prefix encoding when values are small
• Otherwise use compression (or both)
• For Reads: Restrict yourself
• Specify what you need, i.e. columns, families, time range
• Shift details to appropriate position
• Composite Keys
• Column Qualifiers
Performance Tuning
1000 knobs to turn… 20 are important?
Everything is Pluggable

• Cell
• Memstore
• Flush Policy
• Compaction
Policy
• Cache
• WAL
• RPC handling
•…
Cluster Tuning
• First, tune the global settings
• Heap size and GC algorithm
• Memory share for reads and writes
• Enable Block Cache
• Number of RPC handlers
• Load Balancer
• Default flush and compaction strategy
• Thread pools (10+)
• Next, tune the per-table and family settings
• Region sizes
• Block sizes
• Compression and encoding
• Compactions
• …
Region Balancer Tuning
• A background process in the HBase
Master is tracking load on servers
• The load balancer moves regions
occasionally
• Multiple implementations exists
• Simple counts number of regions
• Stochastic determines cost
• Favored Node pins HDFS block
replicas
• Can be tuned further
• Cluster-wide setting!
RPC Tuning
• Default is one queue for
all types of requests
• Can be split into
separate queues for
reads and writes
• Read queue can be
further split into reads
and scans

 Stricter resource limits,

but may avoid cross-
starvation
Key Tuning
• Design keys to match use-case
• Sequential, salted, or random
• Use sorting to convey meaning
• Colocate related data
• Spread load over all servers
• Clever key design can make use
of distribution: aging-out regions
Compaction Tuning
• Default compaction settings are aggressive
• Set for update use-case
• For insert use-cases, Blooms are effective
• Allows to tune down compactions
• Saves resources by reducing write amplification
• More store files are also enabling faster full
table scans with time range bound scans
• Server can ignore older files
• Large regions may be eligible for advanced
compaction strategies
• Stripe or date-tiered compactions
• Reduce rewrites to fraction of region size
Use-Cases
What works well, what does not, and what is so-so
Placing the Use-Case
Big Data Workloads

Low
latency
HBase
HDFS
+ SQL

HBase + MR/Spark

HBase + Snapshots
-> HDFS + MR/Spark HDFS + MR
(Hive/Pig)
Batch

Random Access Short Scan Full Scan

Big Data Workloads
Simple Entities
Low
latency Messages
Graph data
HBase
HDFS
Current Metrics Entity Time series + SQL

HBase + MR/Spark Analytic archive

Hybrid Entity Time series
+ Rollup serving
HBase + Snapshots
Index building Hybrid Entity Time+ MR/Spark
HDFS series
-> HDFS + MR/Spark
+ Rollup generation(Hive/Pig)
Batch

Random Access Short Scan Full Scan

Summary
Wrapping it up…
What matters…
• For optimal performance, two things need to be considered:
• Optimize the cluster and table settings
• Choose the matching key schema
• Ensure load is spread over tables and cluster nodes
• HBase works best for random access and bound scans
• HBase can be optimized for larger scans, but its sweet spot is short burst scans (can
be parallelized too) and random point gets
• Java heap space limits addressable space
• Play with region sizes, compaction strategies, and key design to maximize result
• Using HBase for a suitable use-case will make for a happy customer…
• Conversely, forcing it into non-suitable use-cases may be cause for trouble
Questions?
Thank You!
@larsgeorge

Akka PDF
No ratings yet
Akka PDF
454 pages
Hbase PDF
No ratings yet
Hbase PDF
8 pages
HBase Guide for Developers
No ratings yet
HBase Guide for Developers
33 pages
Impala
No ratings yet
Impala
11 pages
Hbase Apache Org Book HTML
No ratings yet
Hbase Apache Org Book HTML
482 pages
Teradata RDBMS Architecture Guide
No ratings yet
Teradata RDBMS Architecture Guide
55 pages
Netezza Performance Server Release Notes
No ratings yet
Netezza Performance Server Release Notes
58 pages
Apache Cassandra: Database
No ratings yet
Apache Cassandra: Database
55 pages
Docker - Part1
No ratings yet
Docker - Part1
3 pages
Create Int Varchar Date Varchar State Varchar: Emp - Piyush Employeeid Empname 30 Dob City 20 20
100% (1)
Create Int Varchar Date Varchar State Varchar: Emp - Piyush Employeeid Empname 30 Dob City 20 20
10 pages
Hive in Class Assignment Winter 2021
No ratings yet
Hive in Class Assignment Winter 2021
2 pages
Syllabus Mysql
100% (2)
Syllabus Mysql
8 pages
Business Intelligence DW
No ratings yet
Business Intelligence DW
17 pages
SS1123 - D2T - Apache Cassandra Overview PDF
100% (1)
SS1123 - D2T - Apache Cassandra Overview PDF
45 pages
MongoDB Manual Master
No ratings yet
MongoDB Manual Master
1,117 pages
Comprehensive Guide to SQL Joins
No ratings yet
Comprehensive Guide to SQL Joins
27 pages
SQL Server Student Guide-1
No ratings yet
SQL Server Student Guide-1
98 pages
YAPP (Oracle) Yet Another Performance Profiling Method
No ratings yet
YAPP (Oracle) Yet Another Performance Profiling Method
28 pages
Informatica 9.x Course Curriculum
No ratings yet
Informatica 9.x Course Curriculum
8 pages
Red Hat JBoss Enterprise Application Platform-7.1-Configuration Guide-en-US
No ratings yet
Red Hat JBoss Enterprise Application Platform-7.1-Configuration Guide-en-US
487 pages
SAGE X3 Budget V12 English Version 1560100062
No ratings yet
SAGE X3 Budget V12 English Version 1560100062
10 pages
DB2 9 DBA Certification Exam 731 Prep, Part 1:: Server Management
No ratings yet
DB2 9 DBA Certification Exam 731 Prep, Part 1:: Server Management
44 pages
Oracle Netapp Best Practices
No ratings yet
Oracle Netapp Best Practices
47 pages
Designer Client Guide
100% (2)
Designer Client Guide
263 pages
Database Systems Introduction
No ratings yet
Database Systems Introduction
35 pages
Netezza Commands
No ratings yet
Netezza Commands
1 page
DataStage Universe Basic SQL Client Interface Guide
No ratings yet
DataStage Universe Basic SQL Client Interface Guide
285 pages
HBase Succinctly PDF
100% (1)
HBase Succinctly PDF
85 pages
Scala PDF
No ratings yet
Scala PDF
29 pages
Spring JDBC
No ratings yet
Spring JDBC
17 pages
Oracle 11gR2 RAC Command Guide
No ratings yet
Oracle 11gR2 RAC Command Guide
5 pages
Oracle Unified and Internet Directory
No ratings yet
Oracle Unified and Internet Directory
7 pages
Oracle Multitenant on Exadata
No ratings yet
Oracle Multitenant on Exadata
31 pages
Database Vault
No ratings yet
Database Vault
18 pages
Oracle Index Types
No ratings yet
Oracle Index Types
4 pages
WebLogic 12c Dynamic Clusters
No ratings yet
WebLogic 12c Dynamic Clusters
8 pages
Flash Recovery Area - Space Management Warning and Alerts
No ratings yet
Flash Recovery Area - Space Management Warning and Alerts
4 pages
Oracle Server Basics & Architecture
No ratings yet
Oracle Server Basics & Architecture
154 pages
BCA 428 Oracle
No ratings yet
BCA 428 Oracle
142 pages
APEX Sessions
No ratings yet
APEX Sessions
14 pages
Move Oracle Datafiles in Asm
No ratings yet
Move Oracle Datafiles in Asm
9 pages
Mongo DB
No ratings yet
Mongo DB
31 pages
Tablespace Management Oracle
100% (1)
Tablespace Management Oracle
5 pages
Query Optimization
No ratings yet
Query Optimization
9 pages
24 Hadoop Interview Questions & Answers For MapReduce Developers - FromDev
No ratings yet
24 Hadoop Interview Questions & Answers For MapReduce Developers - FromDev
7 pages
SQL Joins and Operations Guide
No ratings yet
SQL Joins and Operations Guide
8 pages
Lab 1 - Accessing and Preparing Data
No ratings yet
Lab 1 - Accessing and Preparing Data
36 pages
Oracle 12c - CDB - PDB - Performing Basic Tasks PDF
No ratings yet
Oracle 12c - CDB - PDB - Performing Basic Tasks PDF
18 pages
Spring Data Access: By, Srinivas Reddy.S
No ratings yet
Spring Data Access: By, Srinivas Reddy.S
21 pages
EXAMREVIEW AWSCertifiedDeveloperAssociate
No ratings yet
EXAMREVIEW AWSCertifiedDeveloperAssociate
354 pages
Using GIT With Talend Studio
No ratings yet
Using GIT With Talend Studio
7 pages
Teradata SQL Performance Tuning Case Study Part II
0% (1)
Teradata SQL Performance Tuning Case Study Part II
37 pages
Datastage Admin
No ratings yet
Datastage Admin
161 pages
Nagios Interview Guide
No ratings yet
Nagios Interview Guide
7 pages
CDC Installation
No ratings yet
CDC Installation
686 pages
Hadoop HBASE
No ratings yet
Hadoop HBASE
71 pages
Unit - IV - Notes
No ratings yet
Unit - IV - Notes
23 pages
Hbase - in Detail: Pushpinder Singh Paxcel Technologies
No ratings yet
Hbase - in Detail: Pushpinder Singh Paxcel Technologies
32 pages
Ba Iift 17-18
No ratings yet
Ba Iift 17-18
40 pages
PHP Security for Developers
100% (2)
PHP Security for Developers
89 pages
Multmedia Studies
No ratings yet
Multmedia Studies
15 pages
Database Transaction States
No ratings yet
Database Transaction States
4 pages
Oracle Datafile Resizing Guide
No ratings yet
Oracle Datafile Resizing Guide
13 pages
R-Trees - Presentation Slides
100% (1)
R-Trees - Presentation Slides
44 pages
ESG Dell Storage Portfolio Brochure
No ratings yet
ESG Dell Storage Portfolio Brochure
12 pages
ABAP Dictionary Guide: SE11 & DDIC
No ratings yet
ABAP Dictionary Guide: SE11 & DDIC
24 pages
Java Lab 25 Questions
100% (1)
Java Lab 25 Questions
4 pages
Database Management System: Dr. Neha Gulati University Business School Panjab University
100% (1)
Database Management System: Dr. Neha Gulati University Business School Panjab University
30 pages
SAP BODS Mock Test Answers
No ratings yet
SAP BODS Mock Test Answers
7 pages
Merged OSEI 041P
No ratings yet
Merged OSEI 041P
13 pages
Security Architecture and Design
No ratings yet
Security Architecture and Design
33 pages
Maintenance Order Program SAP
100% (1)
Maintenance Order Program SAP
2 pages
Saida
No ratings yet
Saida
41 pages
830-00742-47 ZMS Admin Guide
No ratings yet
830-00742-47 ZMS Admin Guide
158 pages
Secure PHP Login Form Script
No ratings yet
Secure PHP Login Form Script
4 pages
Spec Sheet Rubrik Appliance Specs r6000 (2020)
No ratings yet
Spec Sheet Rubrik Appliance Specs r6000 (2020)
1 page
SQL With TADOQuery
No ratings yet
SQL With TADOQuery
3 pages
MCC Workshop Booklet
100% (3)
MCC Workshop Booklet
158 pages
1.strating Classes (20 Files Merged)
No ratings yet
1.strating Classes (20 Files Merged)
743 pages
Chapter 1
No ratings yet
Chapter 1
21 pages
Sysmaster Database Insights
No ratings yet
Sysmaster Database Insights
23 pages
Dbms Unit V
No ratings yet
Dbms Unit V
27 pages
ECS 3.6.1 Monitoring Guide Rev1.1
No ratings yet
ECS 3.6.1 Monitoring Guide Rev1.1
80 pages
Veeam Definitive Guide 2023
No ratings yet
Veeam Definitive Guide 2023
36 pages
Tutorial On File Organization: Comparison of File Organizations
No ratings yet
Tutorial On File Organization: Comparison of File Organizations
15 pages
IManager U2000 Security and Data Management
No ratings yet
IManager U2000 Security and Data Management
45 pages
Py4inf Exercises
No ratings yet
Py4inf Exercises
14 pages
Orchadmin Guide for Data Engineers
No ratings yet
Orchadmin Guide for Data Engineers
2 pages
Chapter 04 - ABAP Data Declarations
No ratings yet
Chapter 04 - ABAP Data Declarations
33 pages

Hbase in Practice

Uploaded by

Hbase in Practice

Uploaded by

DataWorks Summit 2017 - Munich

Lars George – Partner and Co-Founder @ OpenCore

• Group multiple columns into

• Reads may have to span

• The best performance is gained from using row keys

Motto: “Design for Reads”

• Hotspotting on regions is bad!

• Based on access pattern, either use

 Stricter resource limits,

Random Access Short Scan Full Scan

HBase + MR/Spark Analytic archive

Random Access Short Scan Full Scan

You might also like