Running Chaos Engineering in Practice with KubeMonkey - Channy Yun (윤석찬, AWS Tech Evangelist)
Werner Vogels
CTO – Amazon.com
Netflix 2013
Active-Active for Multi-Regional Resiliency
https://medium.com/netflix-techblog/active-active-for-multi-regional-resiliency-c47719f6685b
Amazon 2011
GameDay: Creating Resiliency Through Destruction
https://www.youtube.com/watch?v=zoz0ZjfrQ9s
Chaos Monkey (https://github.com/netflix/chaosmonkey): Instance Fail?
Chaos Gorilla: Zone Fail?
Chaos Kong: Region Fail?
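The three failure scopes above (instance, zone, region) can be illustrated with a minimal sketch. This is not how the Netflix tools are implemented. The `Instance` type, the `pick_victims` function, the scope names, and the sample fleet are all hypothetical, and nothing here talks to a real cloud API. It only shows how the blast radius grows as the scope widens.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Instance:
    """Hypothetical inventory record used only for this sketch."""
    instance_id: str
    zone: str
    region: str


def pick_victims(fleet, scope, rng):
    """Return the instances that a simulated failure of `scope` would take down.

    `scope` is "instance", "zone", or "region" (names assumed for this sketch).
    """
    if not fleet:
        return []
    target = rng.choice(fleet)  # a random instance anchors the blast radius
    if scope == "instance":
        return [target]
    if scope == "zone":
        return [i for i in fleet if i.zone == target.zone]
    if scope == "region":
        return [i for i in fleet if i.region == target.region]
    raise ValueError(f"unknown scope: {scope}")


# Made-up fleet spread over two regions and three zones.
FLEET = [
    Instance("i-1", "us-east-1a", "us-east-1"),
    Instance("i-2", "us-east-1a", "us-east-1"),
    Instance("i-3", "us-east-1b", "us-east-1"),
    Instance("i-4", "us-west-2a", "us-west-2"),
]

if __name__ == "__main__":
    rng = random.Random(0)
    for scope in ("instance", "zone", "region"):
        victims = pick_victims(FLEET, scope, rng)
        print(scope, [v.instance_id for v in victims])
```

Passing the random generator in explicitly keeps a run reproducible, which helps when an experiment has to be replayed.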
100s of microservices
1000s of daily production changes
10,000s of instances
100,000s of customer interactions per minute
1,000,000s of customers
1,000,000,000s of metrics
10,000,000,000 hours of streamed
10s of operations engineers
(c) Dave Hahn, A Day in the Life of a Netflix Engineer Using 37% of the Internet, re:Invent 2015
No Datacenter
• 2008 – Datacenter failure (must be a better way)
• 2009 – Cloud effort starts
• 2010 – The first devices are talking to Netflix in the cloud (US-EAST-1)
• 2011 – EU-WEST-1
• 2013 – US-WEST-2 (active/active)
• 2015 – Migration Complete
Edge
• ELB
• Zuul
• NCCP
• API
Middle Tier & Platform
Product
• Bucket testing
• Subscriber
• Recommendations
Platform
• Routing
• Configuration
• Crypto
Persistence
• Cache
• Database
(c) Josh Evans, Mastering Chaos: A Netflix Guide to Microservices, QCon SF 2016
Microservices (Applications)
DevOps (Culture)
Chaos Engineering
Cloud (Scale)
http://principlesofchaos.org/
http://channy.creation.net/blog/netflix-principles-of-chaos-engineering
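The principles linked above center on a steady-state hypothesis: pick a measurable output that represents normal behavior, inject a fault into an experimental group, and check whether that output stays close to the control group's. The sketch below simulates that loop and is only an illustration. The function names, the failure probabilities, and the tolerance are assumptions, and the "service" is a random number generator rather than a real system.

```python
import random
import statistics


def measure_success_rate(requests, failure_probability, rng):
    """Simulate `requests` calls and return the fraction that succeed."""
    successes = sum(
        1 for _ in range(requests) if rng.random() >= failure_probability
    )
    return successes / requests


def run_experiment(rng, tolerance=0.02, rounds=20):
    """Compare a control group with an experimental group under injected failure."""
    # Control group: baseline failure probability (assumed value).
    control = [measure_success_rate(1000, 0.001, rng) for _ in range(rounds)]
    # Experimental group: hypothetical injected fault raises the failure probability.
    experiment = [measure_success_rate(1000, 0.006, rng) for _ in range(rounds)]
    deviation = abs(statistics.mean(control) - statistics.mean(experiment))
    # The hypothesis holds if the steady-state metric stays within tolerance.
    return {"deviation": deviation, "hypothesis_holds": deviation <= tolerance}


if __name__ == "__main__":
    print(run_experiment(random.Random(42)))
```

In a real experiment the metric would come from production monitoring, and a failed hypothesis would be the signal to stop the experiment and investigate.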
https://www.oreilly.com/webops-perf/free/chaos-engineering.csp
https://github.com/dastergon/awesome-chaos-engineering
https://bit.ly/2uKOJMQ
Chaos Engineering Team
Security Red Team
Simian Army
Failure Injection
ChAP
Gremlin
Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence. http://www.spinnaker.io/
Chaos Testing in Kubernetes
https://github.com/arun-gupta/kubernetes-chaos
Chaos Toolkit, Istio, Gremlin
KubeMonkey Chaos
https://github.com/asobti/kube-monkey
kube-monkey is an implementation of Netflix's Chaos Monkey for Kubernetes clusters
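To make the idea concrete, here is a minimal, simulated sketch of opt-in victim selection in the spirit of kube-monkey. It is not kube-monkey's code and makes no Kubernetes API calls. The label keys mirror the opt-in labels described in the kube-monkey README (`kube-monkey/enabled`, `kube-monkey/kill-value`), but treat their exact names and semantics as assumptions to verify against the project. The `schedule_kills` function and the sample deployments are hypothetical.

```python
import random

# Assumed label keys, modeled on the kube-monkey README; verify before relying on them.
ENABLED_LABEL = "kube-monkey/enabled"
KILL_VALUE_LABEL = "kube-monkey/kill-value"


def opted_in(deployment):
    """A deployment takes part only if it explicitly opts in via its labels."""
    return deployment["labels"].get(ENABLED_LABEL) == "enabled"


def schedule_kills(deployments, rng):
    """Return {deployment name: [pod names to terminate]} for opted-in deployments."""
    plan = {}
    for dep in deployments:
        if not opted_in(dep):
            continue  # never touch workloads that did not opt in
        wanted = int(dep["labels"].get(KILL_VALUE_LABEL, "1"))
        count = min(wanted, len(dep["pods"]))  # cannot kill more pods than exist
        plan[dep["name"]] = rng.sample(dep["pods"], count)
    return plan


# Made-up in-memory stand-ins for cluster state.
DEPLOYMENTS = [
    {
        "name": "web",
        "labels": {ENABLED_LABEL: "enabled", KILL_VALUE_LABEL: "2"},
        "pods": ["web-a", "web-b", "web-c"],
    },
    {"name": "db", "labels": {}, "pods": ["db-a"]},
]

if __name__ == "__main__":
    print(schedule_kills(DEPLOYMENTS, random.Random(7)))
```

The opt-in check is the important design point: only workloads whose owners have labeled them are ever candidates, so the experiment's blast radius stays under the owners' control.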
[Diagram: three App instances, one marked X]
References
https://www.facebook.com/groups/chaosengkorea/
https://www.meetup.com/Korea-Chaos-Engineering-Community/
