Modern Monitoring - devopsdays Cuba

34
@bridgetkromhout Monitoring

Transcript of Modern Monitoring - devopsdays Cuba

Page 1: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Monitorin

g

Page 2: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

lives: Minneapolis,

Minnesota

works: Pivotal

podcasts: Arrested DevOps

organizes: devopsdays

Bridget Kromhout

Page 3: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

“…measuring value, throughput, and performance…

revenue rather than cost”

The Art of Monitoring (2016) James Turnbull

artofmonitoring.com

Page 4: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Why monitor?

The Art of Monitoring (2016) James Turnbull

artofmonitoring.com

Page 5: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Why monitor?Two customers of monitoring with different needs.

The Art of Monitoring (2016) James Turnbull

artofmonitoring.com

Page 6: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Why monitor?

The business:

UX data for product & engineering Measure value delivered

Two customers of monitoring with different needs.

The Art of Monitoring (2016) James Turnbull

artofmonitoring.com

Page 7: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Why monitor?

The business:

UX data for product & engineering Measure value delivered

Information Technology:

Visibility into state and failures Product & engineering decisions

Measure success of projects

Two customers of monitoring with different needs.

The Art of Monitoring (2016) James Turnbull

artofmonitoring.com

Page 8: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Method Type Focus

Manual Checklists, simple scripts

“Tribal knowledge” of things broken in the past

Minimizing downtime, managing assets

Reactive disk, CPU, memory checks

Thresholds, alerting; updated after incidents

Availability, assets, some customer

experience

ProactiveAutomatic; required for deployment

Alerting includes context, automated

remediation

Application performance,

business outcomes

Monitoring Maturity Model

The Art of Monitoring (2016) - James Turnbull - artofmonitoring.com

Page 9: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Monitoring Maturity

Model

The Art of Monitoring (2016) James Turnbull

artofmonitoring.com

Page 10: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Typical reactive notification

The Art of Monitoring (2016) - James Turnbull - artofmonitoring.com

Page 11: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Better notifications

Actionable Provide necessary context

Prevent alert fatigue

The Art of Monitoring (2016) - James Turnbull - artofmonitoring.com

(in the brave new cloudy-with-a-chance-of-containers world)

Page 12: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

“Nothing’s the same anymore.” Jeffrey Sinclair

Babylon 5

Page 13: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

containers: dawn of the third age?

Page 14: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

1979 (Version 7)1982 (BSD)

20042000

chroot FreeBSD jails Solaris Zones LXC

2008

A Brief History of Containers, Part 1 (before docker era)

Page 15: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

A Brief History of Containers, Part 2 (docker common era)

2011

Cloud Foundry

2013

Docker

2014

Rocket (later rkt)

2015

Open Container Initiative

Page 16: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

containing dramas

Page 17: Modern Monitoring - devopsdays Cuba

@bridgetkromhout (autoscaling EC2 instances up & down over time)

ephemeral infrastructure

Page 18: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

“cattle, not pets”

(even adorable

Attack Kittens)

Page 19: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

• cloud-based infrastructure

• static checks or thresholds no longer scale

• manual configuration no longer scales

“cattle, not pets”

Page 20: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

automation: solution? or problem?(it depends)

Page 21: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

architectural considerations

Page 22: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Page 23: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

consistent development repeatable deployment

Why containers?

Page 24: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Monitoring containers

The Art of Monitoring (2016) James Turnbull

artofmonitoring.com

Page 25: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Events - changes in your environment Logs - diagnosis & troubleshooting

Metrics - seeing state in real time for anomaly detection & pattern analysis

Page 26: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

The Art of Monitoring (2016) James Turnbull

artofmonitoring.com

Open source & SaaS

• choose TCP over UDP • configurable granularity • “push” vs “pull”

Page 27: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

“Almost every task run under Borg contains a

built-in HTTP server that publishes information

about the health of the task and thousands of performance metrics”

Large-scale cluster management at Google with Borg - Verma et al. 2015

“Almost every task run under Borg contains a

built-in HTTP server that publishes information

about the health of the task and thousands of performance metrics”

Page 28: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Image credit: James Ernest

Page 29: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Security Pros & ConsContainers limit attack surface & emitters don’t need ports open

but…

Microservices move IPC to network transactions & complexity is distributed

Page 30: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

removed restored

Information radiators

Page 31: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

monitoring: the old way

Page 32: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

monitoring: the new way

Page 33: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Page 34: Modern Monitoring - devopsdays Cuba

@bridgetkromhout

Thanks!