Serverless: The Missing Manual

#WISSENTEILEN
Serverless
Lars Röwekamp | open knowledge GmbH
@_openKnowledge | @mobileLarson
The Missing Manual

ABOUT OPEN KNOWLEDGE
Industry-neutral software development & IT consulting

ABOUT ME
Who am I - and if so, how many?
• CIO New Technologies
• Enterprise & Mobile
• Author, speaker, coach & mentor
• Snowboard & MTB enthusiast (a.k.a. "always trying hard")
Lars Röwekamp (a.k.a. @mobileLarson)

What is the idea behind Serverless?

Run code, not servers
Serverless function: The developer writes a business function, bundles it with its dependencies (libraries), and uploads it to the cloud.
Serverless environment: Executes the function on invocation in the matching runtime, efficiently, flexibly, and in a highly scalable way.

No machines, VMs or containers*
Developer: Focuses exclusively on implementing the business logic and building the function bundle.
Cloud provider: Delivers and maintains a worry-free environment for the serverless functions, including any cloud services they use (e.g. storage, DB, streaming, AI).

Hands-on: Hello World
[Figure: AWS Cloud, "hello world serverless context". (1) A request triggers the HelloWorld function; (2) the function writes to Logs.]
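
As a minimal sketch of the hello-world flow (a request triggers a function, the function writes to its logs), a Python handler could look like the following. The `name` field in the event is an assumption made for illustration; the `handler(event, context)` shape is the usual entry point for Python functions on AWS Lambda.

```python
import json


def handler(event, context):
    """Entry point invoked by the serverless runtime for each trigger."""
    # "name" is a hypothetical event field used only for this illustration.
    name = (event or {}).get("name", "World")
    message = f"Hello {name}!"
    # Output written to stdout ends up in the function's logs.
    print(json.dumps({"message": message}))
    return {"message": message}
```

Calling `handler({"name": "Lars"}, None)` locally returns the same dictionary the runtime would hand back to the caller.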

“Run your business code highly-available in the cloud in response to events and scale without any servers to manage.”*
* AWS Lambda advertising

#2: No servers to provision or manage

#3: Built-in high availability and disaster recovery

#4: Scale with usage by design

Management: “Hmm, I'm not convinced yet!”

#5: Never pay for idle
(Management: “OK, I'm definitely in!”)

• AWS Lambda
• Microsoft Azure Functions
• Oracle Functions a.k.a. Project FN***
• IBM Cloud Functions a.k.a. Apache OpenWhisk**
• Google Cloud Functions
• Project Riff, sponsored by Pivotal

Scenario #1: File/data processing
Processing files or data after they have been placed in the storage system
• Image processing
• Thumbnail generation
• PDF generation

[Figure: AWS Cloud, steps 1 to 5. An image upload triggers the StoreImage function ("Store raw Image", Logs); storing the image raises an "S3 Object created" event, which triggers the CreateThumbnail function ("Create Thumbnail", Logs).]
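
The second half of this scenario (an "S3 Object created" event triggering thumbnail creation) can be sketched as follows. This is an illustration under assumptions, not the talk's implementation: the `thumbnails/` prefix and the returned plan structure are hypothetical, and the actual resizing and upload are left out. The record layout (`Records[].s3.bucket.name`, `Records[].s3.object.key`) follows the S3 event notification format.

```python
from urllib.parse import unquote_plus

THUMBNAIL_PREFIX = "thumbnails/"  # hypothetical target prefix


def uploaded_objects(event):
    """Yield (bucket, key) for every record of an S3 notification event."""
    for record in event.get("Records", []):
        s3 = record["s3"]
        # Object keys arrive URL-encoded in S3 event notifications.
        yield s3["bucket"]["name"], unquote_plus(s3["object"]["key"])


def thumbnail_key(key):
    """Derive the thumbnail's key from the file name of the raw image."""
    return THUMBNAIL_PREFIX + key.rsplit("/", 1)[-1]


def handler(event, context):
    # Only plans the work; resizing and uploading are omitted in this sketch.
    return [
        {"bucket": bucket, "source": key, "thumbnail": thumbnail_key(key)}
        for bucket, key in uploaded_objects(event)
    ]
```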

Scenario #3: Stream processing
Regular processing of streaming data
• Social media trend analysis
• Sensor data monitoring / anomaly detection

[Figure: AWS Cloud, data stream analysis.
1 Tons of very important sensor data: the sensor data stream is uploaded to Kinesis in real time.
2 Lambda (StreamAnalyzer, Logs) runs code to detect anomalies.
3 The anomalies extracted by the Lambda function are stored.
4 Real-time monitoring / querying: the data is immediately available for interested parties to query.]
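
Step 2 of this scenario (Lambda runs code to detect anomalies on a Kinesis stream) might be sketched like this. The threshold, the `sensor`/`value` payload fields, and the return format are assumptions for illustration only; Kinesis does deliver each record's payload base64-encoded under `Records[].kinesis.data`.

```python
import base64
import json

THRESHOLD = 80.0  # hypothetical limit above which a reading counts as an anomaly


def decode_records(event):
    """Yield the JSON payload of every Kinesis record in the event."""
    for record in event.get("Records", []):
        # Kinesis delivers the record payload base64-encoded.
        payload = base64.b64decode(record["kinesis"]["data"])
        yield json.loads(payload)


def handler(event, context):
    # A deliberately simple rule: every reading above the threshold is an anomaly.
    anomalies = [r for r in decode_records(event) if r["value"] > THRESHOLD]
    # Storing the anomalies (step 3) is left out of this sketch.
    return {"anomalies": anomalies}
```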

Scenario #4: Web application
Going "all in" on serverless within one application…
• Delivering static content via CDN
• Authentication / authorization via BaaS
• Business logic via FaaS (using PaaS)

[Figure: AWS Cloud, web client.
1 Region-aware web app delivery.
2 Login via id/pwd returns a JWT.
3 REST call.
4 The REST call is translated into a Lambda trigger.
5 Lambda @work: storage-related functions and database-related functions.
6 Additional functions.]

The Road to the Cloud ...
The Serverless Showcase

Web Image Gallery
(easy version)
GET ../images/{imageId}
PUT ../images/{imageId}
DELETE ../images/{imageId}
POST ../images/

Web Image Gallery
(not so easy version)
POST ../images/

Web Image Gallery
(real life version)
POST ../images/

Use-Case: Upload Image
[Figure: AWS Cloud. An image is uploaded with additional information. (1) Store raw Image; (2) Store Image Information. Both steps are then combined into the AWS Step Functions workflow "Store Image", followed by Create Thumbnail and Inform Subscribers.]

“What could possibly go wrong?”


“Run your business code highly-available in the cloud in response to events and scale without any servers to manage.”*
*(AWS Lambda product description)

“Run your business code highly distributed and event-driven in a non-transparent environment with no single point of control.”*
*(my personal interpretation)

How do I test my serverless application?

What, when, how, and where should I test in order to …
• gain confidence in my code
• minimize the risk of failures*
* especially in production

Testing in the traditional world

Testing in the serverless world
“The biggest complexity is not within the function itself, but in how it interacts with other functions and services (a.k.a. cloud components).”

Testing in the serverless world
Goals of testing: "minimize risk"
• Configuration risk
• Technical workflow risk
• Business logic risk
• Integration risk

“Don't let your users test your code!”

Testing Best Practices
“What kind of 'benchmarks' do we want for our testing?”
• test functional changes quickly and cost-efficiently
• test integration changes quickly and cost-efficiently
• test integration changes as "realistically" as possible
• test use cases and user stories as "realistically" as possible

#1 Separate business logic from infrastructure
[Figure: On-Premise vs. AWS Cloud, with the function split into handler and logic. Test markers: u = candidate for unit tests, i = candidate for integration tests, e = candidate for end-to-end tests.]
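
A minimal sketch of this separation, assuming a hypothetical greeting use case: the business logic lives in a plain function without any cloud dependency, and the handler is only a thin adapter around it. Function and field names are illustrative.

```python
def greeting_for(name):
    """Pure business logic: no cloud imports, so it is trivially unit-testable."""
    cleaned = (name or "").strip()
    return f"Hello {cleaned}!" if cleaned else "Hello stranger!"


def handler(event, context):
    """Thin adapter: maps the trigger event to the logic and back."""
    return {"statusCode": 200, "body": greeting_for(event.get("name"))}
```

With this split, `greeting_for` can be covered by fast unit tests, while the handler and its event mapping are left to integration tests.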

#2 Mock cloud infrastructure components
[Figure: On-Premise vs. AWS Cloud; handler and logic are unit-tested (u), partly against mocked cloud components (um).]
• fake infrastructure component (Context)
• mock infrastructure component (Context)
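
The difference between a fake and a mock of the Lambda context can be illustrated as follows. This is a sketch under assumptions: the handler and its 500 ms rule are hypothetical, while `aws_request_id`, `function_name`, and `get_remaining_time_in_millis()` are members of the context object that the Python Lambda runtime provides.

```python
from unittest.mock import Mock


class FakeContext:
    """Hand-written fake: fixed, canned values for what a handler may read."""

    function_name = "Greetings"
    aws_request_id = "test-request-1"

    def get_remaining_time_in_millis(self):
        return 3000


def handler(event, context):
    # Hypothetical handler that skips its work when it is about to time out.
    if context.get_remaining_time_in_millis() < 500:
        return {"status": "skipped", "request": context.aws_request_id}
    return {"status": "done", "request": context.aws_request_id}


# Mock: configured per test, and it records how it was called.
mock_context = Mock()
mock_context.aws_request_id = "test-request-2"
mock_context.get_remaining_time_in_millis.return_value = 100
```

The fake keeps tests readable when the same canned context is enough everywhere; the mock is handy when a test needs to vary a value or assert that a method was actually called.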

#3 Use a local environment for functional tests (e.g. SAM local)
[Figure: On-Premise vs. AWS Cloud; handler and logic run on-premise via SAM local, driven by the SAM YAML template and a TEST event.]

$ sam local invoke "Greetings" -e event-greeting.json --env-vars env.json
("Greetings" is the function name; event-greeting.json is the payload for the function)

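#4 Use a local environment to trigger integration tests
[Figure: On-Premise vs. AWS Cloud; the handler runs on-premise via SAM local (SAM YAML template, TEST event) and triggers integration tests (i) against components in the AWS Cloud.]

$ sam local start-api -p 8080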

#5 Local cloud components for integration tests*
[Figure: On-Premise vs. AWS Cloud; handler and logic run via SAM local, with DynamoDB local and FakeS3 standing in for the cloud components during integration tests (i).]
* WARNING: local cloud components can only ensure functional correctness, not infrastructure behavior such as DLQs, timeouts, throttling, SLAs, …

Simulate a component event to trigger Lambda
$ sam local generate-event [SERVICE] [OPTION]

Simulate a component triggered by Lambda
$ java -jar DynamoDBLocal.jar
$ aws --endpoint-url=http://localhost:8000 dynamodb list-tables
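
The same endpoint override used on the command line can be applied in function code, so that the identical code talks to DynamoDB local during tests and to the real service otherwise. The sketch below is an assumption-laden illustration: the `DYNAMODB_ENDPOINT` variable name and the default region are hypothetical, and the resulting dictionary is meant to be passed on as keyword arguments, e.g. to `boto3.client("dynamodb", **kwargs)` (boto3 itself is not imported here).

```python
import os


def dynamodb_client_kwargs(env=None):
    """Build client settings; point at a local endpoint only when configured."""
    env = os.environ if env is None else env
    # Default region is an arbitrary choice for this sketch.
    kwargs = {"region_name": env.get("AWS_REGION", "eu-central-1")}
    # DYNAMODB_ENDPOINT is a hypothetical variable name, e.g. http://localhost:8000
    endpoint = env.get("DYNAMODB_ENDPOINT")
    if endpoint:
        kwargs["endpoint_url"] = endpoint
    return kwargs
```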

#6 Temporary integration cloud for partial integration tests
[Figure: On-Premise vs. AWS Cloud; in addition to the local setup (SAM local, DynamoDB local, FakeS3), a temporary integration environment "#Dev1" (INT) in the AWS Cloud hosts the integration tests (i).]

#7 Permanent integration cloud for end-to-end tests
[Figure: On-Premise vs. AWS Cloud; in addition to the local setup (SAM local, DynamoDB local, FakeS3), a permanent integration environment (INT) in the AWS Cloud hosts the end-to-end tests (e).]

“Are we finally done?”

Testing does not end in production!

Testing in production
Goals of testing: "gain confidence"
• Outages of the cloud & cloud components
• Outages of 3rd-party apps
• Bugs / problems caused by scaling

Robust monitoring and error reporting
• Logging
• Tracing
• Metrics
• Alerting
Predicting disruptions, including automatic recovery!

Chaos Engineering
• deliberately inject small "problems" and "failures" into the system!
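
One simple way to inject small failures deliberately is a decorator that makes a function fail at a configurable rate. This is a sketch of the idea only, not a specific chaos engineering tool; `inject_failures`, `InjectedFailure`, and the `rng` parameter are hypothetical names.

```python
import functools
import random


class InjectedFailure(RuntimeError):
    """Marks errors that were injected on purpose."""


def inject_failures(rate, rng=random.random):
    """Make the decorated function fail with probability `rate` (0.0 to 1.0)."""

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # rng is injectable so that tests can make the outcome deterministic.
            if rng() < rate:
                raise InjectedFailure(f"chaos: injected failure in {func.__name__}")
            return func(*args, **kwargs)

        return wrapper

    return decorator
```

Applied as `@inject_failures(rate=0.05)` to a function that calls a downstream service, it lets a team observe whether retries, alarms, and fallbacks behave as expected.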

How do I monitor my serverless application?

Real-Life Monitoring
With well-planned monitoring, we should be able to …
• predict emerging problems
• quickly identify the root cause of problems
• kick off automatic recovery processes
• trigger the necessary alarms

• Business KPI: "Products per order", "Average order value", "Abandonment rate"
• UX: "First contentful paint", "First meaningful paint", "First interaction"
• SLA: "Availability", "Latency", "Durability", "Consistency"

Well-planned monitoring takes several aspects into account
• reliability: components and communication
• usage: functional and non-functional
• performance: duration, latency, and timeouts
• security: access rights, attacks
• costs: current costs, cost trend

The 4 pillars of monitoring
1 Logging
2 Tracing
3 Metrics
4 Alerting

1 Logging
Represents the state of an application. When something goes wrong, we need logs to find out which state changes caused the failure.

2 Tracing
Represents a single "user's journey" through the application's entire stack. Tracing is often used to optimize the system.

3 Metrics
Represents a data point aggregated over a period of time. Helps determine the system's current "health status" and how it develops over time.

4 Alerting
The monitoring component that triggers actions based on metrics. Mostly used for automatic "self-healing" or to notify the responsible people.

Serverless Application Monitoring
For well-planned monitoring, we should therefore …
• log events that trigger a state transformation
• collect standard metrics
• define and collect custom metrics
• enable distributed tracing
• define alarms at both the individual and the aggregated level

Monitoring strategy: platform services
[Figure: the four pillars (Logging, Tracing, Metrics, Alerting) mapped onto AWS Cloud platform services. Annotations: "BASIC METRICS FOR FREE", "BASIC ALERTING FOR FREE", an alarm raised from metrics, and "Tracing (still DIY)".]

Monitoring strategy #2: platform services (Logging, Tracing, Metrics, Alerting)

Monitoring Best Practices
“What kind of 'benchmarks' do we want for our monitoring?”
• collect extensive system and application metrics
• metrics and logs should not cause user-facing latency
• metrics and logs should be available in real time
• metrics and logs should be granular and correlated

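#1 Avoid user-facing latency
[Figure: AWS Cloud. (1) My Lambda writes log data synchronously to its log stream, which is very fast and cheap; (2) the log data is forwarded asynchronously to a Log Aggregator; (3) the aggregator parses the log stream, which is time-consuming and "expensive".]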

#2 Collect extensive system/application metrics
[Figure: AWS Cloud, same pipeline. My Lambda writes custom metrics into its logs (very fast and cheap); the Log Aggregator parses the log stream asynchronously and turns it into metrics and custom metrics.]
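
The idea of writing custom metrics as plain log lines and extracting them asynchronously can be sketched as follows. The `METRIC|name|value|unit` line format is purely an assumption for illustration; the function only formats a string on the request path, and the more expensive parsing would run in the log aggregator.

```python
import re

# Hypothetical line format: METRIC|<name>|<value>|<unit>
METRIC_PATTERN = re.compile(
    r"^METRIC\|(?P<name>[^|]+)\|(?P<value>[^|]+)\|(?P<unit>[^|]+)$"
)


def metric_line(name, value, unit):
    """Cheap, synchronous part: format a metric as a log line in the function."""
    return f"METRIC|{name}|{value}|{unit}"


def parse_metrics(log_lines):
    """Asynchronous part: extract metrics from a log stream in the aggregator."""
    for line in log_lines:
        match = METRIC_PATTERN.match(line.strip())
        if match:
            yield match["name"], float(match["value"]), match["unit"]
```

In the function itself, a call such as `print(metric_line("thumbnail.duration", 42.5, "ms"))` would be enough to publish a measurement.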

#3 Avoid unnecessary costs
[Figure: AWS Cloud, same pipeline. Once the Log Aggregator has extracted metrics and custom metrics, the log data is moved to an archive ("archive logs").]

#4 Correlate / aggregate logs and metrics
[Figure: AWS Cloud, same pipeline. A correlation ID ties together the log data, metrics, and custom metrics.]
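
A correlation ID is typically taken from the incoming request when present and generated otherwise, then attached to every log line. The following is a minimal sketch; the `x-correlation-id` header name and the JSON field names are assumptions, not a fixed convention.

```python
import json
import uuid


def correlation_id_from(event):
    """Reuse the caller's correlation ID if present, else start a new one."""
    headers = event.get("headers") or {}
    # "x-correlation-id" is a hypothetical header name.
    return headers.get("x-correlation-id") or str(uuid.uuid4())


def log_line(correlation_id, message, **fields):
    """Structured log line that an aggregator can group by correlationId."""
    entry = {"correlationId": correlation_id, "message": message, **fields}
    return json.dumps(entry, sort_keys=True)
```

Passing the same ID on to downstream functions and services is what lets logs and metrics from different components be joined later.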

#5 Enable/disable logging via env vars at the edge server
[Figure: AWS Cloud, same pipeline. An ENV var switches DEBUG logging on/off.]
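
Switching debug logging on and off through an environment variable could look like the sketch below. The variable name `DEBUG` follows the slide; the accepted values (`on`, `true`, `1`) and the logger name are assumptions.

```python
import logging
import os


def configured_level(env=None):
    """Return DEBUG when the DEBUG env var is switched on, else INFO."""
    env = os.environ if env is None else env
    flag = env.get("DEBUG", "off").strip().lower()
    return logging.DEBUG if flag in ("on", "true", "1") else logging.INFO


# Evaluated once at import time, i.e. when the function instance starts.
logger = logging.getLogger("my_lambda")  # hypothetical logger name
logger.setLevel(configured_level())
```

Because the level is read when the function instance starts, changing the variable takes effect for newly started instances without touching the code.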

Conclusion: Having fun with Serverless?

“Find suitable
serverless workload
and apply the correct
integration patterns.”

Lars Röwekamp, @mobileLarson
Contact:
lars.roewekamp@openknowledge.de
kontakt@openknowledge.de
Thank you very much! #WISSENTEILEN
