Informed and Assessable Observability Design Decisions in Cloud-native Microservice Applications

Borges, Maria C.; Bauer, Joshua; Werner, Sebastian; Gebauer, Michael; Tai, Stefan

doi:10.1109/ICSA59870.2024.00015

arXiv:2403.00633 (cs)

[Submitted on 1 Mar 2024 (v1), last revised 12 Jul 2024 (this version, v2)]

Authors: Maria C. Borges, Joshua Bauer, Sebastian Werner, Michael Gebauer, Stefan Tai

Abstract: Observability is important to ensure the reliability of microservice applications. These applications are often prone to failures, since they consist of many independent services deployed on heterogeneous environments. When employed "correctly", observability can help developers identify and troubleshoot faults quickly. However, instrumenting and configuring the observability of a microservice application is not trivial: it is tool-dependent and tied to costs. Architects need to understand observability-related trade-offs in order to weigh different observability design alternatives. Still, these architectural design decisions are not supported by systematic methods and typically rely only on "professional intuition". In this paper, we argue for a systematic method to arrive at informed and continuously assessable observability design decisions. Specifically, we focus on fault observability of cloud-native microservice applications, and turn it into a testable and quantifiable property. Towards this goal, we first model the scale and scope of observability design decisions across the cloud-native stack. Then, we propose observability metrics that can be determined for any microservice application through so-called observability experiments. We present a proof-of-concept implementation of our experiment tool, OXN. OXN can inject arbitrary faults into an application, similar to Chaos Engineering, but it also has the unique capability to modify the observability configuration, which allows the assessment of design decisions that were previously left unexplored. We demonstrate our approach using a popular open source microservice application and show the trade-offs involved in different observability design decisions.

Comments:	21st IEEE International Conference on Software Architecture (ICSA'24)
Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2403.00633 [cs.SE]
	(or arXiv:2403.00633v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2403.00633
Related DOI:	https://doi.org/10.1109/ICSA59870.2024.00015

Submission history

From: Maria C. Borges
[v1] Fri, 1 Mar 2024 16:12:20 UTC (1,193 KB)
[v2] Fri, 12 Jul 2024 18:50:12 UTC (1,193 KB)
