Monitoring a system composed of multiple services typically means either monitoring each service individually through its logs, or monitoring end to end, which gives no visibility into the performance characteristics of each individual service. Root cause analysis then usually rests on the instinct and past experience of operations personnel, which makes automated remediation next to impossible for many use cases.
With thatDot's streaming graph, logs and events from servers, operating systems, databases, applications, and clients are ingested in real time and assembled into a graph data model. The graph data model natively connects events with unlimited categorical classifications and calculated metrics to identify "alerts that matter" and instantly associate them with servers, VMs, containers, code versions, subnets, and so on. This comprehensive, real-time view of the relationships between services allows rapid assessment of root causes for operations investigations or automated remediation workflows.
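To make the graph data model concrete, here is a minimal sketch in plain Python. It does not use Quine or any thatDot API. The log line format, the node and edge names (`EMITTED_BY`, `OCCURRED_ON`), and the sample data are all assumptions chosen for illustration. It shows the general idea only: each log line becomes an event node linked to the service and host it came from, so an alert can be traced to infrastructure by following edges.

```python
import re
from collections import defaultdict

# Assumed (hypothetical) log format: "<timestamp> <host> <service> <LEVEL> <message>"
LOG_PATTERN = re.compile(
    r"^(?P<ts>\S+) (?P<host>\S+) (?P<service>\S+) (?P<level>[A-Z]+) (?P<message>.*)$"
)

# Made-up sample lines, including one that does not fit the format.
SAMPLE_LINES = [
    "2024-01-01T00:00:00Z web-1 checkout INFO request ok",
    "2024-01-01T00:00:01Z web-1 checkout ERROR db timeout",
    "2024-01-01T00:00:02Z web-2 payments ERROR db timeout",
    "2024-01-01T00:00:03Z web-1 payments ERROR upstream 502",
    "malformed",
]


def build_graph(lines):
    """Return (nodes, edges): nodes maps node id -> properties,
    edges is a set of (source id, label, target id) triples."""
    nodes = {}
    edges = set()
    for index, line in enumerate(lines):
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # skip lines that do not fit the assumed format
        fields = match.groupdict()
        event_id = f"event:{index}"
        host_id = f"host:{fields['host']}"
        service_id = f"service:{fields['service']}"
        nodes[event_id] = {
            "type": "event",
            "ts": fields["ts"],
            "level": fields["level"],
            "message": fields["message"],
        }
        # Host and service nodes are shared by every event that mentions them.
        nodes.setdefault(host_id, {"type": "host", "name": fields["host"]})
        nodes.setdefault(service_id, {"type": "service", "name": fields["service"]})
        edges.add((event_id, "EMITTED_BY", service_id))
        edges.add((event_id, "OCCURRED_ON", host_id))
    return nodes, edges


def errors_by_host(nodes, edges):
    """Count ERROR events per host by following OCCURRED_ON edges."""
    counts = defaultdict(int)
    for source, label, target in edges:
        if label == "OCCURRED_ON" and nodes[source]["level"] == "ERROR":
            counts[target] += 1
    return dict(counts)


if __name__ == "__main__":
    graph_nodes, graph_edges = build_graph(SAMPLE_LINES)
    print(errors_by_host(graph_nodes, graph_edges))
```

A streaming graph would apply the same linking step continuously as events arrive rather than over a fixed list. The batch version above is only meant to show how shared host and service nodes tie otherwise separate log lines together.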
This Quine log analysis recipe can help you get started. It shows how to ingest your own application logs and model them as a streaming graph for real-time alerts and root cause diagnosis.