Operational status Virtual Tenants
What is the Ops status for Virtual Tenants
Hint
TrackMe Virtual Tenants are themselves continuously monitored to report any kind of failures that would be encountered by any of the application trackers.
TrackMe trackers can fail when technical issues are encountered, this can happen for various reasons, such as:
Overloaded environments, where there are not search slots available (although TrackMe has a concept of automated re-attempts)
Search Head is in detention (overused file-systems, etc)
Network connectivity issues, credentials issues
Corrupted configuration (ex: missing Knowledge Objects or incorrect permissions)
And many more
With the goal of providing the highest visibility and quality, every Tracker and related custom commands carefully report failures to a central component called the components register.
In addition, when a Virtual Tenant is created, a Health Tracker is created which investigates the component register results, reports and generates Health events continuously.
You can easily observe the operation status of TrackMe Virtual Tenants in the Virtual Tenants user interface:
Click on the chart to open the detailed view:
Click on “Open in search” to access detailed per tracker results in the Splunk Search user interface:
Health events are stored in the audit indexe(s) of the Virtual Tenant(s), you can search for Health failures events easily, such as:
index=trackme_audit sourcetype="trackme:health" last_status="failure"
| timechart count by tenant_id
Example of failure detection
In this example, a remote Splunk Deployment is encountering issues due to a file-system being filled over the accepted limit by Splunk, therefore this instance does not accept searches any longer.
We can observe the Vetants Ops status reporting issues currently, as well as which Virtual Tenants are affected currently:
Accessing the detailed Ops status view quickly shows the affected tenants, related accounts and technical root cause:
We can access the live status from REST:
Clearing the Virtual Tenants Operational Status
From TrackMe 2.0.97, you can clear the Virtual Tenants Operational Status via the Virtual Tenants UI and from the REST API:
Access to the screen TrackMe Tenants Operational health statuses:
You can then request to clear for all tenants, or a selection of tenants and you can also choose to execute or not the health tracker during the clear process:
Underneath, the UI calls the REST endpoint to clear the Virtual Tenants Operational Status:
Example:
| trackme url="/services/trackme/v2/vtenants/admin/clear_exec_summary" mode="post" body="{'tenants': 'all'}"
Additional resources
An additional (UFS) dashboard is available at:
Navigation Bar / Audit & Troubleshoot / Audit - Operational Statuses
Virtual Tenants Ops status technical overview
When a Virtual Tenant is created, a Health tracker is created and executed every 5 minutes:
trackme_health_tracker_tenant_<tenant_id>
This tracker executes the following custom command:
| trackmetrackerhealth tenant_id="<tenant_id>"
This custom command performs various operations and relies on the REST endpoint:
| trackme mode=post url="/services/trackme/v2/configuration/get_tenant_ops_status"
TrackMe components, such as Hybrid Trackers and their related custom commands call the components register at the backend level, and these states are stored in a Python dictionanry in the central store KVstore collection:
| inputlookup trackme_virtual_tenants | eval keyid=_key
| fields tenant_id, tenant_objects_exec_summary