GTP IT Guardian
Production-grade, self-hosted, multi-tenant IT monitoring SaaS platform. Full-stack observability for bare-metal servers, virtual machines, and containerised workloads — all under a single pane of glass with complete tenant isolation.
Five Best-of-Breed Components, One Cohesive Platform
Rather than relying on a single tool, GTP IT Guardian assembles the best open-source components into an integrated, enterprise-grade monitoring stack — each serving a distinct operational role.
Battle-tested alert engine
Active host and service checks (ping, SSH, HTTP, disk, CPU) run on a configurable schedule and raise an alert as soon as a check fails.
Modern time-series metrics backbone
Scrapes hundreds of metrics per second via node_exporter agents, stores them in a high-performance TSDB, and feeds Grafana and the AI engine.
Per-tenant dashboards & visualisation
Each tenant gets their own isolated Grafana organisation with pre-provisioned dashboards scoped to their hosts — they only ever see their own data.
JWT-authenticated orchestration brain
Handles tenant and host lifecycle operations, auto-provisioning Nagios configs, Prometheus file-SD entries, and Grafana organisations on every registration.
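As an illustration of the Prometheus file-SD step, the sketch below renders a target file in Prometheus's JSON file-based service-discovery format, attaching the `tenant` label to each group. The function names and the input mapping are hypothetical, and port 9100 is assumed only because it is node_exporter's default; the control plane's actual code is not shown in this document.

```python
import json


def file_sd_entry(tenant_slug, hosts, port=9100):
    """Build one file-SD target group for a tenant (hypothetical helper)."""
    return {
        "targets": [f"{host}:{port}" for host in hosts],
        # The tenant label lets queries and datasources filter per tenant.
        "labels": {"tenant": tenant_slug},
    }


def render_file_sd(tenants):
    """Render all tenants as the JSON document Prometheus reads via file_sd_configs."""
    groups = [file_sd_entry(slug, hosts) for slug, hosts in sorted(tenants.items())]
    return json.dumps(groups, indent=2)


if __name__ == "__main__":
    # Example input: tenant slug -> registered host addresses (made-up values).
    print(render_file_sd({"acme": ["10.0.0.5", "10.0.0.6"]}))
```

Because Prometheus re-reads file-SD files when they change, writing a file like this on registration is one way new hosts can be picked up without a restart.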
Isolation Forest & capacity forecasting
Builds per-host behavioural models, runs anomaly scoring every 5 minutes, and forecasts disk exhaustion via polynomial regression — catching problems before outages.
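The disk-exhaustion forecast can be pictured with a small NumPy sketch: fit a polynomial to recent disk-usage samples and solve for the point where the fitted curve reaches capacity. The function name, the degree of 2, and the daily sampling are assumptions made for illustration, not details taken from the Guardian AI Engine.

```python
import numpy as np


def days_until_full(days, used_pct, degree=2, capacity=100.0):
    """Estimate days until the fitted usage curve reaches `capacity` percent.

    Returns None when the fitted polynomial never reaches capacity
    after the last observed day.
    """
    coeffs = np.polyfit(days, used_pct, degree)
    # Shift the constant term so the roots mark where usage == capacity.
    shifted = coeffs.copy()
    shifted[-1] -= capacity
    last_day = max(days)
    future = [
        root.real
        for root in np.roots(shifted)
        if abs(root.imag) < 1e-9 and root.real > last_day
    ]
    return min(future) - last_day if future else None


if __name__ == "__main__":
    # Synthetic history: usage accelerating over ten days (made-up values).
    history_days = list(range(10))
    history_pct = [40 + d + 0.1 * d * d for d in history_days]
    print(days_until_full(history_days, history_pct))
```

A higher-degree fit follows short-term curvature more closely but extrapolates less reliably, so a real forecast would likely pair the estimate with a confidence measure.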
Persistent state & rate limiting
PostgreSQL stores tenants, users, hosts, billing records and refresh tokens. Redis provides sliding-window rate limiting and high-speed config caching.
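To show what sliding-window rate limiting means in practice, here is a minimal in-process sketch. It keeps timestamps in a Python deque rather than in Redis, so it is a stand-in for the technique and not the platform's Redis-backed implementation; the class name and parameters are hypothetical.

```python
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most `limit` requests per key within any `window_seconds` span."""

    def __init__(self, limit, window_seconds):
        self.limit = limit
        self.window = window_seconds
        self._hits = defaultdict(deque)  # key -> timestamps of accepted requests

    def allow(self, key, now):
        hits = self._hits[key]
        cutoff = now - self.window
        # Drop timestamps that have slid out of the window.
        while hits and hits[0] <= cutoff:
            hits.popleft()
        if len(hits) >= self.limit:
            return False
        hits.append(now)
        return True


if __name__ == "__main__":
    limiter = SlidingWindowLimiter(limit=2, window_seconds=10)
    for t in (0, 1, 2, 10.5):
        print(t, limiter.allow("tenant-a", now=t))
```

Unlike a fixed-window counter, the window here moves with each request, so a burst straddling a window boundary cannot exceed the limit.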
S3-compatible ML model persistence
Stores serialised Isolation Forest models and StandardScalers so the AI engine retains learned baselines across container restarts and redeployments.
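The following sketch shows how a per-host baseline of this kind might be trained, scored, and serialised with scikit-learn. The metric columns, the synthetic data, the 5-minute sample spacing, and the use of `pickle` are assumptions for illustration; uploading the resulting bytes to the S3-compatible store is left out.

```python
import pickle

import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.preprocessing import StandardScaler


def train_baseline(samples, seed=0):
    """Fit a scaler and an Isolation Forest on one host's metric history."""
    scaler = StandardScaler().fit(samples)
    model = IsolationForest(n_estimators=100, random_state=seed)
    model.fit(scaler.transform(samples))
    return scaler, model


def anomaly_score(scaler, model, sample):
    """Return the raw score; lower values indicate more abnormal samples."""
    return float(model.score_samples(scaler.transform([sample]))[0])


def dump_baseline(scaler, model):
    """Serialise both objects to bytes suitable for object storage."""
    return pickle.dumps({"scaler": scaler, "model": model})


def load_baseline(blob):
    """Restore the scaler and model from previously serialised bytes."""
    bundle = pickle.loads(blob)
    return bundle["scaler"], bundle["model"]


# Synthetic 24 hours of (cpu %, memory %) at an assumed 5-minute spacing.
rng = np.random.default_rng(0)
history = rng.normal(loc=[30.0, 40.0], scale=[5.0, 5.0], size=(288, 2))
scaler, model = train_baseline(history)
restored_scaler, restored_model = load_baseline(dump_baseline(scaler, model))
```

Persisting the scaler together with the model matters because scores are only comparable when new samples are scaled with the same statistics used at training time.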
Full Logical Isolation Across Every Layer
Each customer organisation gets their own isolated dashboards, alerts, and data on a single shared platform — operations teams manage centrally while tenants experience complete isolation.
tenant_id UUID foreign key: TENANT_ADMIN users only query rows scoped to their own tenant
tenant="slug" label: Grafana datasources auto-filter to that label by default
AI-Powered Anomaly Detection & Capacity Forecasting
The Guardian AI Engine runs three continuous ML workflows — detecting behavioural anomalies, forecasting resource exhaustion, and clustering correlated alerts into root-cause events.
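The document does not say how correlated alerts are clustered, so the sketch below shows only one simple possibility: grouping a tenant's alerts into the same event when they arrive within a short time gap of each other. The `Alert` fields, the 120-second gap, and the function name are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Alert:
    tenant: str
    instance: str
    name: str
    ts: float  # seconds since some reference point


def cluster_alerts(alerts, max_gap=120.0):
    """Group each tenant's alerts into events separated by more than `max_gap` seconds."""
    clusters = []
    open_cluster = {}  # tenant -> the cluster currently accepting alerts
    for alert in sorted(alerts, key=lambda a: a.ts):
        current = open_cluster.get(alert.tenant)
        if current and alert.ts - current[-1].ts <= max_gap:
            current.append(alert)
        else:
            current = [alert]
            open_cluster[alert.tenant] = current
            clusters.append(current)
    return clusters


if __name__ == "__main__":
    sample = [
        Alert("acme", "db-1", "DiskFull", 0),
        Alert("acme", "app-1", "HighLatency", 60),
        Alert("globex", "web-1", "HostDown", 30),
        Alert("acme", "db-1", "HighCPU", 500),
    ]
    for event in cluster_alerts(sample):
        print([a.name for a in event])
```

Clustering per tenant keeps one customer's incident from being merged with another's, which matches the isolation model described elsewhere on this page.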
Avg Anomaly Score
Days to Disk Full
Detection Cycle
Forecast Confidence
Comprehensive Host & Infrastructure Alert Rules
Guardian ships with pre-configured Prometheus alert rules and Alertmanager routing for critical deduplication and suppression — preventing alert storms while ensuring no incident goes unnoticed.
alertname + tenant + instance with configurable group wait and repeat intervals
Alert Rules Reference
From Demo to Production in Minutes
Guardian ships as a fully containerised stack supporting both Docker Compose and Kubernetes. A zero-config demo stack with six pre-seeded target systems lets you evaluate the full platform without any prerequisites.
make demo, no credentials required
make ssl-letsencrypt
make onboard-tenant provisions a full tenant in one command
Quick Start: Demo Stack
# Clone and launch the full demo
git clone https://github.com/your-org/gtp-saas-monitoring
cd gtp-saas-monitoring
make demo
Services Available
Transparent Per-Host Pricing with Stripe Integration
Guardian includes built-in multi-tenant billing with Stripe — usage snapshots, invoice generation, and overage tracking all managed through the control plane API.
hosts included
hosts included
hosts included
hosts (configurable)
What Makes GTP IT Guardian Unique
Complete data isolation at every layer — PostgreSQL, Prometheus, Grafana, and Nagios — with automatic provisioning on tenant creation.
Isolation Forest models built per host from 24 hours of Prometheus data — scores every 5 minutes and fires webhooks on threshold breach.
JWT short-lived tokens, rotating refresh tokens, RBAC with four roles, Redis sliding-window rate limiting, and full HTTPS with HSTS.
Stripe integration with per-host pricing, overage tracking, usage snapshots, and invoice generation — all managed through the REST API.
Full Kubernetes manifests with HPA auto-scaling — deploy to EKS, GKE, AKS, or self-managed clusters with cert-manager TLS.
Host registration triggers Nagios config, Prometheus file-SD, and Grafana provisioning simultaneously — monitoring begins within 60 seconds, no restarts required.
Slack, Microsoft Teams, and SMTP email — configured via environment variables with Alertmanager routing, deduplication, and silence management.
Nightly cron-scheduled backups of PostgreSQL, Grafana, Prometheus rules, and Nagios config — with documented Docker volume recovery for full disaster recovery.
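The short-lived JWT access tokens mentioned in the list above can be illustrated with a standard-library sketch of HS256 signing and expiry checking. The 15-minute lifetime, the claim names beyond `exp`, and the helper names are assumptions; a production service would normally rely on a maintained JWT library rather than hand-rolled code like this.

```python
import base64
import hashlib
import hmac
import json


def _b64(data):
    """Base64url-encode bytes without padding, as JWTs require."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()


def _unb64(text):
    """Decode unpadded base64url text back to bytes."""
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def _sign(secret, signing_input):
    return _b64(hmac.new(secret, signing_input.encode(), hashlib.sha256).digest())


def issue_token(secret, sub, role, now, ttl=900):
    """Issue an HS256 token that expires `ttl` seconds after `now`."""
    header = _b64(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    payload = _b64(json.dumps({"sub": sub, "role": role, "exp": now + ttl}).encode())
    return f"{header}.{payload}.{_sign(secret, header + '.' + payload)}"


def verify_token(secret, token, now):
    """Return the claims if the signature is valid and the token is unexpired, else None."""
    try:
        header, payload, signature = token.split(".")
    except ValueError:
        return None
    if not hmac.compare_digest(signature, _sign(secret, header + "." + payload)):
        return None
    claims = json.loads(_unb64(payload))
    return claims if claims["exp"] > now else None


if __name__ == "__main__":
    token = issue_token(b"demo-secret", "user-1", "TENANT_ADMIN", now=1000)
    print(verify_token(b"demo-secret", token, now=1500))
    print(verify_token(b"demo-secret", token, now=2000))
```

Keeping access tokens short-lived limits how long a leaked token is useful, which is why they are typically paired with rotating refresh tokens.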
Platform Architecture
All external traffic enters through Nginx (TLS 1.2/1.3, HSTS, rate limiting) and routes to the FastAPI control plane, Grafana, and Nagios. Prometheus scrapes every registered host via file service-discovery.
Tenant & Host Orchestration
Pre-provisioned Dashboards
Alert State Engine
Deploy Enterprise IT Monitoring for Your Clients Today
Schedule a demo to see GTP IT Guardian in action — from the zero-config demo stack through to full Kubernetes production deployment with multi-tenant billing and AI anomaly detection.
Schedule a Demo