Monitoring & alerts

Network Monitoring & Alerts

Real-time monitoring and alerting for the routers, APs, and CPE an ISP actually runs — device up/down, CPU, memory, temperature, interface health, and metric thresholds — pulled across TR-069, SNMP, RouterOS, and controllers and converged into one device timeline.

See SNMP integration
Operator console

Subscribers, devices, and access in one view

subscribers
4.2k
sites live
3
critical fails
0
LiveSubscriber + device identityMapped
ActionGuarded command previewApproved
AuditEvery change loggedTracked
One operator surface ties each subscriber to their devices, access, and history.
Page type
Solutions
Primary search
ISP network monitoring software
Updated
2026-06-09

Monitor the network you actually run

A small or mid-sized ISP runs a mix of subscriber CPE, managed edge routers, APs, switches, and backhaul — usually across two or three protocols at once. The problem is rarely a lack of graphs. It is that device status lives in one tool, SNMP counters in another, RouterOS reads in a third, and the controller in a fourth, so nobody can answer the only question that matters when the phone rings: who is affected, and what changed.

ISPAgents monitors all of it in one place. Metrics from TR-069 / CWMP diagnostics, SNMP polling, RouterOS REST probes, and controller APIs are normalized into the same model and converged into a single tenant and device timeline — so monitoring, alerts, syslog, and audit sit on one surface rather than four disconnected dashboards.

One device timeline, many sources

ISPAgents is ISP-aware infrastructure observation, not an attempt to beat a generic NMS at its own game. Multi-source metrics are normalized with source overlays, so a chart can show the same metric from more than one path and label exactly where each reading came from — and how fresh it is.

SourceWhat it contributes
TR-069 / CWMP diagnosticsCPE-side reads, host visibility, and diagnostics returned by the device's own data model.
SNMP pollingInterface counters, system resources, and infrastructure/managed-edge observation.
RouterOS REST probesNative MikroTik reads for queues, interfaces, and system resources.
Controller APIsDevice and telemetry rollups pulled from external controllers into the same model.

Source freshness is shown honestly: a stale reading is labeled stale rather than presented as live, so an operator never mistakes an old value for current state.

The event engine

A real-time event engine watches device and metric state and raises events when something crosses a line. It ships ten device event types today, with deduplication and throttling so a flapping router does not bury a real outage, and multi-channel delivery so the right people see it.

Event typeFires when
OnlineA device comes back and is confirmed reachable.
OfflineA device stops reporting and is confirmed down.
CPU spikeCPU crosses the threshold momentarily.
CPU sustainedCPU stays above the threshold over time.
Memory spikeMemory crosses the threshold momentarily.
Memory sustainedMemory stays above the threshold over time.
Temperature spikeTemperature crosses the threshold momentarily.
Temperature sustainedTemperature stays above the threshold over time.
Daily digestA scheduled summary of the day's device events.
License warningA device or platform license needs attention.

Delivery is multi-channel — email and webhooks — so events can reach an inbox, a chat channel, or an existing ticketing or on-call system without bolting on a separate alerting product.

Threshold rules that inherit

Thresholds are not set device by device. Metric threshold rules inherit down a hierarchy — catalog to group to device — so a policy defined once applies across a fleet, and an override is only needed where a specific device genuinely differs. Breach state persists, so a rule that is currently violated stays visible as an active condition rather than flickering on and off with each poll.

  • Define a rule once at the catalog level and let it inherit to every group and device beneath it.
  • Override at the group or device level only where the equipment warrants it.
  • Persisted breach state means an ongoing problem reads as ongoing, not as a stream of disconnected blips.

Charts built for operators

Metrics land in chart workspaces an operator can arrange and keep — private to one user or shared across a team — so the views a NOC relies on during an incident are saved, not rebuilt each time. Because every source feeds the same device timeline, a chart sits next to the syslog and audit context for the same device, and an operator can move from "this metric looks wrong" to "here is what changed" without leaving the page.

Coexist, then converge

Start by connecting the sources you already have in read-only mode — SNMP, RouterOS, TR-069, or a controller — and watch the same devices appear in one timeline with nothing switched off. Turn on the event types your team cares about, set threshold rules at the catalog level, and route alerts to email and webhooks. Keep your existing tooling in place and let ISPAgents become the single pane as the evidence proves clean.

Pair monitoring with the SNMP integration for infrastructure and managed-edge polling, add syslog monitoring for event context on the same timeline, and connect MikroTik management so the routers you watch are also the routers you can back up and act on — all mapped to one canonical subscriber and device identity.

Next step

See how this works in your network.