ServiceWatch Metrics
Event Streams sends metrics to ServiceWatch. Default monitoring collects these metrics at 1-minute intervals.
Basic Metrics
The following are the basic metrics for the Event Streams namespace.
Metrics whose names are shown in bold below are the key metrics selected from the default metrics that Event Streams provides. Key metrics are used to build the service dashboard that ServiceWatch creates automatically for each service.
For each metric, the user guide indicates which statistics are meaningful when querying it. Among the meaningful statistics, those shown in bold are the primary statistics. The service dashboard displays key metrics using their primary statistics.
OS Basic Metrics
| Category | Performance item | Unit | Detailed description | Meaningful statistics |
|---|---|---|---|---|
| CPU | CPU Usage | Percent | CPU usage | |
| Disk | Disk Usage | Percent | Disk usage | |
| Disk | Disk Write Bytes | Bytes/Second | Write throughput (bytes/second) to the block device | |
| Disk | Disk Read Bytes | Bytes/Second | Read throughput (bytes/second) from the block device | |
| Disk | Disk Write Requests | Count/Second | Number of write requests to the block device (requests per second) | |
| Disk | Disk Read Requests | Count/Second | Number of read requests to the block device (requests per second) | |
| Disk | Average Disk I/O Queue Size | None | Average queue length of requests issued to the block device | |
| Disk | Disk I/O Utilization | Percent | Proportion of time the block device actually spent handling I/O operations | |
| Memory | Memory Usage | Percent | Memory usage | |
| Network | Network In Bytes | Bytes/Second | Received data rate (bytes/second) on the network interface | |
| Network | Network Out Bytes | Bytes/Second | Transmitted data rate (bytes/second) on the network interface | |
| Network | TCP Connections | Count/Second | Current total number of successfully established TCP connections | |
| Network | Network In Packets | Count | Number of packets received on the network interface | |
| Network | Network Out Packets | Count | Number of packets transmitted on the network interface | |
| Network | Network In Dropped | Count | Number of received packets dropped on the network interface | |
| Network | Network Out Dropped | Count | Number of transmitted packets dropped on the network interface | |
| Network | Network In Errors | Count | Number of received packets with errors on the network interface | |
| Network | Network Out Errors | Count | Number of transmitted packets with errors on the network interface | |
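Because the rate metrics are reported in Bytes/Second but collected at 1-minute intervals, it can be useful to estimate the volume transferred during each interval. The following is a minimal sketch, assuming each data point represents the average rate over its 60-second interval (the source does not state how the value is aggregated). The function names are hypothetical and are not part of any ServiceWatch API.

```python
from typing import Iterable

# Default monitoring collects data at 1-minute intervals.
SAMPLE_INTERVAL_SECONDS = 60


def bytes_in_interval(rate_bytes_per_second: float,
                      interval_seconds: int = SAMPLE_INTERVAL_SECONDS) -> float:
    """Estimate bytes transferred in one interval from an average rate.

    Assumption: the data point is the average rate over the interval.
    """
    return rate_bytes_per_second * interval_seconds


def total_bytes(rates_bytes_per_second: Iterable[float]) -> float:
    """Sum the estimated volume over consecutive 1-minute data points."""
    return sum(bytes_in_interval(rate) for rate in rates_bytes_per_second)


if __name__ == "__main__":
    # Three consecutive hypothetical Network In Bytes data points.
    samples = [100.0, 200.0, 300.0]
    print(total_bytes(samples))
```

The same conversion applies to Disk Write Bytes, Disk Read Bytes, and Network Out Bytes, which share the Bytes/Second unit.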
Event Streams Basic Metrics
| Category | Performance item | Unit | Detailed description | Meaningful statistics |
|---|---|---|---|---|
| JVM | JVM Threads Current | Count | Number of currently active threads | |
| JVM | JVM Memory Usage | Bytes | Current amount of memory used by the JVM | |
| JVM | Time spent in GC | None | JVM time spent on garbage collection (GC seconds per minute) | |
| JVM | Count in GC | Count/Minute | Number of garbage collection runs executed by the JVM | |
| kafka_controller | Offline Partitions | Count | Number of partitions that cannot be written to or read from because there is no active leader | |
| kafka_controller | Active Controller | Count | Number of active controllers in the cluster | |
| kafka_controller | Total of Topics | Count | Total number of topics observed by the active controller | |
| kafka_controller | Leader Elections | Count/Minute | Number of leader elections triggered when a broker fails | |
| kafka_controller | Unclean Leader Elections | Count/Minute | Number of elections in which an out-of-sync replica (OSR) was elected leader | |
| kafka_network | Request Queue | Count | Request queue size | |
| kafka_server | Zookeeper Expired Sessions | Count/Minute | Number of expired ZooKeeper sessions | |
| kafka_server | Incoming Messages | Count/Second | Messages received by the broker | |
| kafka_server | Producer requests | Count/Second | Produce requests received by the broker | |
| kafka_server | Consumer requests | Count/Second | Fetch requests received by the broker | |
| kafka_server | Failed Fetch Requests | Count/Minute | Number of failed fetch requests from consumers | |
| kafka_server | Failed Produce Requests | Count/Minute | Number of failed produce requests from producers | |
| kafka_server | Incoming Byte Rate | Bytes/Second | Rate of bytes received from clients (per broker) | |
| kafka_server | Outgoing Byte Rate | Bytes/Second | Rate of bytes sent to clients (per broker) | |
| kafka_server | Rejected Byte Rate | Bytes/Second | Bytes rejected per broker because the record batch size exceeds the max.message.bytes setting | |
| kafka_server | Online Partitions | Count | Total number of partitions | |
| kafka_server | Leader Partitions | Count | Number of leader partitions | |
| kafka_server | Under Replicated Partitions | Count | Number of partitions with insufficient replica count | |
| kafka_server | Under Minisr Partitions | Count | Number of partitions whose in-sync replica count is less than minIsr | |
| Systemd | Active Systemd Unit | None | Service status | - |
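Several of the metrics above are commonly read together to judge cluster health. The following is a minimal sketch of such a check, not an official ServiceWatch feature. The `ClusterSnapshot` class, its field names, and the 10% GC threshold are hypothetical choices made for illustration, and the sketch assumes the metric values have already been retrieved from ServiceWatch by some other means.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClusterSnapshot:
    """Hypothetical container for one 1-minute data point per metric."""
    offline_partitions: int
    active_controllers: int
    under_replicated_partitions: int
    under_min_isr_partitions: int
    unclean_leader_elections_per_minute: float
    gc_seconds_per_minute: float


def gc_time_fraction(gc_seconds_per_minute: float) -> float:
    """Convert 'GC seconds per minute' into a fraction of wall-clock time."""
    return gc_seconds_per_minute / 60.0


def find_issues(snapshot: ClusterSnapshot,
                gc_fraction_limit: float = 0.10) -> List[str]:
    """Return human-readable findings; an empty list means nothing was flagged.

    The gc_fraction_limit default is an illustrative assumption, not a
    recommended value.
    """
    issues = []
    if snapshot.offline_partitions > 0:
        issues.append("partitions without an active leader")
    if snapshot.active_controllers != 1:
        # A Kafka cluster is expected to have exactly one active controller.
        issues.append("active controller count is not 1")
    if snapshot.under_replicated_partitions > 0:
        issues.append("under-replicated partitions present")
    if snapshot.under_min_isr_partitions > 0:
        issues.append("partitions below minIsr")
    if snapshot.unclean_leader_elections_per_minute > 0:
        issues.append("unclean leader elections occurred")
    if gc_time_fraction(snapshot.gc_seconds_per_minute) > gc_fraction_limit:
        issues.append("high share of time spent in GC")
    return issues


if __name__ == "__main__":
    sample = ClusterSnapshot(2, 1, 0, 0, 0.0, 12.0)
    for issue in find_issues(sample):
        print(issue)
```

Returning a list of findings rather than a single boolean keeps each metric's contribution visible, which makes it easier to map a finding back to the corresponding row in the table.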