1 - Overview

Service Overview

The Cloud Monitoring service collects usage data, change information, and logs from operating infrastructure resources, generates an event when a configured threshold is exceeded, and notifies users. This lets users respond quickly to performance degradation and failures, and makes it easier to plan resource capacity expansion for a stable computing environment.

Provided Functions

Cloud Monitoring provides the following functions.

  • Stable Computing Resource Management: You can easily check indicators such as CPU usage, disk usage, and memory usage. Since notifications are automatically sent to designated personnel when events occur in resources being used, you can operate computing resources stably and quickly analyze and respond to failures.
  • Convenient Monitoring: Resource status information can be easily monitored by creating a dashboard. Basic dashboards and user-defined dashboards are provided, and various types of widgets can be set up to easily and quickly create dashboards.
  • Event Metric Management: Event metrics can be easily set up with just a few clicks through the web-based console. Event metric settings for monitoring targets (event patterns, occurrence conditions, occurrence cycles, performance metrics, operation status, etc.) can be changed in various ways to suit the usage environment, and threshold settings and alarm settings can be easily managed.
  • Resource Log Management: Log data of resources can be collected and stored, and searches can be performed on target logs as needed. Additionally, events are quantified for major keywords, and when predefined conditions are met, notifications are automatically sent to designated personnel, providing a more stable usage environment.

Components

Dashboard

The monitoring dashboard allows you to check the operation status, event status, and usage rates of monitoring targets and services.

Item | Description
Region | Location of resources
Data Reference Time | Reference time of data displayed on the dashboard
Refresh | Refresh the dashboard based on the current time
Period Setting | Set the data query period and refresh cycle
Monitoring Status | Number and status of monitoring targets for each service in the account
Event History | Display recent 7-day events by risk level as a graph
Top 5 Performance Usage | Display the top 5 monitoring targets with the highest performance usage
Event Map | Display the number of events for each service by risk level
Event Status | Display a list of unprocessed events that have occurred
Table. Cloud Monitoring Dashboard Components

Performance Analysis

Performance analysis allows you to check the main performance items of monitoring targets and view current data and historical data for each performance item. Users can check the performance status of monitoring targets by service or period and analyze the results by comparing specific performance.

Log Analysis

Log analysis collects and checks the logs of monitoring targets and converts them into quantifiable data for monitoring. Basic logs are provided for each monitoring target, and users can create custom logs to collect and check additional logs.

Event Management

An event is a setting that notifies users when the performance value of a monitoring target meets certain conditions. By setting events, users can make sure they do not miss important monitoring information. For example, if an event is set to occur when an overload-related performance value exceeds a certain value, users are notified whenever there is a risk of overload during resource operation and can respond before problems occur. Event management allows users to create events and have designated users notified when specific values occur during monitoring.

Preceding Services

Cloud Monitoring has no preceding services.

2 - How-to guides

Samsung Cloud Platform Monitoring is a resource management system that allows users to monitor and analyze the operation status of resources within an account on the Samsung Cloud Platform Console. Users can efficiently manage resources using the dashboard page, widgets, and chart features.

Note
  • Users can monitor resources created in the Samsung Cloud Platform Console with authorized accounts.
  • Users can log in to the Samsung Cloud Platform Console and navigate to Samsung Cloud Platform Monitoring to monitor resources.

Getting Started with Cloud Monitoring

To start using Samsung Cloud Platform Monitoring, follow these steps:

  1. Click on All Services > Management > Cloud Monitoring menu. This will take you to the Service Home page of Cloud Monitoring.
  2. Click on the Open Cloud Monitoring button on the Service Home page. This will take you to the Cloud Monitoring Console page.

Exploring the Cloud Monitoring Console

The top and left menus of the Cloud Monitoring Console are composed as follows:

Category | Description
Custom Dashboard Management | Custom Dashboard
  • View and manage custom dashboards.
Support | Support
  • Provides links to user guides and OpenAPI guides.
Region List | Region List
  • Displays the regions being monitored for the current account.
  • Allows selecting regions provided by the account.
User Information | View user information and log out of Samsung Cloud Platform Monitoring.
Side Menu | Displays the main features of Samsung Cloud Platform Monitoring. Each menu can be clicked to navigate to the corresponding page.
  • Monitoring Dashboard: View the operation status, event status, and usage of monitored services and resources. For more information, see Using the Monitoring Dashboard.
  • Performance Analysis: View key performance metrics and current data and history for each metric. For more information, see Analyzing Performance.
  • Log Analysis: Collect and view logs from monitored resources and convert them into metrics for monitoring. For more information, see Analyzing Logs.
  • Event Management: Set up notifications for specific conditions. For more information, see Managing Events.
Table. Exploring the Monitoring Page

Ending Monitoring

To exit the Cloud Monitoring Console, click the Log Out button in the top right corner of the User Information section.

Note
The session timeout for the Cloud Monitoring Console is set to 30 minutes.

Using Common Features

This section describes frequently used features when using the Cloud Monitoring Console.

Viewing Detailed Information

To view detailed information about a monitored resource, navigate to Cloud Monitoring Console > Performance Analysis or Cloud Monitoring Console > Log Analysis > Log Status. Then, click on the monitored resource for which you want to view detailed information.

Note
  • The detailed information for a monitored resource may vary depending on the service type.
  • If the operating system (OS) of the monitored resource is RHCOS (Red Hat Enterprise Linux CoreOS), detailed information may not be available.
Item | Description
Basic Information | Displays basic information about the monitored resource
  • Example: Virtual Server - monitored resource, service type, service status, server type, OS information, IP
Performance | Displays key performance metrics as graphs
Logs | Displays the log collection volume as graphs
Events | Displays a list of events that occurred on the monitored resource
Agent | Provides Install, Start, Stop, Delete, and Update commands for the agent
Query Period Setting | Displays the query period for date/time data
  • Refreshes the data based on the current time.
  • Enables or disables automatic refresh.
  • Allows setting the data query period or changing the automatic refresh interval. For more information, see Setting the Query Period.
Monitoring Status Area | Displays the monitoring status for performance, logs, and events.
Table. Monitored Resource Detailed Information
Note
  • Agent management commands are available for Virtual Server, GPU Server, and Bare Metal Server services.
  • For more information on agent installation and management, see Managing Agents.

Sorting Data

You can sort event monitoring, performance analysis, and log analysis results in descending or ascending order. To sort data, follow these steps:

  1. Display the information you want to sort on the page.
  2. Click on the Sort button next to the category name. The sort order changes between descending and ascending each time you click.

Viewing Real-Time Data

You can set the dashboard or detailed information page to automatically refresh the data at a specified interval.

Note
  • The Cloud Monitoring Console allows you to set the monitoring page to refresh periodically.
  • You can refresh the data based on the current time by clicking the Refresh button.

To set the refresh interval, follow these steps:

  1. Click the Settings button in the top right corner of the data representation area.
  2. Select the refresh interval and click OK.
  3. Enable or disable the automatic refresh feature as needed.

Setting the Query Period

You can set the query period to limit the scope of performance, log, and event data, making it easier to find the information you need. To set the query period, follow these steps:

  1. Click the Settings button in the top right corner of the data representation area.
  2. Select or enter the query period.
Caution
  • When entering the query period manually, it must be at least 30 minutes.
  • If the data query range is fixed for each widget, the widget’s query range takes priority.
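
The two rules in the Caution above can be sketched as a small check. This is only an illustration of the stated behavior, not the console's implementation: the function name `effective_query_period` and its parameters are hypothetical, and it assumes that a widget's fixed range simply replaces the page-level setting.

```python
from datetime import timedelta
from typing import Optional

MIN_QUERY_PERIOD = timedelta(minutes=30)  # minimum stated in the Caution above


def effective_query_period(requested: timedelta,
                           widget_fixed: Optional[timedelta] = None) -> timedelta:
    """Return the period a widget would use (hypothetical helper)."""
    if widget_fixed is not None:
        # Assumption: a widget with a fixed query range ignores the page setting.
        return widget_fixed
    if requested < MIN_QUERY_PERIOD:
        raise ValueError("query period must be at least 30 minutes")
    return requested
```

For example, a manually entered 10-minute period would be rejected, while a 1-hour period on a widget fixed to 7 days would resolve to 7 days.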

2.1 - Using the Monitoring Dashboard

The monitoring dashboard allows you to view the operational status and event history of monitored services and resources, as well as the top usage items.

Getting Started with the Monitoring Dashboard

When you navigate to the Cloud Monitoring Console page in the Samsung Cloud Platform Console, the monitoring dashboard is displayed. If you are on a different page, you can click Cloud Monitoring Console > Monitoring Dashboard to move to the monitoring dashboard page.

The monitoring dashboard is composed of the following elements.

Item | Description
Data Reference Time | Displays the reference time for the data shown on the dashboard
Refresh | Refreshes the dashboard based on the current time
Auto Refresh | Enables or disables the auto-refresh feature for the dashboard
Period Setting | Sets the data retrieval period or changes the refresh cycle
Monitoring Status | Displays the number of monitored targets and their monitoring status for each service
Event History | Displays the number of events that occurred in the last 7 days as a graph by risk level
Top 5 Performance Usage | Displays the top 5 monitored targets with the highest performance usage as a graph
Event Map | Displays the number of events that occurred for each service by risk level
Event Status | Displays a list of unprocessed events that have occurred
Table. Monitoring Dashboard Composition
Note
  • The monitoring dashboard is automatically created when you create an account in the Samsung Cloud Platform Console and cannot be deleted.
  • The widgets that make up the monitoring dashboard cannot be changed.
  • To create a dashboard with a specific widget, use a custom dashboard. For more information on custom dashboards, see Using Custom Dashboards.

Understanding Common Dashboard Features

This section describes the features that can be used in the dashboard.

Downloading Widget Images

You can download a widget as an image file (*.png) by clicking the download button in the top-right corner of the widget area.

Viewing Detailed Graph Information

When you hover over a graph with your mouse cursor, detailed information appears in a popup.

Monitoring Status

Displays the number of monitored targets and their monitoring status for each service in use.

Item | Description
Service Category | Displays the service category and the number of monitored targets for each service category
  • Clicking on a service category displays the list of services and the number of monitored targets included in the category
Service List | Displays the list of services and the number of monitored targets included in the service category
  • Clicking on the number of monitored targets for each service moves to the Performance Analysis page
Monitoring Status | Displays the number of monitored targets and their current status
  • Clicking on the Down or Unknown items displays the service name in a popup
Event Status | Displays the number of events that have occurred, classified by risk level (Fatal, Warning, Inform)
Note
  • The performance collection in the monitoring status displays the combined number of performance items for both Agent and Agentless methods.

Event History

Displays the number of events that occurred in the last 7 days as a graph by risk level.

  • When you hover over the graph with your mouse cursor, the event risk level and the number of occurrences for the selected date appear in a popup.
  • Occurrences: The total number of events that occurred
  • Active: The number of events that continue to occur because they meet the event occurrence conditions
  • Inactive: The number of events that no longer occur because they do not meet the event occurrence conditions
  • You can click on the risk level legend area to hide or show the corresponding graph.
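
The Occurrences, Active, and Inactive counts described above can be sketched as a simple tally per risk level. The `Event` class and `summarize` function are hypothetical names used for illustration; the actual event records in Cloud Monitoring may carry different fields.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Event:
    risk_level: str  # "Fatal", "Warning", or "Inform"
    active: bool     # True while the occurrence condition is still met


def summarize(events):
    """Count occurrences, active, and inactive events per risk level."""
    summary = {}
    for event in events:
        row = summary.setdefault(event.risk_level, Counter())
        row["occurrences"] += 1
        row["active" if event.active else "inactive"] += 1
    return summary
```

In this sketch, Occurrences is always the sum of Active and Inactive for a given risk level.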

Top 5 Performance Usage

Displays the top 5 monitored targets with the highest performance usage as a graph.

  • When you hover over the graph with your mouse cursor, the full name of the selected target and its current performance value appear in a popup.
  • Clicking on the graph opens the Monitored Target Details popup window for the corresponding target.
Item | Description
CPU Usage/Core [Basic] | The percentage of CPU time used, excluding Idle and IOWait states
Memory Used [Basic] | The current amount of used memory
Disk Read Bytes [Basic] | The number of disk read bytes
Disk Write Bytes [Basic] | The number of disk write bytes
Note
  • The monitoring dashboard only displays the performance of Virtual Servers. To display the top 5 performance of other service types, you must select and configure them in a custom dashboard.
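
As a rough illustration of the CPU usage definition and the top 5 ranking above, the following sketch computes the share of CPU time spent outside the idle and iowait states and then picks the highest values. The function names and the input layout (a dictionary of CPU time per state) are assumptions made for this example, not the service's data model.

```python
def cpu_usage_percent(times):
    """Percent of CPU time spent outside the idle and iowait states."""
    total = sum(times.values())
    if total == 0:
        return 0.0
    busy = total - times.get("idle", 0) - times.get("iowait", 0)
    return 100.0 * busy / total


def top_n(usage_by_target, n=5):
    """Return the n (name, value) pairs with the highest usage, highest first."""
    ranked = sorted(usage_by_target.items(), key=lambda kv: kv[1], reverse=True)
    return ranked[:n]
```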

Event Map

Displays the number of events that occurred for each service by risk level.

  • When you hover over a square with your mouse cursor, the name of the monitored target appears in a popup.
  • Clicking on a service item in the event map opens the Monitored Target Details popup window for the corresponding service.

Each item’s risk level is as follows.

Item | Description
No Rule | A state where it is impossible to determine whether it is normal or abnormal. It means that there is no threshold setting value, so the state cannot be determined.
NORMAL | A normal state. It means that the threshold was not exceeded, so no event occurred.
INFORM | The lowest level of risk. It includes simple notification-level information.
WARNING | A medium level of risk.
FATAL | The highest level of risk.
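
One way to read the risk levels above is as a mapping from a performance value and its configured thresholds to a state. The sketch below assumes a simple "value greater than threshold" condition and a hypothetical `thresholds` dictionary keyed by risk level; real event settings support more condition types than this.

```python
from typing import Optional

# Risk levels from the table above, ordered from highest to lowest severity.
SEVERITY_ORDER = ["FATAL", "WARNING", "INFORM"]


def risk_level(value: float, thresholds: Optional[dict]) -> str:
    """Map a performance value to a risk level (hypothetical helper)."""
    if not thresholds:
        return "No Rule"  # no threshold configured, so the state is unknown
    for level in SEVERITY_ORDER:
        limit = thresholds.get(level)
        if limit is not None and value > limit:
            return level  # the highest exceeded threshold wins
    return "NORMAL"
```

With thresholds of 90 for FATAL and 80 for WARNING, a value of 85 maps to WARNING and a value of 10 maps to NORMAL.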

Event Status

Displays a list of events that have occurred and are still active.

  • Events are listed with the most recent first.

2.2 - Analyzing Performance

In Performance Analysis, you can check the main performance items of the monitoring target and view the current data and history of each performance item. Users can check the performance status of the monitoring target they manage by service or period and analyze the results by comparing specific performance.

Getting Started with Performance Analysis

You can start performance analysis by selecting a monitoring target directly or entering search conditions. To analyze performance by searching for a monitoring target, follow these steps:

  1. Click Cloud Monitoring Console > Performance Analysis. You will be moved to the Performance Analysis page.
  2. Enter the search conditions for the monitoring target you want to analyze in the search area, and then click Search.
Item | Description
Search Area | Detailed search filters are displayed differently in the search area depending on the service type
  • Click the Detailed Search button to perform a detailed search.
  • Multiple condition items can be selected for each detailed search filter
Number of Monitoring Targets Displayed | Displays the number of search results and the number of performance items that can be viewed at a time in the list
  • The default value for the number of performance items displayed in the list is 20.
  • The number of items listed can be changed to 10, 20, 30, 40, 50, or 100
Search Information | Displays the search result values for the search condition items
  • Monitoring target, service status, event level
  • Clicking the risk icon displayed in the event risk will open a popup window with the most recent event details for that risk.
Performance Indicator Information | Displays the main performance indicators for the monitoring target based on the service type
  • Refer to the list of main performance indicators by service and the collection information by instance type and status for DB services
Detailed View | Check the detailed information of the corresponding monitoring target
Performance Comparison | Select a monitoring target to compare performance
Table. Performance Analysis

Checking Performance Details

To check the detailed performance information of a monitoring target, follow these steps:

  1. Click the monitoring target you want to check in the performance analysis list. The Monitoring Details popup window will open.
  2. Click the Performance tab.
    • When you place the mouse cursor over the graph, the values of each performance item will appear in a popup window.
    • You can set the query period or change the refresh cycle by clicking the icon in the top right corner.
    • You can select the graph display method by clicking the Detailed or Summary buttons in the top left corner of the performance chart.
Item | Description
Basic Information | Displays basic information about the monitoring target
Detailed | Displays the performance chart of the monitoring target in detail
  • Check one chart in detail
Summary | Displays the performance chart of the monitoring target in a checkerboard format
  • Check multiple charts at a glance
Query Period Setting
  • Date/Time: Displays the query base time of the data.
  • Refresh: Refreshes the data directly to the current time.
  • Start/Stop: Enables or disables the automatic refresh function.
  • Settings: Sets the data query period or changes the automatic refresh cycle
Performance Comparison | Creates a chart to compare the performance of the monitoring target and makes it possible to compare each performance
Performance Chart | Displays the performance chart of the monitoring target as a graph
  • If there is only one graph, the last collected value is displayed with the unit in the top right corner.
  • If there are multiple graphs, ⓘ is displayed in the top right corner, and when you place the mouse cursor over it, the last collected value of each graph appears in a popup window.
  • When you place the mouse cursor over the graph, the performance item value at the specified time appears in a popup window.
Table. Monitoring Target Details
Note
  • The collection cycle of performance values may vary depending on the service.
  • Each chart displays 30 data points, so the interval between points depends on the data query range (time) as follows. (The plotted points may vary slightly depending on collection time error.)
    • 30 minutes: approximately 1-minute interval
    • 60 minutes: approximately 2-minute interval
    • 3 hours: approximately 6-minute interval
    • 6 hours: approximately 12-minute interval
    • 12 hours: approximately 24-minute interval
    • 24 hours: approximately 48-minute interval
    • 3 days: approximately 144-minute interval (2 hours 24 minutes)
    • 7 days: approximately 336-minute interval (5 hours 36 minutes)
    • 14 days: approximately 672-minute interval (11 hours 12 minutes)
    • Custom: The user-specified range (minutes) divided by 30
  • Each point shows the maximum value within its interval of the query range (time), and you can change the statistical type in the detailed chart.
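
The intervals in the note above all follow from dividing the query range by 30 points. The sketch below reproduces that arithmetic and, as one possible reading of the last bullet, reduces raw samples to the maximum value per interval. The function names and the `(minute_offset, value)` sample layout are assumptions made for illustration.

```python
CHART_POINTS = 30  # number of points per chart, as stated in the note above


def point_interval_minutes(query_range_minutes: float) -> float:
    """Interval between chart points for a given query range."""
    return query_range_minutes / CHART_POINTS


def bucket_max(samples, query_range_minutes):
    """Reduce (minute_offset, value) samples to per-interval maxima."""
    interval = point_interval_minutes(query_range_minutes)
    buckets = {}
    for offset, value in samples:
        # Clamp so a sample at the very end of the range lands in the last bucket.
        index = min(int(offset // interval), CHART_POINTS - 1)
        buckets[index] = max(value, buckets.get(index, value))
    return [buckets[i] for i in sorted(buckets)]
```

For a 7-day range this gives 10080 / 30 = 336 minutes per point, which matches the listed interval of 5 hours 36 minutes.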

Comparing Performance

You can check the performance items of each monitoring target and select the desired performance items to compare.

Getting Started with Performance Comparison

You can create a chart to compare the performance of the monitoring target and compare each performance.

Note
  • Only performance items of the same service type can be compared.
  • Performance items may be added depending on the detailed attributes of the service type.
    • Windows OS performance of VM
    • Kibana-related performance of Search Engine

To start performance comparison, follow these steps:

  1. Click Cloud Monitoring Console > Performance Analysis. You will be moved to the Performance Analysis page.

  2. Enter the search conditions for the monitoring target you want to analyze in the search area, and then click Search.

  3. Select all the monitoring targets you want to compare and click Performance Comparison. A popup window will open where you can compare performance.

    Item | Description
    Monitoring Target | Displays the service type of the monitoring target to be compared. Click to change the service
    • Changing the service will delete all charts created so far.
    • Click Add to search for and add the monitoring target of the currently selected service
    • The selected monitoring target is displayed on the page, and you can delete the monitoring target by clicking X or Delete All
    Performance Item | Displays all performance items collected by the currently selected service
    • Check the performance items you want to compare, and they will be included in the chart.
    Chart Display Method | Selects the display method for the performance comparison chart
    • Detailed: The performance comparison chart is displayed in detail (default)
    • Summary: The performance comparison chart is displayed briefly
    Query Period Setting
    • Date/Time: Displays the query base time of the data
    • Refresh: Refreshes the data directly to the current time.
    • Start/Stop: Enables or disables the automatic refresh function.
    • Settings: Sets the data query period or changes the automatic refresh cycle
    Chart Area | Displays a chart comparing the performance of the monitoring targets based on the selected performance items
  4. Click Add. A popup window will open where you can add a monitoring target.

  5. Select the monitoring target you want to compare and click OK.

    • If you select Kubernetes Engine, you must also select its subtype.
  6. Check the performance items you want to compare. The corresponding performance items will be added to the chart.

Checking the Chart

The performance comparison result is displayed as a chart. You can change the shape of the created chart or download it as an image or Excel file.

  1. When you place the mouse cursor over the graph, the performance item value at the specified time appears in a popup window.
  2. You can hide or show the graph by clicking the target item in the legend area.
    Item | Description
    Statistical Method | Sets the statistical method to be displayed as a graph
    • Displays statistics from 5 minutes to 6 hours.
    • Basic, Maximum, Minimum, Average, Sum can be selected, and multiple methods can be selected at the same time. The selected items are displayed in the legend area
    Chart Type | Selects the type of graph to be displayed in the chart
    • Line: Line graph
    • Stacked Area: Area graph
    • Scatter: Scatter graph
    Chart Download | Checks and downloads the raw data of the chart
    • Chart PNG File: Downloads the chart as an image file (PNG).
    • Chart Excel File: Downloads the data of the performance items displayed in the chart as an Excel file. The chart display data is a set of data automatically collected according to the query range.
    • Raw Excel File: Downloads all the data of the performance items displayed in the chart for the query range period as an Excel file.
    Time Series Graph Widget Addition | Adds the chart to a user-defined dashboard as a time series graph widget
    • Clicking will open a popup window to add a time series graph widget.
    Delete | Deletes the performance comparison result chart
    Performance Comparison Status | Displays the performance comparison result as a graph
    • When you place the mouse cursor over the graph, the performance comparison status at that time is displayed in a popup window.
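
Selecting several statistical methods at once, as described in the table above, amounts to computing one series per method over the same samples. The sketch below shows that idea for a single window of values. The `aggregate` function is a hypothetical name, and the Basic method is left out because the guide does not define how it is calculated.

```python
from statistics import mean

# Statistical methods named in the table above, except Basic (not defined there).
STAT_FUNCS = {"Maximum": max, "Minimum": min, "Average": mean, "Sum": sum}


def aggregate(values, methods):
    """Return one aggregated value per selected statistical method."""
    return {method: STAT_FUNCS[method](values) for method in methods}
```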

2.3 - Analyzing Logs

Log analysis collects the logs of monitoring targets, lets you check their contents, and converts them into metrics (structured data) for monitoring. Basic collection logs are provided for each monitoring target, and users can create custom logs to collect and check additional logs.

Note
To use log analysis, you must install and operate a log collection agent in advance. For more information on installing and operating log agents, please refer to Managing Agents. To collect logs from Kubernetes Engine, you must set up log collection in the Samsung Cloud Platform Console.

Getting Started with Log Analysis

You can check the log status list or search the logs of a monitoring target. To check the log status list, follow these steps:

  1. Click Cloud Monitoring Console > Log Analysis > Log Status. You will be moved to the Log Status page.
  2. Enter the search conditions of the service to be analyzed in the search area, and then click Log Search.
    • A list of services that match the search criteria and search information will be displayed at the bottom.
    • Clicking the Detail View button for each service displays detailed log information for the service.
      Item | Description
      Search Area | The search filters displayed in the search area may vary depending on the service type
      • Click the Advanced Search button to perform an advanced search.
      • Multiple condition items can be selected for each detailed search filter
      Number of Monitoring Targets Displayed | Displays the number of search results and the number of items that can be viewed at a time in the list
      • The default is to view 20 at a time.
      • The number of items listed can be changed to 10, 20, 30, 40, 50, or 100
      Search Information | Displays the search result values for the search condition items
      Detailed View | Check the detailed information of the corresponding monitoring target
      Log Search | Search logs by combining keywords and queries and check detailed history
Note
  • If a Virtual Server or Node connected to the monitoring target exists, the status diagram is also displayed in the search information area.
  • The name of a monitoring target can contain Korean characters, uppercase and lowercase English letters, numbers, and the special symbols (-, _, .), up to a maximum of 100 characters.
  • If you do not have permission for a monitoring target, information about that target and a permission check message are displayed in a popup.
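
The naming rule for monitoring targets can be approximated with a regular expression. This is a sketch under two assumptions that the guide does not state: that "Korean" means precomposed Hangul syllables (U+AC00 to U+D7A3), and that a name must contain at least one character. The function name is hypothetical.

```python
import re

# Hangul syllables, English letters, digits, and the symbols - _ . (1 to 100 chars).
NAME_PATTERN = re.compile(r"[가-힣A-Za-z0-9\-_.]{1,100}")


def is_valid_target_name(name: str) -> bool:
    """Check a monitoring target name against the documented character rule."""
    return NAME_PATTERN.fullmatch(name) is not None
```

Under these assumptions, a name such as `web-server_01.prod` passes, while a name containing a space or exceeding 100 characters does not.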

Checking Log Details

You can view the detailed log records and log graphs of the monitoring target.

Checking the Log List

You can check the log details in the Monitoring Details popup window. To check the log details of a monitoring target, follow these steps:

  1. Click Cloud Monitoring Console > Log Analysis > Log Status. You will be moved to the Log Status page.

  2. On the Log Status page, click the log whose details you want to check. The Monitoring Details popup window will open.

  3. Click the Logs tab. When you place the mouse cursor over the graph, the value of each log item appears in a popup window.

    • You can set the query period or change the refresh cycle by clicking the icon in the top right corner.
    • You can select the graph display method by clicking the Details or Summary buttons in the top left corner of the log chart.
    Item | Description
    Basic Information | Displays basic information about the monitoring target
    Details | Displays the chart for each log of the monitoring target in expanded form
    • Check one chart in detail
    Summary | Displays the log charts of the monitoring target in a checkerboard format
    • Check multiple charts at a glance
    Query Period Setting
    • Date/Time: Displays the query base time of the data.
    • Refresh: Refreshes the data directly to the current time.
    • Start/Stop: Enables or disables the automatic refresh function.
    • Settings: Sets the data query period or changes the automatic refresh cycle
    Log Search | Combine keywords and queries to search logs and check detailed history
    Log Chart | Displays the log chart of the monitoring target as a graph
    • When you place the mouse cursor over the graph, the log item value at the specified time appears in a popup window.

Searching Logs

You can search logs by combining keywords and queries, and check the details.

Note
The presence and frequency of keywords can be converted into metrics and displayed as charts on a dashboard page, or used in related events to receive notifications.
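
Converting keyword frequency into a metric, as mentioned in the note above, can be pictured as counting matching log lines per time bucket. The sketch below groups by minute. The function name, the `(timestamp, message)` input layout, and the one-minute bucket size are assumptions made for illustration only.

```python
from collections import Counter
from datetime import datetime


def keyword_counts_per_minute(log_lines, keyword):
    """Count log lines containing the keyword, grouped by minute."""
    counts = Counter()
    for timestamp, message in log_lines:
        if keyword in message:
            # Truncate the timestamp to the start of its minute.
            counts[timestamp.replace(second=0, microsecond=0)] += 1
    return dict(counts)
```

The resulting per-minute counts are the kind of numeric series that could be charted or compared against an event threshold.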

To search logs, follow these steps:

  1. Click Cloud Monitoring Console > Log Analysis > Log Status. You will be moved to the Log Status page.

  2. Click Log Search on the Log Status page. You will be moved to the Log Search page.

    Item | Description
    Monitoring Target | Indicates the service type of the monitoring target to be searched
    • Click the monitoring target list to change the service
    • If the service is changed, all charts created so far will disappear.
    • Click the Add button to search for and add the monitoring target of the currently selected service
    • The selected monitoring target is displayed on the page, and you can delete the monitoring target by clicking X or Delete All.
    Search Condition | Set the condition for the log to be searched
    Query Period Setting
    • Date/Time: Displays the query base time of the data.
    • Refresh: Refreshes the data directly to the current time.
    • Start/Stop: Enables or disables the automatic refresh function.
    • Settings: Sets the data query period or changes the automatic refresh cycle
    Log Occurrence Graph | Displays log occurrences as a graph
    Occurrence Log Message | Displays the log messages generated by the monitoring target by time
  3. Click the Add button. A popup window will open where you can add a monitoring target.

  4. Click the monitoring target and select the log file you want to add.

  5. Once the log file selection is complete, click the Confirm button.

  6. Enter the search conditions and click the Search button. The search results are displayed in the log occurrence graph and the list of log messages.

    Item | Description
    Add Metric | Add a metric to the log search results
    • Use after searching logs
    Execution History | Check the list of search conditions that were recently executed for the search
    • Execution history displays up to 20 most recently executed search conditions
    • Select the desired execution history to input as the current search condition
    Search field | Select search field
    Condition | Select search condition
    • like , !like , = , != , <= , >= , > , < can be selected
    Search value | Enter the keyword to search
    Log Search | Select an operator (AND, OR) for the newly added search condition
    • Only displayed when a new search condition is added
    Add condition | Add new search condition

  7. When searching logs, the log history corresponding to the entered condition is displayed as a chart.

    • Log history is displayed in seconds.
      ItemDescription
      Log occurrence graphThe log occurrence during the set period is displayed as a graph
      • If you place the mouse cursor over the graph, the value of each log item appears in a popup window.
      • Clicking on the bar graph of the graph displays the list of logs at that point in time.
      Setting the inquiry period
      • Date/Time: Displays the reference time of the data
      • Refresh: Refreshes directly to the current time.
      • Start/Stop: Turns the automatic refresh function on or off.
      • Settings: Sets the data inquiry period or changes the automatic refresh cycle
      Monitoring targetThe monitoring target list is displayed
      • If you select a monitoring target to check the log message, the contents will be displayed in the log list
      Log listLog messages that occurred in the monitoring target are displayed by time
      • Clicking the button in the log list displays the full message of the log
      • Click download to download the currently displayed log message in Excel and TXT file formats
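
The search conditions described above combine a field, a comparison operator, and a value, joined by AND or OR. The following is a minimal sketch of how such conditions could be evaluated against a single log record. It is not the service's implementation: the names `OPERATORS` and `matches`, the record layout, the substring meaning of `like`, and the left-to-right evaluation of AND/OR are all assumptions made for illustration.

```python
import operator
from typing import Callable, Dict, List, Tuple


def _numeric(compare: Callable[[float, float], bool]) -> Callable[[str, str], bool]:
    """Wrap a numeric comparison so that non-numeric input simply fails to match."""
    def predicate(field_value: str, search_value: str) -> bool:
        try:
            return compare(float(field_value), float(search_value))
        except ValueError:
            return False
    return predicate


# One predicate per selectable condition.
# Assumption: "like" is a substring match and "!like" is its negation.
OPERATORS: Dict[str, Callable[[str, str], bool]] = {
    "like": lambda field_value, search_value: search_value in field_value,
    "!like": lambda field_value, search_value: search_value not in field_value,
    "=": operator.eq,
    "!=": operator.ne,
    "<=": _numeric(operator.le),
    ">=": _numeric(operator.ge),
    ">": _numeric(operator.gt),
    "<": _numeric(operator.lt),
}

# (connector, search field, condition, search value); the first connector is ignored.
Condition = Tuple[str, str, str, str]


def matches(record: Dict[str, str], conditions: List[Condition]) -> bool:
    """Evaluate the conditions left to right (assumed: no operator precedence)."""
    result = True
    for index, (connector, field, condition, value) in enumerate(conditions):
        outcome = OPERATORS[condition](record.get(field, ""), value)
        if index == 0:
            result = outcome
        elif connector == "AND":
            result = result and outcome
        elif connector == "OR":
            result = result or outcome
        else:
            raise ValueError(f"unknown connector: {connector}")
    return result
```

For example, `matches({"message": "ERROR disk full", "status": "503"}, [("", "message", "like", "ERROR"), ("AND", "status", ">=", "500")])` evaluates both conditions and joins them with AND.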

Check the status of log collection

You can check the collection information of major logs for the past 7 days in a chart.

When you place the mouse cursor on the graph, detailed information appears in a pop-up window. Only collected logs are aggregated, and uncollected logs are not displayed in the current status.

Reference

When you create an * Account, it provides 1GB of virtual capacity by default to store the collected logs.

  • Collection of any log can be stopped and restarted as needed.

To check the log collection status, click Cloud Monitoring Console > Log Analysis > Log Collection Dashboard.

ItemDescription
Accumulated log occurrence amountAmount of logs collected from the 1st of each month, displayed in GB
  • Displays the cumulative usage of the allocated total virtual capacity so far, as a percentage.
Recent 7-day log collection amountThe amount of logs collected over the past 7 days is displayed in a graph by service type
  • The line graph with markers shows the quantity (KB), and the bar graph shows the cumulative usage rate.
  • Click on the monitoring target in the legend area to display only the corresponding graph
Service-specific log occurrence rateDisplays the logs collected over the past 7 days, classified by service
  • When you click the bar representing a service, the monitoring targets with the most collected logs within that service are displayed on the Log Collection Top 10 chart.
Log Collection Top 10Displays, as a graph, the 10 monitoring targets with the most logs collected over the past 7 days within the service selected in Service-specific log occurrence rate
  • Click on each point on the graph to view the detailed log records
  • Click on the monitoring target in the legend area to display only the corresponding graph
  • Clicking on the graph of the target service moves to the Log Status page
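
The Log Collection Top 10 ranking described in the table can be pictured as a per-service aggregation. The sketch below is only an illustration under assumed names: `LogSample` and `top_targets` are hypothetical, and the sample layout (service, monitoring target, bytes collected) is not taken from the product.

```python
from collections import Counter
from typing import Iterable, List, Tuple

# Hypothetical sample layout: (service, monitoring target, bytes collected).
LogSample = Tuple[str, str, int]


def top_targets(samples: Iterable[LogSample], service: str, limit: int = 10) -> List[Tuple[str, int]]:
    """Return up to `limit` monitoring targets of one service, largest volume first."""
    totals: Counter = Counter()
    for sample_service, target, size in samples:
        if sample_service == service:
            totals[target] += size
    return totals.most_common(limit)
```

Passing the samples of the last 7 days and the service selected in the chart would yield the ranked list that the Top 10 graph plots.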
Reference
To perform log-related monitoring, you must install and run the log collection agent in advance. For more information on installing and operating log agents, refer to Managing Agents. Accumulated logs are stored up to a maximum of 1GB. If 1GB is exceeded, logs are deleted automatically, starting with the oldest.
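
The 1GB limit with oldest-first deletion behaves like a capacity-bounded queue. The following sketch shows that idea together with the cumulative usage percentage; the `LogStore` class is hypothetical, and treating 1GB as 1024^3 bytes and measuring size as UTF-8 byte length are assumptions.

```python
from collections import deque

CAPACITY_BYTES = 1024 ** 3  # 1GB, assuming binary units


class LogStore:
    """Keeps log messages up to a fixed capacity, evicting the oldest first."""

    def __init__(self, capacity: int = CAPACITY_BYTES) -> None:
        self.capacity = capacity
        self.entries: deque = deque()  # (message, size in bytes), oldest on the left
        self.used = 0

    def append(self, message: str) -> None:
        size = len(message.encode("utf-8"))
        self.entries.append((message, size))
        self.used += size
        # Drop the oldest entries until the store fits within the capacity again.
        while self.used > self.capacity and self.entries:
            _, evicted_size = self.entries.popleft()
            self.used -= evicted_size

    def usage_percent(self) -> float:
        """Cumulative usage of the allocated capacity, as a percentage."""
        return 100.0 * self.used / self.capacity
```

With a deliberately tiny capacity such as `LogStore(capacity=10)`, appending three 4-byte messages leaves only the two newest in the store.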

Check the status of the indicator settings

You can create metrics that display the occurrence of log patterns as a time series. To check the list of metrics, click Cloud Monitoring Console > Log Analysis > Metric Setting Status.

Reference
The metrics converted to time series data can be set as an event or registered on the dashboard for real-time monitoring.
ItemDescription
Search areaThe search filter displayed in the search area may vary depending on the service type
  • Click the Advanced search button to perform an advanced search.
  • You can select one or more items for each detailed search filter condition
Number of items to display for monitoring targetsDisplay search results
  • The default is to display 20 at a time.
  • The number of items listed can be changed to display 10, 20, 30, 40, 50, or 100 at a time
Search InformationDisplays the search result value for the search condition item
AddAdd a new indicator
DeleteSelect and delete indicators in search information

Check the details of the indicator

To view detailed information about the metric, follow these steps.

  1. Click Cloud Monitoring Console > Log Analysis > Metric Setting Status. The Metric Setting Status page is displayed.
  2. On the Metric Setting Status page, click the indicator name to check detailed information. The Indicator Details popup window will open.

Adding Indicators

You can add new metrics to display the desired log data as a time series.
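
As a rough illustration of what "displaying log data as a time series" means, the sketch below counts log messages containing a keyword in one-minute buckets. The function name `pattern_time_series`, the one-minute bucket size, and the plain substring match are assumptions for illustration, not the service's behavior.

```python
from collections import Counter
from datetime import datetime
from typing import Dict, Iterable, Tuple


def pattern_time_series(logs: Iterable[Tuple[datetime, str]], keyword: str) -> Dict[datetime, int]:
    """Count messages containing `keyword`, grouped by the minute they occurred in."""
    counts: Counter = Counter()
    for timestamp, message in logs:
        if keyword in message:
            # Truncate the timestamp to the start of its minute (assumed bucket size).
            bucket = timestamp.replace(second=0, microsecond=0)
            counts[bucket] += 1
    return dict(sorted(counts.items()))
```

Each key of the returned dictionary is one point on the time axis, and its value is the number of matching log messages in that minute.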

Reference
The log indicator can only be set for the monitoring target where the log agent is installed or logs are collected. For more information on installing and operating log agents, see Managing Agents.

To add a new metric, follow the procedure below.

  1. Click Cloud Monitoring Console > Log Analysis > Metric Setting Status. The Metric Setting Status page is displayed.

  2. On the Metric Setting Status page, click the Add button. The Add Indicator popup window opens.

  3. Enter the Indicator Name.

    • Indicator names can only use English uppercase and lowercase letters, underscores (_), periods (.), and hyphens (-).
    • To distinguish log metrics from general performance items, the prefix metricfilter. is automatically added and cannot be deleted or changed.
    ItemDescription
    Indicator NameEnter the name of the new indicator to be created
    Monitoring TargetIndicates the type of service for the monitoring target to be compared
    • Click the monitoring target list to change the service
    • If the service is changed, all charts created so far will disappear.
    • Click the add button to search for and add the monitoring target of the currently selected service
    • The selected monitoring target is displayed on the page and can be deleted by clicking X or delete all
    Search ConditionsSet conditions for logs to be searched
    Set query period
    • Date/Time: Displays the reference time for data query
    • Refresh: Refreshes directly to the current time.
    • Start/Stop: Turns automatic refresh on or off.
    • Settings: Allows setting the data query period or changing the automatic refresh cycle.
    Log Volume GraphWhen searching for logs, the log history that matches the entered conditions is displayed as a chart
    Occurrence Log MessageLog messages that occurred from the monitoring target are displayed by time
  4. Click the Add button. A popup window for adding monitoring targets opens.

  5. Click the monitoring target and select the log file you want to add.

  6. Once the log file selection is complete, click the Confirm button.

  7. Enter the search conditions and click the Search button. The search results will be displayed in the log volume graph and occurrence log message.

    ItemDescription
    Add MetricAdd metrics to log search results
    • Use after searching logs
    Execution HistoryCheck the list of search conditions that were recently executed for searching
    • Execution history displays up to 20 most recently executed search conditions
    • Select an entry in the execution history to load it as the current search condition
    Search FieldSelect Search Field
    ConditionSelect search condition
    • like , !like , = , != , <= , >= , > , < can be selected
    Search valueEnter the keyword to search
    OperatorSelect an operator (AND, OR) for the newly added search condition
    • Only displayed when a new search condition is added
    Add conditionAdd new search condition
  8. Click the Confirm button. The new metric is added, and a toast popup message is displayed.
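
The naming rules in step 3 can be checked before submitting a name. The sketch below is a hypothetical helper, not part of the console: `build_metric_name` validates the allowed characters and prepends the `metricfilter.` prefix. Digits are rejected only because the rule above does not list them, which is an interpretation rather than a confirmed behavior.

```python
import re

METRIC_PREFIX = "metricfilter."

# English letters, underscore, period, and hyphen, as listed in the naming rule.
_NAME_PATTERN = re.compile(r"[A-Za-z_.\-]+")


def build_metric_name(user_input: str) -> str:
    """Validate the user-entered part and return the full, prefixed metric name."""
    if not _NAME_PATTERN.fullmatch(user_input):
        raise ValueError(f"invalid indicator name: {user_input!r}")
    return METRIC_PREFIX + user_input
```

For instance, `build_metric_name("error_count")` returns `metricfilter.error_count`, while a name containing a space raises `ValueError`.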

Modifying Indicator Search Conditions

To modify the search conditions of an indicator, follow these steps.

  1. Click Cloud Monitoring Console > Log Analysis > Metric Setting Status. The Metric Setting Status page is displayed.
  2. On the Metric Setting Status page, click the Indicator Name of the indicator you want to modify. The Indicator Details popup window will open.
  3. In the Indicator Details popup window, click the Edit button. The Edit Indicator popup window opens.
  4. In the Edit Indicator popup window, modify the search conditions and click the Confirm button. The indicator is modified, and a toast popup message is displayed.

Deleting Indicators

To delete an indicator, follow these steps.

Reference
If there are charts or event policies using the metric you want to delete, you cannot delete the metric.
  1. Click Cloud Monitoring Console > Log Analysis > Metric Setting Status. The Metric Setting Status page is displayed.
  2. On the Metric Setting Status page, select the indicator to be deleted and click the Delete button. The indicator is deleted, and a toast popup message is displayed.

2.4 - Managing Events

An event is a setting that alerts users when the performance value of a monitored target meets certain conditions. Setting up events ensures that users do not miss important monitoring information. For example, if an event is set to occur when an overload-related performance value exceeds a certain threshold, users are notified whenever there is a risk of overload while operating the resource, and can take action before problems occur.

In event management, users can create events to alert designated users when specific values occur during monitoring.

Checking Event Status

The event status section displays information about all occurred events, related performance items, and event notification history. To check the event status list, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Status. The Event Status page will be displayed.
  2. On the Event Status page, enter the search conditions for the service you want to check in the search area, and then click the Search button.
    ItemDescription
    Search AreaThe search filter displayed in the search area varies depending on the service type.
    • Click the Detailed Search button to perform a detailed search.
    • Multiple conditions can be selected for each detailed search filter.
    Number of Monitoring Targets DisplayedDisplays the number of search results and the number of performance items that can be viewed at once in the list.
    • The default value for the number of performance items displayed in the list is 20 per page.
    • The number of performance items displayed in the list can be changed to 10, 20, 30, 40, 50, or 100 per page.
    Search InformationDisplays the search result values for the search condition items.
    • Clicking on the message content for each service allows you to check the detailed information of the event.
    Detailed ViewDisplays detailed information about the corresponding monitoring target.
    Table. Event List
Note
  • If a Virtual Server or Node is connected to the monitoring target, the status will also be displayed in the search information area.
  • The name of the monitoring target can include Korean, English (uppercase and lowercase), numbers, and special characters (-, _, .), with a maximum of 100 characters.
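
The naming rule for monitoring targets can be expressed as a single pattern. This is a sketch under assumptions: "Korean" is taken to mean precomposed Hangul syllables (U+AC00 to U+D7A3), spaces are treated as not allowed, and `is_valid_target_name` is a hypothetical helper rather than a console function.

```python
import re

# Hangul syllables, English letters, digits, period, underscore, hyphen; 1 to 100 characters.
_TARGET_NAME = re.compile(r"[\uAC00-\uD7A3A-Za-z0-9._\-]{1,100}")


def is_valid_target_name(name: str) -> bool:
    """Return True when the whole name satisfies the assumed character and length rule."""
    return _TARGET_NAME.fullmatch(name) is not None
```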

Viewing Event Status List

In the monitoring detail popup window, you can check the event information, occurrence time, and duration. To check the event occurrence status, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Status. The Event Status page will be displayed.
  2. On the Event Status page, click the Event tab.
    ItemDescription
    Event StatusDisplays the event message and occurrence time.
    ActiveDisplays only events that are currently active.
    AllDisplays all events.
    Event DetailsDisplays detailed information about the selected event message.
    Table. Event Tab

Checking Event Details

To check the event details, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Status. The Event Status page will be displayed.
  2. On the Event Status page, click the Event tab.
  3. On the Event Status page, select the event you want to check the details for, and then click Event Details. You can then check the event occurrence conditions, performance items, and notification history.
ItemDescription
Monitoring TargetDisplays the name of the monitoring target.
Occurrence ConditionDisplays the occurrence condition of the event.
Performance ItemDisplays a chart for the performance item.
  • Placing the mouse cursor over the graph displays the detailed performance value for each time period.
Notification HistoryDisplays the entire notification history.
Event Setting DetailsDisplays the setting information for the corresponding event.
Table. Event Details

Managing Event Settings

You can set up detailed event settings, such as the monitoring target, performance value that serves as the basis for event occurrence, event risk level, and event notification recipient. When the data collected from the monitoring target meets the conditions set in the event policy, notifications are sent to users via email, SMS, or messaging.

Note
  • Event policies can only be set when a monitoring target is specified, and policies for Auto-Scaling Groups can be set on a group-by-group basis.

Checking Event Settings

To check the event settings, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.
  2. On the Event Settings page, enter the search conditions for the service you want to check in the search area, and then click the Search button.
    ItemDescription
    Search AreaThe search filter displayed in the search area varies depending on the service type.
    • Click the Detailed Search button to perform a detailed search.
    • Multiple conditions can be selected for each detailed search filter.
    Number of Monitoring Targets DisplayedDisplays the search results.
    • The default value is 20 per page.
    • The number of performance items displayed in the list can be changed to 10, 20, 30, 40, 50, or 100 per page.
    Monitoring TargetDisplays the name of the monitoring target.
    • Checking the box selects the monitoring target, and the Delete, Activate, and Notification Recipient buttons are activated.
    Performance ItemDisplays the performance item that is the target of the event setting.
    Individual ItemDisplays the individual performance item under the performance item.
    • If there are no individual items, they will not be displayed.
    Type/UnitDisplays the value type and unit of the corresponding performance item.
    Event LevelDisplays the risk level of the corresponding event.
    • The risk level is set by the user when adding an event.
    • Fatal: The most critical level.
    • Warning: The middle level of risk.
    • Information: The lowest level of risk and reference level.
    ThresholdDisplays the reference value used to compare the performance value.
    Notification RecipientDisplays the recipient of the event notification.
    • Placing the mouse cursor over the name displays the entire list.
    Policy StatusDisplays whether the event is activated or not.
    Detailed ViewDisplays detailed event information and allows modification.
    • Clicking Detailed View opens a popup window with detailed information about the corresponding event.
    AddAdds an event.
    DeleteDeletes an event.
    ActivateActivates or deactivates an event.
    Notification RecipientDisplays and manages event notification recipients.
    Table. Event Settings
Note
  • The name of the monitoring target can include Korean, English (uppercase and lowercase), numbers, and special characters (-, _, .), with a maximum of 100 characters.
  • If you do not have permission for the monitoring target, a message will be displayed indicating that you do not have permission, along with the target information.

Checking Detailed Event Settings

You can check detailed information about the monitoring target and event conditions, and modify the event conditions and notification information.

Adding Event Settings

To add event settings, follow these steps:

Note
  • Event policies can only be set when a monitoring target is specified.
  • Policies for Auto-Scaling Groups can be applied on a group-by-group basis.
  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.

  2. On the Event Settings page, click the Add button. The Add Event Settings popup window will be displayed.

    ItemDescription
    Target NameSelect the monitoring target to add event settings for.
    • Clicking the monitoring target list changes the service.
    • Changing the service will delete all event conditions created so far.
    • Click the Add button to search for and add the monitoring target of the currently selected service.
    • The selected monitoring target is displayed on the page, and you can delete the monitoring target by clicking the X or Delete All button.
    Event Settings AreaSet the performance and occurrence conditions for the event.
    Notification Information AreaSet the notification recipient and notification method for the event.
    Table. Add Event Settings Popup Window

  3. In the monitoring target area, select the service type and then click the Add button. The Add Monitoring Target popup window will be displayed.

  4. Select the monitoring target and then click the Confirm button.

    • You can select multiple monitoring targets at the same time.
    • If there are multiple monitoring targets, the set event will be added to each monitoring target.
    • If you select Kubernetes, you must also select the subtype.
  5. In the performance item area, click the performance item you want to add an event for, and then enter the event occurrence condition.

    • The number of times the performance item is added is displayed next to the performance item name.
    • If you select multiple performance items, you must enter the event occurrence condition for each performance item.
      ItemDescription
      Event Policy TemplateSelect an existing event policy template to apply.
      Performance ItemClick the performance item to set the event occurrence condition.
      Event LevelSet the event level.
      • Fatal: The most critical level.
      • Warning: The middle level of risk.
      • Information: The lowest level of risk and reference level.
      Performance TypeSelect the reference value used to determine whether the event occurs.
      • Collected Value: Uses the current value.
      • Delta Value: Uses the difference between the previous value and the current value.
      ThresholdSet the reference value used to compare the performance value.
      • This is the criterion for determining whether the event occurs.
      • Only numbers and decimal points can be entered.
      Comparison MethodSelect the method used to compare the performance value and the threshold.
      • Range: Checks if the performance value is within the specified range of the threshold.
      • Match: Checks if the performance value matches the threshold.
      • Mismatch: Checks if the performance value does not match the threshold.
      • Greater Than: Checks if the performance value is greater than the threshold.
      • Greater Than or Equal To: Checks if the performance value is greater than or equal to the threshold.
      • Less Than: Checks if the performance value is less than the threshold.
      • Less Than or Equal To: Checks if the performance value is less than or equal to the threshold.
      Individual ItemSpecifies the individual performance item under the performance item as the event condition.
      • This is only activated if the performance item can collect individual items.
      PrefixAdds a prefix to the event message.
      • This is used as a keyword to search for the event in the Event Status page.
      StatisticsSets the statistical method to apply to the collected performance values.
      • If a statistical method is set, the performance value to which the statistical method is applied is compared to the threshold to determine whether the event occurs. If not set, the most recent performance value is compared to the threshold.
      • Statistical Method: Selects one of the maximum, minimum, average, or sum to calculate the collected performance values.
      • Statistical Period: Sets the period for which the statistical method is applied. This is the period from the most recently collected performance value.
      Continuous Occurrence CountSets the number of consecutive monitoring values that meet the event occurrence condition.
      • This value is used as sensitivity to determine whether the event is a momentary anomaly or an actual event.
      Event Occurrence Notification TimeSets the time zone for event policy settings.
      Table. Add Event Settings - Event Settings Area
  6. In the Notification area, you can set up notifications.

    ItemDescription
    Notification Recipient Selection AreaSelect the notification recipient.
    • Clicking the Delete button after selecting the notification recipient deletes the recipient.
    Notification Recipient/GroupDisplays the list of recipients to whom the event notification will be sent.
    Event Risk LevelDisplays the risk level of the set event.
    Notification MethodDisplays the method used to send notifications to the recipient.
    AddAdds a new notification recipient from the address book.
    DeleteDeletes the notification recipient from the list.
    Table. Add Event Settings - Notification Information Area

  7. Check the notification recipient and then click the Confirm button.

Note
  • Only the Root user or IAM user of an account can be added as a notification recipient.
  • Multiple recipients can be selected at the same time.
  8. Set the notification method for each notification recipient based on the event risk level.

    • The notification method can be selected from email, SMS, or messaging, and multiple methods can be selected at the same time.
  9. After setting the notification method, click the Confirm button.
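
To make the interaction of Performance Type, Comparison Method, Statistics, and Continuous Occurrence Count concrete, the following sketch evaluates a series of collected values against a threshold. It is an interpretation of the table above, not the service's algorithm: `event_fires` and its parameter names are hypothetical, the statistical period is expressed as a number of samples, and the Range comparison is left out because the source does not say how its bounds are entered.

```python
from statistics import mean
from typing import Callable, Dict, List, Optional, Sequence

COMPARATORS: Dict[str, Callable[[float, float], bool]] = {
    "match": lambda value, threshold: value == threshold,
    "mismatch": lambda value, threshold: value != threshold,
    "greater_than": lambda value, threshold: value > threshold,
    "greater_than_or_equal": lambda value, threshold: value >= threshold,
    "less_than": lambda value, threshold: value < threshold,
    "less_than_or_equal": lambda value, threshold: value <= threshold,
}

STATISTICS: Dict[str, Callable[[Sequence[float]], float]] = {
    "max": max, "min": min, "average": mean, "sum": sum,
}


def event_fires(
    samples: Sequence[float],
    threshold: float,
    comparison: str,
    *,
    performance_type: str = "collected",  # or "delta"
    statistic: Optional[str] = None,      # None: compare the most recent value directly
    period: int = 1,                      # assumed: number of samples per statistic window
    consecutive: int = 1,                 # Continuous Occurrence Count
) -> bool:
    """Return True when the last `consecutive` evaluated values all meet the condition."""
    if performance_type == "delta":
        # Delta Value: difference between each value and the one before it.
        values: List[float] = [b - a for a, b in zip(samples, samples[1:])]
    else:
        values = list(samples)

    evaluated: List[float] = []
    for end in range(1, len(values) + 1):
        if statistic is None:
            evaluated.append(values[end - 1])
        elif end >= period:
            evaluated.append(STATISTICS[statistic](values[end - period:end]))

    recent = evaluated[-consecutive:]
    compare = COMPARATORS[comparison]
    return len(recent) == consecutive and all(compare(value, threshold) for value in recent)
```

With CPU samples `[40, 55, 91, 93, 95]`, a threshold of 90, and `greater_than`, the event fires for `consecutive=3` but not for `consecutive=4`, which is how the count filters out momentary spikes.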

Modifying Event Settings

To modify the event conditions and notification recipient information, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.
  2. On the Event Settings page, enter the search conditions for the service you want to modify in the search area, and then click the Search button.
  3. In the event policy list, click the Detailed View button for the event policy you want to modify. The Event Setting Details page will be displayed.
  4. On the Event Setting Details page, click the Modify button. The Modify Event Settings page will be displayed.
  5. On the Modify Event Settings page, enter the modified information and then click the Confirm button.
    • You can modify the event conditions and notification information.

Deleting Event Settings

To delete event settings, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.
  2. On the Event Settings page, enter the search conditions for the service you want to delete in the search area, and then click the Search button.
  3. In the event policy list, check the event policy you want to delete and then click the Delete button.
  4. In the confirmation popup window, click the Confirm button.

Changing Event Setting Activation

You can easily change the activation status of event policies.

  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.
  2. On the Event Settings page, enter the search conditions for the service you want to change in the search area, and then click the Search button.
  3. In the event policy list, check the event policy you want to change the activation status for and then click the Activate button. The Policy Activation popup window will be displayed.
  4. Select the activation status and then click the Confirm button.
    • You can change the activation status in bulk by clicking the Activate All or Deactivate All button.
Note
Deactivating an event policy will deactivate all active events that occurred due to the selected event policy.

Changing Event Notification Recipients

You can check and change the notification recipients for event occurrences in bulk.

Note
  • The event notification recipient change function is intended to change the notification recipients in bulk. Therefore, existing notification recipients will be deleted and changed to the new notification recipient settings.
  • To check and change the notification recipients for each policy, click the Modify button on the policy details page.
  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.

  2. On the Event Settings page, enter the search conditions for the service you want to modify in the search area, and then click the Search button.

  3. In the event policy list, check the event policy you want to modify and then click the Notification Recipient button. The Notification Recipient page will be displayed.

  4. On the Notification Recipient page, select the user to add as a notification recipient and then click the Confirm button.

    ItemDescription
    Event Policy ListDisplays the list of event policies to change the notification recipients for.
    • Click Add to add policies to change.
    • Clicking the Delete button for a policy deletes the policy.
    User Search AreaEnter the name, email, phone number, or company name to search for users.
    Notification Address BookUse the address book to check and add users.
    Search User ListDisplays the list of users included in the address book or search results.
    • Checking the user adds them to the notification recipient list.
    Notification Recipient ListDisplays the list of users to be added as notification recipients for the event policies displayed in the list.
    • Checking the user and clicking the Delete button removes the user from the list.
    Table. Changing Event Notification Recipients
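
Because the bulk change replaces recipients rather than adding to them, it may help to see the difference spelled out. The sketch below models that replace semantics with plain dictionaries; `replace_recipients` and the policy names are hypothetical and exist only for illustration.

```python
from typing import Dict, Iterable, Set


def replace_recipients(
    policies: Dict[str, Set[str]],
    selected: Iterable[str],
    new_recipients: Iterable[str],
) -> Dict[str, Set[str]]:
    """Return a copy in which every selected policy has exactly `new_recipients`."""
    replacement = set(new_recipients)
    updated = {name: set(recipients) for name, recipients in policies.items()}
    for name in selected:
        if name not in updated:
            raise KeyError(name)
        # Existing recipients are discarded, not merged with the new ones.
        updated[name] = set(replacement)
    return updated
```

Policies that were not selected keep their recipients, while each selected policy ends up with only the new list.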

Managing Event Templates

You can create event templates by setting monitoring targets, performance values that serve as the basis for event occurrence, and event risk levels. When adding or modifying events, you can use event policy templates to easily enter event conditions.

Checking the Event Policy Template List

To check the event policy template list, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.

  2. On the Event Settings page, click Event Policy Template. The Event Policy Template page will be displayed.

  3. On the Event Policy Template page, enter the search conditions for the service you want to check in the search area, and then click Search.

    ItemDescription
    Search AreaEnter the conditions for the event policy template to search
    Add Event Policy TemplateAdd an event policy template
    Template ListDisplays the event policy templates that match the search conditions
    Table. Event Policy Template List

Adding an Event Policy Template

To add an event policy template, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.

  2. On the Event Settings page, click the Event Policy Template button. The Event Policy Template page will be displayed.

  3. On the Event Policy Template page, click the Add Event Policy Template button. The Add Event Policy Template popup window will open.

  4. In the Add Event Policy Template popup window, set the service type and template information to add the event policy template.

    • * indicates required input items.
      ItemDescription
      Service TypeSelect the service type to set the event policy
      Template NameEnter the name of the template to create
      Template DescriptionEnter a description of the template to create
      Table. Adding an Event Policy Template - Service Type and Template Name Settings
  5. In the performance item section, click the performance item to add an event and enter the event occurrence conditions.

    • The number of times the performance item is added is displayed next to the performance name.
    • If multiple performance items are selected, event occurrence conditions must be entered for each performance item.
    • * indicates required input items.
      ItemDescription
      Load Event Policy TemplateSelect an existing event policy template to apply
      Performance ItemClick the performance item to set the event condition
      Event LevelSet the event risk level
      Performance TypeSelect the performance value to use as the basis for event occurrence
      ThresholdSet the threshold value to compare with the collected performance value
      Comparison MethodSelect the comparison method to determine event occurrence
      Individual ItemSpecify individual performance items as event conditions
      PrefixAdd a prefix to the event message
      StatisticsSet the statistical method to apply to the collected performance value
      Continuous Occurrence CountSet the number of consecutive monitoring values that meet the event occurrence conditions
      Event Occurrence Notification TimeSet the time zone for event policy settings
      Table. Adding an Event Policy Template - Performance Item
  6. Set the notification target and method when an event occurs.

    ItemDescription
    AddAdd a new notification target
    DeleteDelete the selected notification target
    Notification Target/GroupDisplays the list of notification targets
    Event Risk LevelDisplays the event risk level to be notified
    Notification MethodDisplays the notification method
    Table. Adding an Event Policy Template - Notification Target Settings

Note
  • Only account members and address book entries registered in the account can be added as notification targets.
  • Multiple targets can be selected at the same time.
  7. Click the Confirm button. The event policy template will be added, and a toast popup message will be displayed.

Modifying and Deleting an Event Policy Template

To modify or delete an event policy template, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.
  2. On the Event Settings page, click the Event Policy Template button. The Event Policy Template page will be displayed.
  3. On the Event Policy Template page, enter the search conditions for the service you want to check in the search area, and then click the Search button.
  4. Click the More button at the top right of the template you want to modify or delete, and then click Modify or Delete.
    • Modify: The template modification popup window will open. Modify the template and click the Confirm button.
    • Delete: Deletes the template once you confirm the deletion.
  5. If you selected Delete, click the Confirm button. The template will be deleted, and a toast popup message will be displayed.

Sharing an Event Policy Template

To share an event policy template, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Settings. The Event Settings page will be displayed.
  2. On the Event Settings page, click the Event Policy Template button. The Event Policy Template page will be displayed.
  3. On the Event Policy Template page, enter the search conditions for the service you want to check in the search area, and then click the Search button.
  4. Click the More > Share button at the top right of the template you want to share.
  5. Select the user to share with and click the > button. The selected user will be added to the sharing targets.
  6. Click the Confirm button. The template will be shared, and a toast popup message will be displayed.

Event Filtering

You can filter event notifications for a specific period. During the event filtering period, events will occur, but notifications will not be delivered.
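
The effect of a filtering period can be sketched as a check performed before a notification is sent. In the code below, `FilterWindow` and `should_notify` are hypothetical names, the period is treated as a half-open interval (start included, end excluded), and repeat types are not modeled; all of these are assumptions.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Iterable


@dataclass(frozen=True)
class FilterWindow:
    start: datetime      # timezone-aware start of the filtering period
    end: datetime        # timezone-aware end of the filtering period (exclusive)
    in_use: bool = True  # corresponds to the usage setting of the filtering

    def covers(self, moment: datetime) -> bool:
        return self.in_use and self.start <= moment < self.end


def should_notify(event_time: datetime, windows: Iterable[FilterWindow]) -> bool:
    """The event still occurs; only its notification is suppressed inside a window."""
    return not any(window.covers(event_time) for window in windows)
```

Because the timestamps are timezone-aware, an event recorded in UTC is compared correctly against a window defined in another time zone.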

To check the event filtering list, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Filtering. The Event Filtering page will be displayed.
Item | Description
Filtering Timeline | Displays the registered filtering timeline by date
Filtering List | Displays the registered filtering information and action status in a list
Add | Adds a new event filtering
Delete | Deletes the selected event filtering
Search Area | Searches for event filtering or monitoring targets
Table. Event Filtering List
Note
The filtering timeline chart is displayed based on the time zone set for the logged-in user’s account.

Adding Event Filtering

To add event filtering, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Filtering. The Event Filtering page will be displayed.

  2. On the Event Filtering page, click the Add button. The Add Event Filtering popup window will open.

  3. In the Add Event Filtering popup window, enter the filtering information.

    Item | Description
    Event Filtering | Enter the name of the event filtering
    Usage | Set the usage of the event filtering
    Time Zone | Set the time zone for the event filtering
    Repeat Type | Set the repeat type of the event filtering
    Period | Set the period for the event filtering
    Event Filtering Target | Select the service type and monitoring target to apply the event filtering
    Table. Adding Event Filtering

  4. Click the Confirm button. The event filtering will be added, and a toast popup message will be displayed.

Note
After an event filtering has been added, you can change its Usage setting by modifying the event filtering.

Modifying Event Filtering

To modify event filtering, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Filtering. The Event Filtering page will be displayed.
  2. On the Event Filtering page, click the name of the filtering you want to modify. The Event Filtering Details popup window will open.
  3. In the Event Filtering Details popup window, click the Modify button. The Modify Event Filtering popup window will open.
  4. In the Modify Event Filtering popup window, enter the modified contents and click the Confirm button. The event filtering will be modified, and a toast popup message will be displayed.

Deleting Event Filtering

To delete event filtering, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Event Filtering. The Event Filtering page will be displayed.
  2. On the Event Filtering page, select the event filtering you want to delete and click the Delete button. The event filtering will be deleted, and a toast popup message will be displayed.
    • Multiple event filterings can be selected at the same time.

Managing Notification Groups

You can manage notification targets as a group when an event occurs. Notification Groups can be used to efficiently manage notification targets and easily set up notifications.

To check the Notification Groups, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Notification Groups. The Notification Groups page will be displayed.
  2. On the Notification Groups page, you can check and manage the notification groups.
    Item | Description
    Add Notification Group | Adds a new notification group
    Notification Group | Displays all notification groups created by the user
    Detailed Search | Searches for notification groups by name
    Keyword Search | Searches for notification groups, user names, creation dates, and last modification dates
Note
Notification Groups are valid only within the account, so they can contain only users who have access permissions to the account. Users whose access permissions are removed are automatically excluded from the address book.
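
The membership rule in the note above can be pictured as a set intersection. The sketch below is only an illustration under that assumption; `effective_recipients` and the user names are hypothetical and do not come from the service.

```python
def effective_recipients(group_members: set[str], account_users: set[str]) -> set[str]:
    """Keep only the group members who still have access to the account."""
    return group_members & account_users


group = {"alice", "bob", "carol"}
account = {"alice", "carol", "dave"}  # bob's access permission was removed
recipients = effective_recipients(group, account)
```

Here `recipients` contains only `alice` and `carol`, because `bob` no longer has access to the account.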

Adding a Notification Group

To add a notification group, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Add Notification Group.
  2. On the Add Notification Group page, enter the notification group name, description, and add users.
  3. Click the Save button. The notification group will be added.

Modifying a Notification Group

You can add users to a notification group or delete registered users from the group.

Adding Users

To add users to a notification group, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Notification Groups.
  2. Click the notification group you want to add users to, and then click Modify.
  3. Select the users to add.
    • Only account members can be added to the address book.
    • You can use the real-time search GUI to quickly find the desired members.
  4. Click the Save button. The user address will be added, and a toast popup message will be displayed.

Deleting a Notification Group

To delete a notification group, follow these steps:

  1. Click Cloud Monitoring Console > Event Management > Notification Groups.
  2. Click the notification group you want to delete.
  3. Select the notification group to delete and click Delete.
    • Multiple addresses can be selected at the same time.
  4. Click the Confirm button. The address will be deleted, and a toast popup message will be displayed.

2.5 - Using Custom Dashboards

A custom dashboard is a user-defined dashboard that allows users to select and arrange widgets according to their preferences. Users can customize the monitoring information and share the created custom dashboard with other users.

Note
Custom dashboards are created separately from the Account dashboard and can display monitoring information from multiple Accounts at once.

Getting Started with Custom Dashboards

Users can create a custom dashboard and add desired widgets to view monitoring information.

Creating a Custom Dashboard

To create a custom dashboard, follow these steps:

  1. Click Custom Dashboard Management in the top-right menu. The Custom Dashboard Management page will be displayed.
  2. Click Add Dashboard. The Add Dashboard popup window will open.
  3. Enter the name of the dashboard to be created and click the Save button.
  4. The created custom dashboard will be displayed in the My Dashboards list.

Adding Widgets

Custom dashboards provide various types of widgets, such as performance statistics, comparison charts, and event lists. Users can add widgets to customize their dashboard according to their monitoring needs.

To add a widget, follow these steps:

  1. Click Custom Dashboard Management in the top-right menu. The Custom Dashboard Management page will be displayed.
  2. From the My Dashboards list, select the custom dashboard to which you want to add a widget.
  3. Click the + button or Add Widget button in the top-right corner of the dashboard. The Add Widget popup window will open.
  4. In the Add Widget popup window, select the widget to add to the dashboard.
    • When a widget is selected, detailed settings and previews will be displayed.
    • For a description of each widget and how to configure it, see Custom Widgets.
  5. Click the Confirm button.
Note
Widgets are added to the dashboard with a default size.

Custom Widgets

The following types of widgets can be added to a custom dashboard:

Widget Name | Description
Title Box | Displays a title box on the custom dashboard.
Event Status | Displays the status of events that have occurred.
Monitoring Status | Displays the number of monitoring targets and their status.
Top 5 Performance | Displays the top 5 monitoring targets with the highest performance usage rates.
Event Map | Displays the number of events that occurred, by service and risk level.
Event History | Displays the number of events that occurred, by date and risk level.
Time Series Graph | Displays the performance of a selected monitoring target as a time series graph.
Status Indicator | Displays the statistical values and risk levels of monitoring targets.
Instance Map | Displays the performance values of monitoring targets with different color densities.
Table. Custom Dashboard Widget Types

Title Box

Displays a title box on the custom dashboard.

  • Up to 10 title boxes can be created.
  • Multiple title boxes can be added at the same time.
Item | Description
Title | Enter the text to be displayed on the title box.
Add | Adds a new text box.
Delete | Deletes the corresponding text box.
Table. Custom Dashboard Title Box

Event Status

Displays the status of events that have occurred.

  • You can display all events that have occurred or only active events.
Item | Description
Widget Name | Enter the name of the widget.
Query Range | Select the range of events to be displayed on the widget.
Table. Event Status

Monitoring Status

Displays the number of monitoring targets and their status.

Item | Description
Widget Name | Enter the name of the widget.
Table. Monitoring Status

Top 5 Performance

Displays the top 5 monitoring targets with the highest performance usage rates.

Item | Description
Widget Name | Enter the name of the widget.
Service | Select the service to check performance.
Performance Item | Select the performance item to display.
Table. Top 5 Performance
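
As a rough picture of what a "top 5" selection involves, the sketch below ranks monitoring targets by one performance value. It is an illustration only: `top_n_by_usage`, the target names, and the usage values are hypothetical, and the widget's actual ranking logic is not documented here.

```python
import heapq


def top_n_by_usage(usage_by_target: dict[str, float], n: int = 5) -> list[tuple[str, float]]:
    """Return the n targets with the highest usage, highest first."""
    return heapq.nlargest(n, usage_by_target.items(), key=lambda item: item[1])


# Illustrative CPU usage values (%) for seven hypothetical targets.
sample = {
    "vm-a": 91.2, "vm-b": 12.5, "vm-c": 77.0, "vm-d": 64.3,
    "vm-e": 88.8, "vm-f": 5.1, "vm-g": 43.9,
}
top5 = top_n_by_usage(sample)
```

`heapq.nlargest` avoids sorting the full list when only a few entries are needed, which matters little for seven targets but keeps the sketch reasonable for larger sets.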

Event Map

Displays the number of events that occurred, by service and risk level.

Item | Description
Widget Name | Enter the name of the widget.
Table. Event Map
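
Counting events by service and risk level amounts to grouping on two keys. The following sketch shows one way to do that; the function name, the event records, and the risk level names ("Fatal", "Warning") are assumptions made for illustration, not values taken from the service.

```python
from collections import Counter


def count_events_by_service_and_risk(events: list[dict]) -> Counter:
    """Count events per (service, risk level) pair."""
    return Counter((e["service"], e["risk_level"]) for e in events)


# Hypothetical event records.
events = [
    {"service": "Virtual Server", "risk_level": "Fatal"},
    {"service": "Virtual Server", "risk_level": "Warning"},
    {"service": "Virtual Server", "risk_level": "Fatal"},
    {"service": "File Storage", "risk_level": "Warning"},
]
event_map = count_events_by_service_and_risk(events)
```

Grouping on (date, risk level) instead of (service, risk level) would give the kind of counts an event history view needs.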

Event History

Displays the number of events that occurred, by date and risk level.

Item | Description
Widget Name | Enter the name of the widget.
Table. Event History

Time Series Graph

Displays the performance of a selected monitoring target as a time series graph.

  • The time series graph can be changed using the dashboard’s query period setting feature.
  • When the mouse cursor is placed over the graph, the time and target performance values can be checked.
Item | Description
Widget Name | Enter the name of the widget.
Service | Select the service to check performance.
Monitoring Target | Select the monitoring target to display on the graph.
Performance Item | Select the performance item to display on the graph.
Add Option | Risk intervals can be displayed.
Table. Time Series Graph
Note

The graph type can be changed by clicking the icon in the top-right corner of the preview.

  • Line graph
  • Area graph
  • Cumulative bar graph
  • Scatter graph

Status Indicator

Displays the statistical values and risk levels of monitoring targets.

When the mouse cursor is placed over the status indicator on the monitoring dashboard, detailed information about the item can be checked.

Item | Description
Widget Name | Enter the name of the widget.
Service | Select the service to check performance.
Monitoring Target | Select the monitoring target to display on the graph.
Performance Item | Select the performance item to display on the graph.
Statistics | Select the statistical method to display the performance values of the monitoring target.
Add Option | Risk intervals can be displayed.
Table. Status Indicator
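
The combination of a statistical method and risk intervals can be sketched as below. Everything specific in this example is an assumption: the statistic names (`avg`, `max`, `min`), the risk level names, and the threshold values are hypothetical and are not taken from the service.

```python
from statistics import mean

# Hypothetical set of statistical methods.
STATISTICS = {"avg": mean, "max": max, "min": min}


def status_indicator(values, statistic, thresholds, default_level="Normal"):
    """Reduce values with the chosen statistic and map the result to a risk level.

    thresholds is a list of (lower_bound, level) pairs; the level with the
    highest lower bound that the value reaches is returned.
    """
    value = STATISTICS[statistic](values)
    level = default_level
    for bound, name in sorted(thresholds):
        if value >= bound:
            level = name
    return value, level


cpu_samples = [50, 70, 90]                      # illustrative usage values (%)
risk_intervals = [(60, "Warning"), (80, "Fatal")]  # illustrative thresholds
avg_value, avg_level = status_indicator(cpu_samples, "avg", risk_intervals)
max_value, max_level = status_indicator(cpu_samples, "max", risk_intervals)
```

The same samples yield different risk levels depending on the statistic chosen, which is why the statistical method is a separate widget setting.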

Instance Map

Displays the performance values of monitoring targets with different color densities.

  • When the mouse cursor is placed over each heatmap, detailed information about the item can be checked.
Item | Description
Widget Name | Enter the name of the widget.
Service | Select the service to check performance.
Monitoring Target | Select the monitoring target to display on the graph.
Performance Item | Select the performance item to display on the graph.
Table. Instance Map
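
Showing values "with different color densities" is commonly done by scaling each value into a fixed range and using the result as color opacity. The sketch below illustrates that idea only; the function names, the 0 to 100 range, and the base color are assumptions, not the widget's actual rendering rules.

```python
def color_density(value: float, lo: float, hi: float) -> float:
    """Scale value linearly into the range 0.0 to 1.0, clamping out-of-range input."""
    if hi <= lo:
        return 0.0
    ratio = (value - lo) / (hi - lo)
    return max(0.0, min(1.0, ratio))


def to_rgba(value: float, lo: float = 0.0, hi: float = 100.0,
            base_rgb: tuple[int, int, int] = (220, 50, 47)) -> str:
    """Return a CSS-style rgba() string whose opacity reflects the value."""
    r, g, b = base_rgb
    return f"rgba({r}, {g}, {b}, {color_density(value, lo, hi):.2f})"


cell_half = to_rgba(50.0)    # mid-range usage
cell_full = to_rgba(100.0)   # maximum usage
```

Clamping keeps values outside the expected range (for example, an unnormalized CPU value above 100%) from producing an invalid opacity.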

Viewing Custom Dashboards

To view a custom dashboard, follow these steps:

  1. Click Custom Dashboard Management in the top-right menu. The Custom Dashboard Management page will be displayed.
  2. Select the custom dashboard to view from the My Dashboards list.
Item | Description
Dashboard List | Displays the list of custom dashboards. The list can be clicked to change the dashboard to be viewed.
Dashboard Name | Displays the name of the user-defined dashboard.
Dashboard Settings |
  • Date/Time: Displays the reference time for analysis information.
  • Refresh: Refreshes to the current time.
  • Stop/Start: Turns the automatic refresh feature on or off.
  • Settings: Allows setting the data query period or changing the automatic refresh cycle. (See Setting Query Periods)
Add Widget | Adds a new widget to the dashboard.
Edit Dashboard | Allows editing the currently set custom dashboard.
  • Modify Dashboard: Modifies the name of the currently selected dashboard.
  • Copy Dashboard: Copies the currently selected dashboard and creates a new custom dashboard with the same widgets.
  • Delete Dashboard: Deletes the currently selected dashboard.
  • Share Dashboard: Shares the dashboard with specific users so they can view it. For more information, see Sharing Custom Dashboards.
Custom Widgets | Displays the widgets that make up the dashboard.
  • The position and size of widgets can be changed, or they can be modified or deleted. For more information, see Managing Custom Widgets.
  • Graphic widgets can be downloaded as image files.
Table. Custom Dashboard Information
Note
The star icon next to the dashboard name can be clicked to add the dashboard to favorites. Favorited dashboards are displayed at the top of the dashboard list.

Downloading Widgets

Graphic widgets can be downloaded as image files (*.png).
When the mouse cursor is placed over a graph widget, a download button will be displayed in the top-right corner. Clicking the download button will download the widget as an image file.

Sharing Custom Dashboards

Custom dashboards can be shared with other users so they can view the dashboard.

Note
Shared dashboards will remain shared even if the user is later removed from the current Account.

To share a custom dashboard, follow these steps:

  1. Click Custom Dashboard Management in the top-right menu. The Custom Dashboard Management page will be displayed.
  2. Select the custom dashboard to share from the My Dashboards list.
  3. Click the Share button next to the dashboard name.
  4. Enter the user ID or email address of the user to share the dashboard with and click the Share button.
  5. Click the Confirm button.

Managing Custom Dashboards

You can modify, copy, or delete custom dashboards.

  1. Click Custom Dashboard Management in the top-right menu. The Custom Dashboard Management page will be displayed.
  2. Select the custom dashboard to manage from the My Dashboards list.
  3. Click the More button on the top right of the dashboard, then select the desired command.
    • Edit Dashboard: Modify the dashboard name.
    • Copy Dashboard: Copy the dashboard to create a new dashboard.
    • Share Dashboard: Share the dashboard with other users.
    • Delete Dashboard: Delete the dashboard.

Managing Custom Widgets

You can change the position and size of widgets, or modify, copy, or delete them.

Changing Widget Position

You can change the position of a widget by clicking on its name and dragging it.

Changing Widget Size

To change the size of a widget, follow these steps:

  1. Place the mouse cursor over the widget. The Resize button appears at the bottom right of the widget.
  2. Click the Resize button and drag it to adjust the size as needed.

Modifying, Copying, and Deleting Widgets

To modify, copy, or delete a widget, follow these steps:

  1. Place the mouse cursor over the widget. The More button appears at the top right of the widget.
  2. Click the More button, then click the desired command.
    • Edit Widget: Modify the widget’s chart settings.
    • Copy Widget: Copy the widget to create a new widget with the same content.
    • Delete Widget: Delete the widget.

2.6 - Managing Agents

An agent is a module that collects performance values, logs, and Windows events from the monitoring target. To use the monitoring function, users must check the installation status of the agent and operate and manage it.

Note
  • If IP access control is set for the monitoring target, agent management cannot be used. If agent management is not available, check the IP access control setting status of the selected monitoring target.
  • The agent management function uses the sudo command, so the sudo package must be installed in advance.

Agent Management Overview

There are performance collection agents, log collection agents, and Windows event log collection agents.

  • Agents must be installed manually by the user on the monitoring target according to their needs.

Managing Agents

Managing Performance Agents

To install and manage performance agents, follow these steps.

  1. Click Cloud Monitoring Console > Performance Analysis. The Performance Analysis page will be displayed.
  2. On the Performance Analysis page, select the monitoring target and click the Details button. The Monitoring Target Details popup window will open.
  3. In the Monitoring Target Details popup window, click the Agent tab. The Agent tab will be displayed.
  4. On the Agent tab, click the Performance button.
  5. Click the Copy icon to the right of the installation command to copy the command.
  6. Paste the copied command into the monitoring target resource.
  7. Run the copied command on the monitoring target resource.
Note
The command uses the sudo command, so the sudo package must be installed.
Item | Description
Installation | Downloads and runs the script file required for agent installation.
Start | Runs the agent start command.
Stop | Runs the agent stop command.
Delete | Runs the agent delete command.
Update | Downloads and runs the script file required for agent update.
Table. Managing Performance Agents
Note

To check the agent service status, use the following method:

  • Linux: $ sudo systemctl status metricbeat
  • Windows: Task Manager → Services → metricbeat → Status (Running)
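
On Linux, the same check can be scripted. The sketch below is one possible approach, not something provided by the service: `is_service_active` is a hypothetical helper that calls `systemctl is-active`, which prints `active` when the unit is running. The `run` parameter exists only so that the function can be exercised without systemd.

```python
import subprocess


def is_service_active(service: str, run=subprocess.run) -> bool:
    """Return True if systemd reports the given unit as active."""
    result = run(
        ["systemctl", "is-active", service],
        capture_output=True,
        text=True,
        check=False,  # a non-zero exit code only means "not active" here
    )
    return result.stdout.strip() == "active"
```

On a monitoring target with the performance agent installed, `is_service_active("metricbeat")` would be expected to return `True`. Depending on system configuration, elevated privileges may or may not be needed for this query.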

Managing Log Agents

To install and manage log agents, follow these steps.

  1. Click Cloud Monitoring Console > Performance Analysis. The Performance Analysis page will be displayed.
  2. On the Performance Analysis page, select the monitoring target and click the Details button. The Monitoring Target Details popup window will open.
  3. In the Monitoring Target Details popup window, click the Agent tab. The Agent tab will be displayed.
  4. On the Agent tab, click the Log button.
  5. Click the Copy icon to the right of the installation command to copy the command.
  6. Paste the copied command into the monitoring target resource.
  7. Run the copied command on the monitoring target resource.
Note
The command uses the sudo command, so the sudo package must be installed.
Item | Description
Installation | Downloads and runs the script file required for agent installation.
Start | Runs the agent start command.
Stop | Runs the agent stop command.
Delete | Runs the agent delete command.
Update | Downloads and runs the script file required for agent update.
Table. Managing Log Agents
Note

To check the agent service status, use the following method:

  • Linux: $ sudo systemctl status filebeat
  • Windows: Task Manager → Services → filebeat → Status (Running)

To add logs to be monitored, select the log addition action, enter the log name and log path correctly, and click the Generate Command button. Paste the generated command into the monitoring target resource and run it.

Managing Event Agents

To install and manage event agents, follow these steps.

  1. Click Cloud Monitoring Console > Performance Analysis. The Performance Analysis page will be displayed.
  2. On the Performance Analysis page, select the monitoring target and click the Details button. The Monitoring Target Details popup window will open.
  3. In the Monitoring Target Details popup window, click the Agent tab. The Agent tab will be displayed.
  4. On the Agent tab, click the Event button.
  5. Click the Copy icon to the right of the installation command to copy the command.
  6. Paste the copied command into the monitoring target resource.
  7. Run the copied command on the monitoring target resource.
Note
The event agent is available for Windows instances.
Item | Description
Installation | Downloads and runs the script file required for agent installation.
Start | Runs the agent start command.
Stop | Runs the agent stop command.
Delete | Runs the agent delete command.
Update | Downloads and runs the script file required for agent update.
Table. Managing Event Agents
Note

To check the agent service status, use the following method:

  • Windows: Task Manager → Services → winlogbeat → Status (Running)
Note
Agent commands are provided regardless of the instance status of the Virtual Server (Bare Metal Server).

2.7 - Appendix A. Monitoring Targets by Service

Compute type

Virtual Server

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | OS | Agent, Agentless | 1m
Log | OS | Agent | Log Occurrence Time
Status | OS | Agentless | 1m
Table. Virtual Server Monitoring Information
Note
If the server type of a Virtual Server is changed, performance metric data may not be collected normally for a short time. Normal collection resumes in the next collection cycle (1 minute).

GPU Server

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | OS | Agent, Agentless | 1m
Log | OS | Agent | Log Occurrence Time
Status | OS | Agentless | 1m
Table. GPU Server Monitoring Information

Bare Metal Server

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | OS | Agent | 1m
Log | OS | Agent | Log Occurrence Time
Status | OS | N/A | -
Table. Bare Metal Server Monitoring Information

Multi-node GPU Cluster [Cluster Fabric]

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | OS | Agent | 1m
Log | OS | Agent | Log Occurrence Time
Status | OS | N/A | -
Table. Multi-node GPU Cluster [Cluster Fabric] Monitoring Information

Multi-node GPU Cluster [Node]

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | OS | Agent | 1m
Log | OS | Agent | Log Occurrence Time
Status | OS | N/A | -
Table. Multi-node GPU Cluster [Node] Monitoring Information

Storage type

All Storage type services have the same monitoring target, collection method, and collection cycle.

  • File Storage
  • Object Storage
  • Block Storage(BM)
  • Block Storage(VM)
Division | Monitoring Target | Collection Method | Collection Cycle
Performance | Storage | Agentless | 1m
Log | Storage | N/A | -
Status | Storage | Agentless | 1m
Table. Storage type monitoring information

Database type

All Database type services have the same monitoring target, collection method, and collection cycle.

  • PostgreSQL(DBaaS)
  • MariaDB(DBaaS)
  • MySQL(DBaaS)
  • Microsoft SQL Server
  • EPAS
  • CacheStore(DBaaS)
    • Redis
    • Valkey
Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Database Process, OS | Agent | 1m
Log | Database Process, OS | Agent | Log Occurrence Time
Status | Database Process | Agent | 1m
Status | OS | Agentless | 1m
Table. Database type monitoring information

Data Analytics type

Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Data Analytics Process, OS | Agent | 1m
Log | Data Analytics Process, OS | Agent | When a log occurs
Status | Data Analytics Process | Agent | 1m
Status | OS | Agentless | 1m
Table. Data Analytics type monitoring information

Container type

Kubernetes Engine

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | Cluster, Namespace, Node, ReplicaSet, Deployment, StatefulSet, DaemonSet, Job, CronJob, Pod | Agentless | 5m
Log | Cluster, Namespace, Node, ReplicaSet, Deployment, StatefulSet, DaemonSet, Job, CronJob, Pod | Agentless | When a log occurs
Status | Cluster, Namespace, Node, ReplicaSet, Deployment, StatefulSet, DaemonSet, Job, CronJob, Pod | Agentless | 5m
Table. Kubernetes Engine monitoring information

Container Registry

Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Container Registry | Agentless | 5m
Log | Container Registry | Agentless | When a log occurs
Status | Container Registry | Agentless | 5m
Table. Container Registry Monitoring Information

Networking type

VPC

Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Internet Gateway | Agentless | 5m
Log | Internet Gateway | N/A | -
Status | Internet Gateway | N/A | -
Table. Internet Gateway Monitoring Information
Caution
Performance monitoring is only possible when the Internet Gateway is created.

Load Balancer(OLD)

Load Balancer(OLD)

Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Load Balancer | Agentless | 5m
Log | Load Balancer | N/A | -
Status | Load Balancer | Agentless | 5m
Table. Load Balancer Monitoring Information

Load Balancer Listener(OLD)

Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Load Balancer Listener | Agentless | 5m
Log | Load Balancer Listener | N/A | -
Status | Load Balancer Listener | Agentless | 5m
Table. Load Balancer Listener Monitoring Information

Load Balancer

Load Balancer

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | Load Balancer | Agentless | 5m
Log | Load Balancer | N/A | -
Status | Load Balancer | Agentless | 5m
Table. Load Balancer Monitoring Information

Load Balancer Listener

Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Load Balancer Listener | Agentless | 5m
Log | Load Balancer Listener | N/A | -
Status | Load Balancer Listener | Agentless | 5m
Table. Load Balancer Listener Monitoring Information

Load Balancer Server Group

Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Load Balancer Server Group | Agentless | 5m
Log | Load Balancer Server Group | N/A | -
Status | Load Balancer Server Group | Agentless | 5m
Table. Load Balancer Server Group Monitoring Information

Direct Connect

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | Direct Connect | Agentless | 5m
Log | Direct Connect | N/A | -
Status | Direct Connect | N/A | -
Table. Direct Connect Monitoring Information

Cloud WAN

Division | Monitoring Target | Collection Method | Collection Cycle
Performance | Cloud WAN | Agentless | 10m
Log | Cloud WAN | N/A | -
Status | Cloud WAN | Agentless | 10m
Table. Cloud WAN Monitoring Information

Global CDN

Category | Monitoring Target | Collection Method | Collection Cycle
Performance | Global CDN | Agentless | 5m
Log | Global CDN | N/A | -
Status | Global CDN | Agentless | 5m
Table. Global CDN Monitoring Information

2.8 - Appendix B. Performance Items by Service

Compute Type

Virtual Server

Agentless (Basic Metrics)

Performance Item Group Name | Performance Item Name | Collection Unit | Collection Cycle | Description
Memory | Memory Total [Basic] | bytes | 1m | Available memory in bytes
Memory | Memory Used [Basic] | bytes | 1m | Currently used memory in bytes
Memory | Memory Swap In [Basic] | bytes | 1m | Swapped memory in bytes
Memory | Memory Swap Out [Basic] | bytes | 1m | Swapped memory in bytes
Memory | Memory Free [Basic] | bytes | 1m | Unused memory in bytes
Disk | Disk Read Bytes [Basic] | bytes | 1m | Read bytes
Disk | Disk Read Requests [Basic] | cnt | 1m | Number of read requests
Disk | Disk Write Bytes [Basic] | bytes | 1m | Write bytes
Disk | Disk Write Requests [Basic] | cnt | 1m | Number of write requests
CPU | CPU Usage [Basic] | % | 1m | Average system CPU usage over 1 minute
State | Instance State [Basic] | state | 1m | Instance state
Network | Network In Bytes [Basic] | bytes | 1m | Received bytes
Network | Network In Dropped [Basic] | cnt | 1m | Dropped received packets
Network | Network In Packets [Basic] | cnt | 1m | Number of received packets
Network | Network Out Bytes [Basic] | bytes | 1m | Sent bytes
Network | Network Out Dropped [Basic] | cnt | 1m | Dropped sent packets
Network | Network Out Packets [Basic] | cnt | 1m | Number of sent packets
Table. Virtual Server (Agentless) Performance Items
Note
  • On Windows, memory performance metrics are provided only if the Balloon Driver or the performance agent is installed.

Agent (Detailed Metrics)

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
CPUCore Usage [IO Wait]%1mCPU time spent in wait state (disk wait)
CPUCore Usage [System]%1mCPU time spent in kernel space
CPUCore Usage [User]%1mCPU time spent in user space
CPUCPU Corescnt1mNumber of CPU cores on the host. The maximum value of the unnormalized ratio is 100%* of the number of cores. The unnormalized ratio already reflects this value, and the maximum value is 100%* of the number of cores.
CPUCPU Usage [Active]%1mCPU time used, excluding idle and IOWait states (using all 4 cores at 100%: 400%)
CPUCPU Usage [Idle]%1mCPU time spent in idle state
CPUCPU Usage [IO Wait]%1mCPU time spent in wait state (disk wait)
CPUCPU Usage [System]%1mCPU time used by the kernel (using all 4 cores at 100%: 400%)
CPUCPU Usage [User]%1mCPU time used by the user (using all 4 cores at 100%: 400%)
CPUCPU Usage/Core [Active]%1mCPU time used, excluding idle and IOWait states (normalized by the number of cores, using all 4 cores at 100%: 100%)
CPUCPU Usage/Core [Idle]%1mCPU time spent in idle state
CPUCPU Usage/Core [IO Wait]%1mCPU time spent in wait state (disk wait)
CPUCPU Usage/Core [System]%1mCPU time used by the kernel (normalized by the number of cores, using all 4 cores at 100%: 100%)
CPUCPU Usage/Core [User]%1mCPU time used by the user (normalized by the number of cores, using all 4 cores at 100%: 100%)
DiskDisk CPU Usage [IO Request]%1mCPU time spent executing I/O requests to the device (device bandwidth utilization). If this value is close to 100%, the device is saturated.
DiskDisk Queue Size [Avg]num1mAverage queue length of requests executed on the device
DiskDisk Read Bytesbytes1mBytes read from the device per second
DiskDisk Read Bytes [Delta Avg]bytes1mAverage of system.diskio.read.bytes_delta for individual disks
DiskDisk Read Bytes [Delta Max]bytes1mMaximum of system.diskio.read.bytes_delta for individual disks
DiskDisk Read Bytes [Delta Min]bytes1mMinimum of system.diskio.read.bytes_delta for individual disks
DiskDisk Read Bytes [Delta Sum]bytes1mSum of system.diskio.read.bytes_delta for individual disks
DiskDisk Read Bytes [Delta]bytes1mDelta of system.diskio.read.bytes for individual disks
DiskDisk Read Bytes [Success]bytes1mTotal bytes read successfully. For Linux, it is assumed that the sector size is 512 and the value is the number of sectors read multiplied by 512
DiskDisk Read Requestscnt1mNumber of read requests to the disk device per second
DiskDisk Read Requests [Delta Avg]cnt1mAverage of system.diskio.read.count_delta for individual disks
DiskDisk Read Requests [Delta Max]cnt1mMaximum of system.diskio.read.count_delta for individual disks
DiskDisk Read Requests [Delta Min]cnt1mMinimum of system.diskio.read.count_delta for individual disks
DiskDisk Read Requests [Delta Sum]cnt1mSum of system.diskio.read.count_delta for individual disks
DiskDisk Read Requests [Success Delta]cnt1mDelta of system.diskio.read.count for individual disks
DiskDisk Read Requests [Success]cnt1mTotal number of successful read completions
DiskDisk Request Size [Avg]num1mAverage size of requests executed on the device (in sectors)
DiskDisk Service Time [Avg]ms1mAverage service time of I/O requests executed on the device (in milliseconds)
DiskDisk Wait Time [Avg]ms1mAverage time spent waiting for I/O requests to be executed on the device
DiskDisk Wait Time [Read]ms1mAverage disk read wait time
DiskDisk Wait Time [Write]ms1mAverage disk write wait time
DiskDisk Write Bytes [Delta Avg]bytes1mAverage of system.diskio.write.bytes_delta for individual disks
DiskDisk Write Bytes [Delta Max]bytes1mMaximum of system.diskio.write.bytes_delta for individual disks
DiskDisk Write Bytes [Delta Min]bytes1mMinimum of system.diskio.write.bytes_delta for individual disks
DiskDisk Write Bytes [Delta Sum]bytes1mSum of system.diskio.write.bytes_delta for individual disks
DiskDisk Write Bytes [Delta]bytes1mDelta of system.diskio.write.bytes for individual disks
DiskDisk Write Bytes [Success]bytes1mTotal bytes written successfully. For Linux, it is assumed that the sector size is 512 and the value is the number of sectors written multiplied by 512
DiskDisk Write Requestscnt1mNumber of write requests to the disk device per second
DiskDisk Write Requests [Delta Avg]cnt1mAverage of system.diskio.write.count_delta for individual disks
DiskDisk Write Requests [Delta Max]cnt1mMaximum of system.diskio.write.count_delta for individual disks
DiskDisk Write Requests [Delta Min]cnt1mMinimum of system.diskio.write.count_delta for individual disks
DiskDisk Write Requests [Delta Sum]cnt1mSum of system.diskio.write.count_delta for individual disks
DiskDisk Write Requests [Success Delta]cnt1mDelta of system.diskio.write.count for individual disks
DiskDisk Write Requests [Success]cnt1mTotal number of successful writes
DiskDisk Writes Bytesbytes1mBytes written to the device per second
FileSystemFilesystem Hang Checkstate1mFilesystem (local/NFS) hang check (normal: 1, abnormal: 0)
FileSystemFilesystem Nodescnt1mTotal number of file nodes in the filesystem
FileSystemFilesystem Nodes [Free]cnt1mTotal number of available file nodes in the filesystem
FileSystemFilesystem Size [Available]bytes1mDisk space available for non-privileged users (in bytes)
FileSystemFilesystem Size [Free]bytes1mAvailable disk space (in bytes)
FileSystemFilesystem Size [Total]bytes1mTotal disk space (in bytes)
FileSystemFilesystem Usage%1mPercentage of used disk space
FileSystemFilesystem Usage [Avg]%1mAverage of filesystem.used.pct for individual filesystems
FileSystemFilesystem Usage [Inode]%1mInode usage rate
FileSystemFilesystem Usage [Max]%1mMaximum of filesystem.used.pct for individual filesystems
FileSystemFilesystem Usage [Min]%1mMinimum of filesystem.used.pct for individual filesystems
FileSystemFilesystem Usage [Total]%1m-
FileSystemFilesystem Usedbytes1mUsed disk space (in bytes)
FileSystemFilesystem Used [Inode]bytes1mInode usage
MemoryMemory Freebytes1mTotal available memory (in bytes), excluding system cache and buffer memory (see system.memory.actual.free).
MemoryMemory Free [Actual]bytes1mActual available memory (in bytes). The calculation method varies depending on the OS. For Linux, it is the value of MemAvailable in /proc/meminfo, or the calculated value using available memory and cache/buffer memory if /proc/meminfo is not available. For OSX, it is the sum of available memory and inactive memory. For Windows, it is the same value as system.memory.free.
MemoryMemory Free [Swap]bytes1mAvailable swap memory
MemoryMemory Totalbytes1mTotal memory
MemoryMemory Total [Swap]bytes1mTotal swap memory
MemoryMemory Usage%1mPercentage of used memory
MemoryMemory Usage [Actual]%1mPercentage of actually used memory
MemoryMemory Usage [Cache Swap]%1mCache swap usage rate
MemoryMemory Usage [Swap]%1mPercentage of used swap memory
MemoryMemory Usedbytes1mUsed memory
MemoryMemory Used [Actual]bytes1mActually used memory (in bytes), calculated as total memory minus available memory. The available memory is calculated differently depending on the OS (see system.memory.actual.free).
MemoryMemory Used [Swap]bytes1mUsed swap memory
NetworkCollisionscnt1mNetwork collisions
NetworkNetwork In Bytesbytes1mReceived bytes
NetworkNetwork In Bytes [Delta Avg]bytes1mAverage of system.network.in.bytes_delta for individual networks
NetworkNetwork In Bytes [Delta Max]bytes1mMaximum of system.network.in.bytes_delta for individual networks
NetworkNetwork In Bytes [Delta Min]bytes1mMinimum of system.network.in.bytes_delta for individual networks
NetworkNetwork In Bytes [Delta Sum]bytes1mSum of system.network.in.bytes_delta for individual networks
NetworkNetwork In Bytes [Delta]bytes1mDelta of received bytes
NetworkNetwork In Droppedcnt1mDropped received packets
NetworkNetwork In Errorscnt1mNumber of receive errors
NetworkNetwork In Packetscnt1mNumber of received packets
NetworkNetwork In Packets [Delta Avg]cnt1mAverage of system.network.in.packets_delta for individual networks
NetworkNetwork In Packets [Delta Max]cnt1mMaximum of system.network.in.packets_delta for individual networks
NetworkNetwork In Packets [Delta Min]cnt1mMinimum of system.network.in.packets_delta for individual networks
NetworkNetwork In Packets [Delta Sum]cnt1mSum of system.network.in.packets_delta for individual networks
NetworkNetwork In Packets [Delta]cnt1mDelta of received packets
NetworkNetwork Out Bytesbytes1mSent bytes
NetworkNetwork Out Bytes [Delta Avg]bytes1mAverage of system.network.out.bytes_delta for individual networks
NetworkNetwork Out Bytes [Delta Max]bytes1mMaximum of system.network.out.bytes_delta for individual networks
NetworkNetwork Out Bytes [Delta Min]bytes1mMinimum of system.network.out.bytes_delta for individual networks
NetworkNetwork Out Bytes [Delta Sum]bytes1mSum of system.network.out.bytes_delta for individual networks
NetworkNetwork Out Bytes [Delta]bytes1mDelta of sent bytes
NetworkNetwork Out Droppedcnt1mDropped sent packets. This value is not reported by the OS, so it is always 0 on Darwin and BSD.
NetworkNetwork Out Errorscnt1mNumber of transmit errors
NetworkNetwork Out Packetscnt1mNumber of sent packets
NetworkNetwork Out Packets [Delta Avg]cnt1mAverage of system.network.out.packets_delta for individual networks
NetworkNetwork Out Packets [Delta Max]cnt1mMaximum of system.network.out.packets_delta for individual networks
NetworkNetwork Out Packets [Delta Min]cnt1mMinimum of system.network.out.packets_delta for individual networks
NetworkNetwork Out Packets [Delta Sum]cnt1mSum of system.network.out.packets_delta for individual networks
NetworkNetwork Out Packets [Delta]cnt1mDelta of sent packets
NetworkOpen Connections [TCP]cnt1mNumber of open TCP connections
NetworkOpen Connections [UDP]cnt1mNumber of open UDP connections
NetworkPort Usage%1mPort usage rate
NetworkSYN Sent Socketscnt1mNumber of sockets in the SYN_SENT state (when connecting to a remote host)
ProcessKernel PID Maxcnt1mValue of kernel.pid_max
ProcessKernel Thread Maxcnt1mValue of kernel.threads-max
ProcessProcess CPU Usage%1mPercentage of CPU time consumed by the process since the last update. This value is similar to the %CPU value displayed by the top command on Unix systems.
ProcessProcess CPU Usage/Core%1mPercentage of CPU time used by the process since the last event. This value is normalized by the number of cores and ranges from 0 to 100%.
ProcessProcess Memory Usage%1mPercentage of main memory (RAM) used by the process
ProcessProcess Memory Usedbytes1mResident Set size. The amount of memory used by the process in RAM. On Windows, this is the current working set size.
ProcessProcess PIDPID1mProcess PID
ProcessProcess PPIDPID1mParent process PID
ProcessProcesses [Dead]cnt1mNumber of dead processes
ProcessProcesses [Idle]cnt1mNumber of idle processes
ProcessProcesses [Running]cnt1mNumber of running processes
ProcessProcesses [Sleeping]cnt1mNumber of sleeping processes
ProcessProcesses [Stopped]cnt1mNumber of stopped processes
ProcessProcesses [Total]cnt1mTotal number of processes
ProcessProcesses [Unknown]cnt1mNumber of processes whose state is unknown or could not be retrieved
ProcessProcesses [Zombie]cnt1mNumber of zombie processes
ProcessRunning Process Usage%1mProcess usage rate
ProcessRunning Processescnt1mNumber of running processes
ProcessRunning Thread Usage%1mThread usage rate
ProcessRunning Threadscnt1mTotal number of threads running in running processes
SystemContext Switchescnt1mNumber of context switches (per second)
SystemLoad/Core [1 min]cnt1mLoad over the last 1 minute, normalized by the number of cores
SystemLoad/Core [15 min]cnt1mLoad over the last 15 minutes, normalized by the number of cores
SystemLoad/Core [5 min]cnt1mLoad over the last 5 minutes, normalized by the number of cores
SystemMultipaths [Active]cnt1mNumber of active paths for external storage connections
SystemMultipaths [Failed]cnt1mNumber of failed paths for external storage connections
SystemMultipaths [Faulty]cnt1mNumber of faulty paths for external storage connections
SystemNTP Offsetnum1mMeasured offset (time difference between the NTP server and the local environment) of the last sample
SystemRun Queue Lengthnum1mLength of the run queue
SystemUptimems1mSystem uptime (in milliseconds)
WindowsContext Switchiescnt1mNumber of CPU context switches (per second)
WindowsDisk Read Bytes [Sec]cnt1mNumber of bytes read from the Windows logical disk per second
WindowsDisk Read Time [Avg]sec1mAverage time spent reading data (in seconds)
WindowsDisk Transfer Time [Avg]sec1mAverage disk wait time
WindowsDisk Usage%1mDisk usage rate
WindowsDisk Write Bytes [Sec]cnt1mNumber of bytes written to the Windows logical disk per second
WindowsDisk Write Time [Avg]sec1mAverage time spent writing data (in seconds)
WindowsPagingfile Usage%1mPaging file usage rate
WindowsPool Used [Non Paged]bytes1mNon-paged pool usage of kernel memory
WindowsPool Used [Paged]bytes1mPaged pool usage of kernel memory
WindowsProcess [Running]cnt1mNumber of currently running processes
WindowsThreads [Running]cnt1mNumber of currently running threads
WindowsThreads [Waiting]cnt1mNumber of threads waiting for processor time
Table. Virtual Server (Agent) Performance Items
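
The [Delta Avg], [Delta Max], [Delta Min], and [Delta Sum] items in the table above aggregate a per-disk delta across individual disks, and the [Success] byte counters on Linux are derived from sector counts at an assumed 512 bytes per sector. The following is a minimal Python sketch of that arithmetic. The function names, device names, and sample values are hypothetical and only illustrate the calculation; they are not the agent's actual implementation or output.

```python
from statistics import mean

SECTOR_SIZE = 512  # sector size the table assumes on Linux


def sectors_to_bytes(sectors: int) -> int:
    """Convert a sector count to bytes using the assumed 512-byte sector size."""
    return sectors * SECTOR_SIZE


def aggregate_deltas(previous: dict, current: dict) -> dict:
    """Compute per-disk deltas between two samples, then aggregate across disks."""
    # Only disks present in both samples can produce a delta.
    deltas = [current[disk] - previous[disk] for disk in current if disk in previous]
    if not deltas:
        raise ValueError("no disk appears in both samples")
    return {
        "avg": mean(deltas),
        "max": max(deltas),
        "min": min(deltas),
        "sum": sum(deltas),
    }


# Hypothetical cumulative write counters (in sectors) for two collection cycles.
previous = {"sda": sectors_to_bytes(1000), "sdb": sectors_to_bytes(4000)}
current = {"sda": sectors_to_bytes(1200), "sdb": sectors_to_bytes(4600)}

summary = aggregate_deltas(previous, current)
print(summary)
```

With these sample values, the per-disk deltas are 102,400 bytes (sda) and 307,200 bytes (sdb), so the sum is 409,600 bytes and the average is 204,800 bytes.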

GPU Server

Agentless (Basic Metrics)

| Performance Item Group Name | Performance Item Name | Collection Unit | Collection Cycle | Description |
|---|---|---|---|---|
| Memory | Memory Total [Basic] | bytes | 1m | Available memory in bytes |
| Memory | Memory Used [Basic] | bytes | 1m | Currently used memory in bytes |
| Memory | Memory Swap In [Basic] | bytes | 1m | Memory swapped in, in bytes |
| Memory | Memory Swap Out [Basic] | bytes | 1m | Memory swapped out, in bytes |
| Memory | Memory Free [Basic] | bytes | 1m | Unused memory in bytes |
| Disk | Disk Read Bytes [Basic] | bytes | 1m | Read bytes |
| Disk | Disk Read Requests [Basic] | cnt | 1m | Number of read requests |
| Disk | Disk Write Bytes [Basic] | bytes | 1m | Write bytes |
| Disk | Disk Write Requests [Basic] | cnt | 1m | Number of write requests |
| CPU | CPU Usage [Basic] | % | 1m | Average system CPU usage over 1 minute |
| State | Instance State [Basic] | state | 1m | Instance state |
| Network | Network In Bytes [Basic] | bytes | 1m | Received bytes |
| Network | Network In Dropped [Basic] | cnt | 1m | Dropped received packets |
| Network | Network In Packets [Basic] | cnt | 1m | Number of received packets |
| Network | Network Out Bytes [Basic] | bytes | 1m | Sent bytes |
| Network | Network Out Dropped [Basic] | cnt | 1m | Dropped sent packets |
| Network | Network Out Packets [Basic] | cnt | 1m | Number of sent packets |
Table. GPU Server (Agentless) Performance Items

Agent (Detailed Metrics)

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
GPUGPU Countcnt1mNumber of GPUs
GPUGPU Memory Usage%1mGPU memory usage rate
GPUGPU Memory Usedbytes1mGPU memory usage
GPUGPU Temperature1mGPU temperature
GPUGPU Usage%1mTotal GPU usage rate (using all 8 GPUs at 100%: 800%)
GPUGPU Usage [Avg]%1mAverage GPU usage rate (%)
GPUGPU Power CapW1mMaximum power capacity of the GPU
GPUGPU Power UsageW1mCurrent power usage of the GPU
GPUGPU Memory Usage [Avg]%1mAverage GPU memory usage rate
GPUGPU Count in usecnt1mNumber of GPUs in use by jobs running on the node
GPUExecution State for nvidia-smistate1mExecution result of the nvidia-smi command
CPUCore Usage [IO Wait]%1mCPU time spent in wait state (disk wait)
CPUCore Usage [System]%1mCPU time spent in kernel space
CPUCore Usage [User]%1mCPU time spent in user space
CPUCPU Corescnt1mNumber of CPU cores on the host. The maximum value of an unnormalized ratio is 100% * the number of cores. A normalized ratio already reflects this value, so its maximum value is 100%.
CPUCPU Usage [Active]%1mCPU time used, excluding idle and IOWait states (using all 4 cores at 100%: 400%)
CPUCPU Usage [Idle]%1mCPU time spent in idle state
CPUCPU Usage [IO Wait]%1mCPU time spent in wait state (disk wait)
CPUCPU Usage [System]%1mCPU time used by the kernel (using all 4 cores at 100%: 400%)
CPUCPU Usage [User]%1mCPU time used by the user (using all 4 cores at 100%: 400%)
CPUCPU Usage/Core [Active]%1mCPU time used, excluding idle and IOWait states (normalized by the number of cores, using all 4 cores at 100%: 100%)
CPUCPU Usage/Core [Idle]%1mCPU time spent in idle state
CPUCPU Usage/Core [IO Wait]%1mCPU time spent in wait state (disk wait)
CPUCPU Usage/Core [System]%1mCPU time used by the kernel (normalized by the number of cores, using all 4 cores at 100%: 100%)
CPUCPU Usage/Core [User]%1mCPU time used by the user (normalized by the number of cores, using all 4 cores at 100%: 100%)
DiskDisk CPU Usage [IO Request]%1mThe ratio of CPU time spent executing I/O requests for the device (device bandwidth utilization). If this value is close to 100%, the device is in a saturated state.
DiskDisk Queue Size [Avg]num1mThe average queue length of requests executed for the device.
DiskDisk Read Bytesbytes1mThe number of bytes read from the device per second.
DiskDisk Read Bytes [Delta Avg]bytes1mThe average of system.diskio.read.bytes_delta for individual disks.
DiskDisk Read Bytes [Delta Max]bytes1mThe maximum of system.diskio.read.bytes_delta for individual disks.
DiskDisk Read Bytes [Delta Min]bytes1mThe minimum of system.diskio.read.bytes_delta for individual disks.
DiskDisk Read Bytes [Delta Sum]bytes1mThe sum of system.diskio.read.bytes_delta for individual disks.
DiskDisk Read Bytes [Delta]bytes1mThe delta value of system.diskio.read.bytes for individual disks.
DiskDisk Read Bytes [Success]bytes1mThe total number of bytes read successfully. On Linux, it is assumed that the sector size is 512 and the value is calculated by multiplying the number of sectors read by 512.
DiskDisk Read Requestscnt1mThe number of read requests for the disk device per second.
DiskDisk Read Requests [Delta Avg]cnt1mThe average of system.diskio.read.count_delta for individual disks.
DiskDisk Read Requests [Delta Max]cnt1mThe maximum of system.diskio.read.count_delta for individual disks.
DiskDisk Read Requests [Delta Min]cnt1mThe minimum of system.diskio.read.count_delta for individual disks.
DiskDisk Read Requests [Delta Sum]cnt1mThe sum of system.diskio.read.count_delta for individual disks.
DiskDisk Read Requests [Success Delta]cnt1mThe delta value of system.diskio.read.count for individual disks.
DiskDisk Read Requests [Success]cnt1mThe total number of successful read requests.
DiskDisk Request Size [Avg]num1mThe average size of requests executed for the device (in sectors).
DiskDisk Service Time [Avg]ms1mThe average service time for I/O requests issued to the device (in milliseconds).
DiskDisk Wait Time [Avg]ms1mThe average time spent waiting for I/O requests to be executed on the device.
DiskDisk Wait Time [Read]ms1mThe average disk wait time for read operations.
DiskDisk Wait Time [Write]ms1mThe average disk wait time for write operations.
DiskDisk Write Bytes [Delta Avg]bytes1mThe average of system.diskio.write.bytes_delta for individual disks.
DiskDisk Write Bytes [Delta Max]bytes1mThe maximum of system.diskio.write.bytes_delta for individual disks.
DiskDisk Write Bytes [Delta Min]bytes1mThe minimum of system.diskio.write.bytes_delta for individual disks.
DiskDisk Write Bytes [Delta Sum]bytes1mThe sum of system.diskio.write.bytes_delta for individual disks.
DiskDisk Write Bytes [Delta]bytes1mThe delta value of system.diskio.write.bytes for individual disks.
DiskDisk Write Bytes [Success]bytes1mThe total number of bytes written successfully. On Linux, it is assumed that the sector size is 512 and the value is calculated by multiplying the number of sectors written by 512.
DiskDisk Write Requestscnt1mThe number of write requests for the disk device per second.
DiskDisk Write Requests [Delta Avg]cnt1mThe average of system.diskio.write.count_delta for individual disks.
DiskDisk Write Requests [Delta Max]cnt1mThe maximum of system.diskio.write.count_delta for individual disks.
DiskDisk Write Requests [Delta Min]cnt1mThe minimum of system.diskio.write.count_delta for individual disks.
DiskDisk Write Requests [Delta Sum]cnt1mThe sum of system.diskio.write.count_delta for individual disks.
DiskDisk Write Requests [Success Delta]cnt1mThe delta value of system.diskio.write.count for individual disks.
DiskDisk Write Requests [Success]cnt1mThe total number of successful write requests.
DiskDisk Writes Bytesbytes1mThe number of bytes written to the device per second.
FileSystemFilesystem Hang Checkstate1mFilesystem (local/NFS) hang check (normal: 1, abnormal: 0).
FileSystemFilesystem Nodescnt1mThe total number of file nodes in the file system.
FileSystemFilesystem Nodes [Free]cnt1mThe total number of available file nodes in the file system.
FileSystemFilesystem Size [Available]bytes1mThe available disk space (in bytes) that can be used by non-privileged users.
FileSystemFilesystem Size [Free]bytes1mThe available disk space (in bytes).
FileSystemFilesystem Size [Total]bytes1mThe total disk space (in bytes).
FileSystemFilesystem Usage%1mThe percentage of used disk space.
FileSystemFilesystem Usage [Avg]%1mThe average of filesystem.used.pct for individual file systems.
FileSystemFilesystem Usage [Inode]%1mThe inode usage rate.
FileSystemFilesystem Usage [Max]%1mThe maximum of filesystem.used.pct for individual file systems.
FileSystemFilesystem Usage [Min]%1mThe minimum of filesystem.used.pct for individual file systems.
FileSystemFilesystem Usage [Total]%1m-
FileSystemFilesystem Usedbytes1mThe used disk space (in bytes).
FileSystemFilesystem Used [Inode]bytes1mThe inode usage.
MemoryMemory Freebytes1mThe total available memory (in bytes), excluding memory used by system cache and buffers (see system.memory.actual.free).
MemoryMemory Free [Actual]bytes1mThe actual available memory (in bytes), which varies depending on the OS. On Linux, it is calculated using /proc/meminfo, and on OSX, it is the sum of available and inactive memory. On Windows, it is the same as system.memory.free.
MemoryMemory Free [Swap]bytes1mThe available swap memory.
MemoryMemory Totalbytes1mThe total memory.
MemoryMemory Total [Swap]bytes1mThe total swap memory.
MemoryMemory Usage%1mThe percentage of used memory.
MemoryMemory Usage [Actual]%1mThe percentage of actual used memory.
MemoryMemory Usage [Cache Swap]%1mThe cache swap usage rate.
MemoryMemory Usage [Swap]%1mThe percentage of used swap memory.
MemoryMemory Usedbytes1mThe used memory.
MemoryMemory Used [Actual]bytes1mThe actual used memory (in bytes), which is the total memory minus the available memory. The available memory varies depending on the OS (see system.memory.actual.free).
MemoryMemory Used [Swap]bytes1mThe used swap memory.
NetworkCollisionscnt1mNetwork collisions.
NetworkNetwork In Bytesbytes1mThe number of bytes received.
NetworkNetwork In Bytes [Delta Avg]bytes1mThe average of system.network.in.bytes_delta for individual networks.
NetworkNetwork In Bytes [Delta Max]bytes1mThe maximum of system.network.in.bytes_delta for individual networks.
NetworkNetwork In Bytes [Delta Min]bytes1mThe minimum of system.network.in.bytes_delta for individual networks.
NetworkNetwork In Bytes [Delta Sum]bytes1mThe sum of system.network.in.bytes_delta for individual networks.
NetworkNetwork In Bytes [Delta]bytes1mThe delta value of the number of bytes received.
NetworkNetwork In Droppedcnt1mThe number of packets dropped during reception.
NetworkNetwork In Errorscnt1mThe number of errors during reception.
NetworkNetwork In Packetscnt1mThe number of packets received.
NetworkNetwork In Packets [Delta Avg]cnt1mThe average of system.network.in.packets_delta for individual networks.
NetworkNetwork In Packets [Delta Max]cnt1mThe maximum of system.network.in.packets_delta for individual networks.
NetworkNetwork In Packets [Delta Min]cnt1mThe minimum of system.network.in.packets_delta for individual networks.
NetworkNetwork In Packets [Delta Sum]cnt1mThe sum of system.network.in.packets_delta for individual networks.
NetworkNetwork In Packets [Delta]cnt1mThe delta value of the number of packets received.
NetworkNetwork Out Bytesbytes1mThe number of bytes sent.
NetworkNetwork Out Bytes [Delta Avg]bytes1mThe average of system.network.out.bytes_delta for individual networks.
NetworkNetwork Out Bytes [Delta Max]bytes1mThe maximum of system.network.out.bytes_delta for individual networks.
NetworkNetwork Out Bytes [Delta Min]bytes1mThe minimum of system.network.out.bytes_delta for individual networks.
NetworkNetwork Out Bytes [Delta Sum]bytes1mThe sum of system.network.out.bytes_delta for individual networks.
NetworkNetwork Out Bytes [Delta]bytes1mThe delta value of the number of bytes sent.
NetworkNetwork Out Droppedcnt1mThe number of packets dropped during transmission. This value is not reported by the OS and is always 0 on Darwin and BSD.
NetworkNetwork Out Errorscnt1mThe number of errors during transmission.
NetworkNetwork Out Packetscnt1mThe number of packets sent.
NetworkNetwork Out Packets [Delta Avg]cnt1mThe average of system.network.out.packets_delta for individual networks.
NetworkNetwork Out Packets [Delta Max]cnt1mThe maximum of system.network.out.packets_delta for individual networks.
NetworkNetwork Out Packets [Delta Min]cnt1mThe minimum of system.network.out.packets_delta for individual networks.
NetworkNetwork Out Packets [Delta Sum]cnt1mThe sum of system.network.out.packets_delta for individual networks.
NetworkNetwork Out Packets [Delta]cnt1mThe delta value of the number of packets sent.
NetworkOpen Connections [TCP]cnt1mThe number of open TCP connections.
NetworkOpen Connections [UDP]cnt1mThe number of open UDP connections.
NetworkPort Usage%1mThe port usage rate.
NetworkSYN Sent Socketscnt1mThe number of sockets in the SYN_SENT state (when connecting to a remote host).
ProcessKernel PID Maxcnt1mThe kernel.pid_max value.
ProcessKernel Thread Maxcnt1mThe kernel.threads-max value.
ProcessProcess CPU Usage%1mThe percentage of CPU time consumed by the process since the last update. This value is similar to the %CPU value displayed by the top command on Unix systems.
ProcessProcess CPU Usage/Core%1mThe percentage of CPU time used by the process since the last event, normalized by the number of cores (0-100%).
ProcessProcess Memory Usage%1mThe percentage of main memory (RAM) used by the process.
ProcessProcess Memory Usedbytes1mThe resident set size, which is the amount of memory used by the process in RAM. On Windows, it is the current working set size.
ProcessProcess PIDPID1mThe process ID.
ProcessProcess PPIDPID1mThe parent process ID.
ProcessProcesses [Dead]cnt1mThe number of dead processes.
ProcessProcesses [Idle]cnt1mThe number of idle processes.
ProcessProcesses [Running]cnt1mThe number of running processes.
ProcessProcesses [Sleeping]cnt1mThe number of sleeping processes.
ProcessProcesses [Stopped]cnt1mThe number of stopped processes.
ProcessProcesses [Total]cnt1mThe total number of processes.
ProcessProcesses [Unknown]cnt1mThe number of processes whose state is unknown or could not be retrieved.
ProcessProcesses [Zombie]cnt1mThe number of zombie processes.
ProcessRunning Process Usage%1mThe process usage rate.
ProcessRunning Processescnt1mThe number of running processes.
ProcessRunning Thread Usage%1mThe thread usage rate.
ProcessRunning Threadscnt1mThe total number of threads running in running processes.
SystemContext Switchescnt1mThe number of context switches per second.
SystemLoad/Core [1 min]cnt1mThe load average over the last 1 minute, normalized by the number of cores.
SystemLoad/Core [15 min]cnt1mThe load average over the last 15 minutes, normalized by the number of cores.
SystemLoad/Core [5 min]cnt1mThe load average over the last 5 minutes, normalized by the number of cores.
SystemMultipaths [Active]cnt1mThe number of active paths for external storage connections.
SystemMultipaths [Failed]cnt1mThe number of failed paths for external storage connections.
SystemMultipaths [Faulty]cnt1mThe number of faulty paths for external storage connections.
SystemNTP Offsetnum1mThe measured offset (time difference between the NTP server and the local environment) of the last sample.
SystemRun Queue Lengthnum1mThe length of the run queue.
SystemUptimems1mThe OS uptime (in milliseconds).
WindowsContext Switchiescnt1mThe number of CPU context switches per second.
WindowsDisk Read Bytes [Sec]cnt1mThe number of bytes read from the Windows logical disk per second.
WindowsDisk Read Time [Avg]sec1mThe average time spent reading data (in seconds).
WindowsDisk Transfer Time [Avg]sec1mThe average disk wait time.
WindowsDisk Usage%1mThe disk usage rate.
WindowsDisk Write Bytes [Sec]cnt1mThe number of bytes written to the Windows logical disk per second.
WindowsDisk Write Time [Avg]sec1mThe average time spent writing data (in seconds).
WindowsPagingfile Usage%1mThe paging file usage rate.
WindowsPool Used [Non Paged]bytes1mThe Nonpaged Pool usage of kernel memory.
WindowsPool Used [Paged]bytes1mThe Paged Pool usage of kernel memory.
WindowsProcess [Running]cnt1mThe number of currently running processes.
WindowsThreads [Running]cnt1mThe number of currently running threads.
WindowsThreads [Waiting]cnt1mThe number of threads waiting for processor time.
Table. GPU Server (Agent) Performance Items
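
Several items in the table above come in an unnormalized form (for example, CPU Usage [Active], which reaches 400% when all 4 cores are fully used) and a per-core normalized form (CPU Usage/Core [Active], which reaches 100% in the same situation). GPU Usage and GPU Usage [Avg] are described the same way for 8 GPUs. The following Python sketch illustrates that relationship. The function names are hypothetical, and treating the average as the total divided by the device count is an assumption made for illustration, not a statement of how the agent computes it.

```python
def normalize_by_cores(unnormalized_pct: float, cores: int) -> float:
    """Convert an unnormalized percentage (max 100% * cores) to a per-core value (max 100%)."""
    if cores <= 0:
        raise ValueError("cores must be a positive integer")
    return unnormalized_pct / cores


def total_and_average(per_device_pct: list) -> tuple:
    """Return the summed usage across devices and the per-device average."""
    if not per_device_pct:
        raise ValueError("at least one device reading is required")
    total = sum(per_device_pct)
    return total, total / len(per_device_pct)


# All 4 cores fully busy: 400% unnormalized corresponds to 100% per core.
cpu_per_core = normalize_by_cores(400.0, cores=4)

# All 8 GPUs fully busy: 800% in total, 100% on average (assumed relationship).
gpu_total, gpu_avg = total_and_average([100.0] * 8)

print(cpu_per_core, gpu_total, gpu_avg)
```

When setting event thresholds, it helps to check which form an item uses: a threshold of 90% means nearly full load for a normalized item, but less than one fully used core for an unnormalized item on a multi-core host.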

Bare Metal Server

Agent (Detailed Metrics)

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
CPUCore Usage [IO Wait]%1mThe ratio of CPU time spent in a waiting state (disk wait).
CPUCore Usage [System]%1mThe percentage of CPU time used by the kernel.
CPUCore Usage [User]%1mThe percentage of CPU time used in the user space.
CPUCPU Corescnt1mThe number of CPU cores on the host. The maximum value for unnormalized rates is 100% * the number of cores. The maximum value for normalized rates is 100%.
CPUCPU Usage [Active]%1mThe percentage of CPU time used, excluding idle and IOWait states (all 4 cores using 100%: 400%).
CPUCPU Usage [Idle]%1mThe ratio of CPU time spent in an idle state.
CPUCPU Usage [IO Wait]%1mThe ratio of CPU time spent in a waiting state (disk wait).
CPUCPU Usage [System]%1mThe percentage of CPU time used by the kernel (all 4 cores using 100%: 400%).
CPUCPU Usage [User]%1mThe percentage of CPU time used in the user space (all 4 cores using 100%: 400%).
CPUCPU Usage/Core [Active]%1mThe percentage of CPU time used, excluding idle and IOWait states (normalized by the number of cores, all 4 cores using 100%: 100%).
CPUCPU Usage/Core [Idle]%1mThe ratio of CPU time spent in an idle state.
CPUCPU Usage/Core [IO Wait]%1mThe ratio of CPU time spent in a waiting state (disk wait).
CPUCPU Usage/Core [System]%1mThe percentage of CPU time used by the kernel (normalized by the number of cores, all 4 cores using 100%: 100%).
CPUCPU Usage/Core [User]%1mThe percentage of CPU time used in the user space (normalized by the number of cores, all 4 cores using 100%: 100%).
DiskDisk CPU Usage [IO Request]%1mThe ratio of CPU time spent executing I/O requests for the device (device bandwidth utilization). If this value is close to 100%, the device is in a saturated state.
DiskDisk Queue Size [Avg]num1mThe average queue length of requests executed for the device.
DiskDisk Read Bytesbytes1mThe number of bytes read from the device per second.
DiskDisk Read Bytes [Delta Avg]bytes1mThe average of system.diskio.read.bytes_delta for individual disks.
DiskDisk Read Bytes [Delta Max]bytes1mThe maximum of system.diskio.read.bytes_delta for individual disks.
DiskDisk Read Bytes [Delta Min]bytes1mThe minimum of system.diskio.read.bytes_delta for individual disks.
DiskDisk Read Bytes [Delta Sum]bytes1mThe sum of system.diskio.read.bytes_delta for individual disks.
DiskDisk Read Bytes [Delta]bytes1mThe delta value of system.diskio.read.bytes for individual disks.
DiskDisk Read Bytes [Success]bytes1mThe total number of bytes read successfully. On Linux, it is assumed that the sector size is 512 and the value is calculated by multiplying the number of sectors read by 512.
DiskDisk Read Requestscnt1mThe number of read requests for the disk device per second.
DiskDisk Read Requests [Delta Avg]cnt1mThe average of system.diskio.read.count_delta for individual disks.
DiskDisk Read Requests [Delta Max]cnt1mThe maximum of system.diskio.read.count_delta for individual disks.
DiskDisk Read Requests [Delta Min]cnt1mThe minimum of system.diskio.read.count_delta for individual disks.
DiskDisk Read Requests [Delta Sum]cnt1mThe sum of system.diskio.read.count_delta for individual disks.
DiskDisk Read Requests [Success Delta]cnt1mThe delta value of system.diskio.read.count for individual disks.
DiskDisk Read Requests [Success]cnt1mThe total number of successful read requests.
DiskDisk Request Size [Avg]num1mThe average size of requests executed for the device (in sectors).
DiskDisk Service Time [Avg]ms1mThe average service time for I/O requests issued to the device (in milliseconds).
DiskDisk Wait Time [Avg]ms1mThe average time spent waiting for I/O requests to be executed on the device.
DiskDisk Wait Time [Read]ms1mThe average disk wait time for read operations.
DiskDisk Wait Time [Write]ms1mThe average disk wait time for write operations.
DiskDisk Write Bytes [Delta Avg]bytes1mThe average of system.diskio.write.bytes_delta for individual disks.
DiskDisk Write Bytes [Delta Max]bytes1mThe maximum of system.diskio.write.bytes_delta for individual disks.
DiskDisk Write Bytes [Delta Min]bytes1mThe minimum of system.diskio.write.bytes_delta for individual disks.
DiskDisk Write Bytes [Delta Sum]bytes1mThe sum of system.diskio.write.bytes_delta for individual disks.
DiskDisk Write Bytes [Delta]bytes1mThe delta value of system.diskio.write.bytes for individual disks.
DiskDisk Write Bytes [Success]bytes1mThe total number of bytes written successfully. On Linux, it is assumed that the sector size is 512 and the value is calculated by multiplying the number of sectors written by 512.
DiskDisk Write Requestscnt1mThe number of write requests for the disk device per second.
DiskDisk Write Requests [Delta Avg]cnt1mThe average of system.diskio.write.count_delta for individual disks.
DiskDisk Write Requests [Delta Max]cnt1mThe maximum of system.diskio.write.count_delta for individual disks.
DiskDisk Write Requests [Delta Min]cnt1mThe minimum of system.diskio.write.count_delta for individual disks.
DiskDisk Write Requests [Delta Sum]cnt1mThe sum of system.diskio.write.count_delta for individual disks.
DiskDisk Write Requests [Success Delta]cnt1mThe delta value of system.diskio.write.count for individual disks.
DiskDisk Write Requests [Success]cnt1mThe total number of successful write requests.
DiskDisk Writes Bytesbytes1mThe number of bytes written to the device per second.
FileSystemFilesystem Hang Checkstate1mFilesystem (local/NFS) hang check (normal: 1, abnormal: 0).
FileSystemFilesystem Nodescnt1mThe total number of file nodes in the file system.
FileSystemFilesystem Nodes [Free]cnt1mThe total number of available file nodes in the file system.
FileSystemFilesystem Size [Available]bytes1mThe available disk space (in bytes) that can be used by non-privileged users.
FileSystemFilesystem Size [Free]bytes1mThe available disk space (in bytes).
FileSystemFilesystem Size [Total]bytes1mThe total disk space (in bytes).
FileSystemFilesystem Usage%1mThe percentage of used disk space.
FileSystemFilesystem Usage [Avg]%1mThe average of filesystem.used.pct for individual file systems.
FileSystemFilesystem Usage [Inode]%1mThe inode usage rate.
FileSystemFilesystem Usage [Max]%1mThe maximum of filesystem.used.pct for individual file systems.
FileSystemFilesystem Usage [Min]%1mThe minimum of filesystem.used.pct for individual file systems.
FileSystemFilesystem Usage [Total]%1m-
FileSystemFilesystem Usedbytes1mThe used disk space (in bytes).
FileSystemFilesystem Used [Inode]bytes1mThe inode usage.
MemoryMemory Freebytes1mThe total available memory (in bytes), excluding memory used by system cache and buffers (see system.memory.actual.free).
MemoryMemory Free [Actual]bytes1mThe actual available memory (in bytes), which varies depending on the OS. On Linux, it is calculated using /proc/meminfo, and on OSX, it is the sum of available and inactive memory. On Windows, it is the same as system.memory.free.
MemoryMemory Free [Swap]bytes1mThe available swap memory.
MemoryMemory Totalbytes1mThe total memory.
MemoryMemory Total [Swap]bytes1mThe total swap memory.
MemoryMemory Usage%1mThe percentage of used memory.
MemoryMemory Usage [Actual]%1mThe percentage of actual used memory.
MemoryMemory Usage [Cache Swap]%1mThe cache swap usage rate.
MemoryMemory Usage [Swap]%1mThe percentage of used swap memory.
MemoryMemory Usedbytes1mThe used memory.
MemoryMemory Used [Actual]bytes1mThe actual used memory (in bytes), which is the total memory minus the available memory. The available memory varies depending on the OS (see system.memory.actual.free).
MemoryMemory Used [Swap]bytes1mThe used swap memory.
NetworkCollisionscnt1mNetwork collisions.
NetworkNetwork In Bytesbytes1mThe number of bytes received.
NetworkNetwork In Bytes [Delta Avg]bytes1mThe average of system.network.in.bytes_delta for individual networks.
NetworkNetwork In Bytes [Delta Max]bytes1mThe maximum of system.network.in.bytes_delta for individual networks.
NetworkNetwork In Bytes [Delta Min]bytes1mThe minimum of system.network.in.bytes_delta for individual networks.
NetworkNetwork In Bytes [Delta Sum]bytes1mThe sum of system.network.in.bytes_delta for individual networks.
NetworkNetwork In Bytes [Delta]bytes1mThe delta value of the number of bytes received.
NetworkNetwork In Droppedcnt1mThe number of packets dropped during reception.
NetworkNetwork In Errorscnt1mThe number of errors during reception.
NetworkNetwork In Packetscnt1mThe number of packets received.
NetworkNetwork In Packets [Delta Avg]cnt1mThe average of system.network.in.packets_delta for individual networks.
NetworkNetwork In Packets [Delta Max]cnt1mThe maximum of system.network.in.packets_delta for individual networks.
NetworkNetwork In Packets [Delta Min]cnt1mThe minimum of system.network.in.packets_delta for individual networks.
NetworkNetwork In Packets [Delta Sum]cnt1mThe sum of system.network.in.packets_delta for individual networks.
NetworkNetwork In Packets [Delta]cnt1mThe delta value of the number of packets received.
NetworkNetwork Out Bytesbytes1mThe number of bytes sent.
NetworkNetwork Out Bytes [Delta Avg]bytes1mThe average of system.network.out.bytes_delta for individual networks.
NetworkNetwork Out Bytes [Delta Max]bytes1mThe maximum of system.network.out.bytes_delta for individual networks.
NetworkNetwork Out Bytes [Delta Min]bytes1mThe minimum of system.network.out.bytes_delta for individual networks.
NetworkNetwork Out Bytes [Delta Sum]bytes1mThe sum of system.network.out.bytes_delta for individual networks.
NetworkNetwork Out Bytes [Delta]bytes1mThe delta value of the number of bytes sent.
NetworkNetwork Out Droppedcnt1mThe number of packets dropped during transmission. This value is not reported by the OS and is always 0 on Darwin and BSD.
NetworkNetwork Out Errorscnt1mThe number of errors during transmission.
NetworkNetwork Out Packetscnt1mThe number of packets sent.
NetworkNetwork Out Packets [Delta Avg]cnt1mThe average of system.network.out.packets_delta for individual networks.
NetworkNetwork Out Packets [Delta Max]cnt1mThe maximum of system.network.out.packets_delta for individual networks.
NetworkNetwork Out Packets [Delta Min]cnt1mThe minimum of system.network.out.packets_delta for individual networks.
NetworkNetwork Out Packets [Delta Sum]cnt1mThe sum of system.network.out.packets_delta for individual networks.
NetworkNetwork Out Packets [Delta]cnt1mThe delta value of the number of packets sent.
NetworkOpen Connections [TCP]cnt1mThe number of open TCP connections.
NetworkOpen Connections [UDP]cnt1mThe number of open UDP connections.
NetworkPort Usage%1mThe port usage rate.
NetworkSYN Sent Socketscnt1mThe number of sockets in the SYN_SENT state (when connecting to a remote host).
ProcessKernel PID Maxcnt1mThe kernel.pid_max value.
ProcessKernel Thread Maxcnt1mThe kernel.threads-max value.
ProcessProcess CPU Usage%1mThe percentage of CPU time consumed by the process since the last update. This value is similar to the %CPU value displayed by the top command on Unix systems.
ProcessProcess CPU Usage/Core%1mThe percentage of CPU time used by the process since the last event, normalized by the number of cores (0-100%).
ProcessProcess Memory Usage%1mThe percentage of main memory (RAM) used by the process.
ProcessProcess Memory Usedbytes1mThe resident set size, which is the amount of memory used by the process in RAM. On Windows, it is the current working set size.
ProcessProcess PIDPID1mThe process ID.
ProcessProcess PPIDPID1mThe parent process ID.
ProcessProcesses [Dead]cnt1mThe number of dead processes.
ProcessProcesses [Idle]cnt1mThe number of idle processes.
ProcessProcesses [Running]cnt1mThe number of running processes.
ProcessProcesses [Sleeping]cnt1mThe number of sleeping processes.
ProcessProcesses [Stopped]cnt1mThe number of stopped processes.
ProcessProcesses [Total]cnt1mThe total number of processes.
ProcessProcesses [Unknown]cnt1mThe number of processes whose state is unknown or could not be retrieved.
ProcessProcesses [Zombie]cnt1mThe number of zombie processes.
ProcessRunning Process Usage%1mThe process usage rate.
ProcessRunning Processescnt1mThe number of running processes.
ProcessRunning Thread Usage%1mThe thread usage rate.
ProcessRunning Threadscnt1mThe total number of threads running in running processes.
SystemContext Switchescnt1mThe number of context switches per second.
SystemLoad/Core [1 min]cnt1mThe load average over the last 1 minute, normalized by the number of cores.
SystemLoad/Core [15 min]cnt1mThe load average over the last 15 minutes, normalized by the number of cores.
SystemLoad/Core [5 min]cnt1mThe load average over the last 5 minutes, normalized by the number of cores.
SystemMultipaths [Active]cnt1mThe number of active paths for external storage connections.
SystemMultipaths [Failed]cnt1mThe number of failed paths for external storage connections.
SystemMultipaths [Faulty]cnt1mThe number of faulty paths for external storage connections.
SystemNTP Offsetnum1mThe measured offset (time difference between the NTP server and the local environment) of the last sample.
SystemRun Queue Lengthnum1mThe length of the run queue.
SystemUptimems1mThe OS uptime (in milliseconds).
WindowsContext Switchiescnt1mThe number of CPU context switches per second.
WindowsDisk Read Bytes [Sec]cnt1mThe number of bytes read from the Windows logical disk per second.
WindowsDisk Read Time [Avg]sec1mThe average time spent reading data (in seconds).
WindowsDisk Transfer Time [Avg]sec1mThe average disk wait time.
WindowsDisk Usage%1mThe disk usage rate.
WindowsDisk Write Bytes [Sec]cnt1mThe number of bytes written to the Windows logical disk per second.
WindowsDisk Write Time [Avg]sec1mThe average time spent writing data (in seconds).
WindowsPagingfile Usage%1mThe paging file usage rate.
WindowsPool Used [Non Paged]bytes1mThe Nonpaged Pool usage of kernel memory.
WindowsPool Used [Paged]bytes1mThe Paged Pool usage of kernel memory.
WindowsProcess [Running]cnt1mThe number of currently running processes.
WindowsThreads [Running]cnt1mThe number of currently running threads.
WindowsThreads [Waiting]cnt1mThe number of threads waiting for processor time.
Table. Performance Items for Bare Metal Server
Note
To monitor the performance of Bare Metal Server, please install the Agent. Refer to Agent Management for the installation guide.
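
The Load/Core items in the table above are described as load averages normalized by the number of cores. A minimal sketch of that normalization follows, assuming it is a plain division; the function name `load_per_core` is hypothetical and is not part of the Cloud Monitoring service or its Agent.

```python
def load_per_core(load_average: float, core_count: int) -> float:
    """Normalize a load average by the number of CPU cores (assumed to be a plain division)."""
    if core_count <= 0:
        raise ValueError("core_count must be positive")
    return load_average / core_count


# Example: a 1-minute load average of 6.0 on an 8-core server.
print(load_per_core(6.0, 8))  # 0.75
```

A normalized value close to or above 1.0 suggests that, on average, every core had work queued during the interval.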

Storage type

File Storage

| Performance item group name | Performance item name | Collection unit | Collection cycle | Description |
| --- | --- | --- | --- | --- |
| Volume | Instance State | state | 1m | File storage volume status |
| Volume | IOPS [Other] | iops | 1m | IOPS (other) |
| Volume | IOPS [Read] | iops | 1m | IOPS (read) |
| Volume | IOPS [Total] | iops | 1m | IOPS (total) |
| Volume | IOPS [Write] | iops | 1m | IOPS (write) |
| Volume | Latency Time [Other] | usec | 1m | Latency time (other) |
| Volume | Latency Time [Read] | usec | 1m | Latency time (read) |
| Volume | Latency Time [Total] | usec | 1m | Latency time (total) |
| Volume | Latency Time [write] | usec | 1m | Latency time (write) |
| Volume | Throughput [Other] | MB/s | 1m | Throughput (other) |
| Volume | Throughput [Read] | MB/s | 1m | Throughput (read) |
| Volume | Throughput [Total] | MB/s | 1m | Throughput (total) |
| Volume | Throughput [Write] | MB/s | 1m | Throughput (write) |
| Volume | Volume Total | bytes | 1m | Total bytes |
| Volume | Volume Usage | % | 1m | Usage rate |
| Volume | Volume Used | bytes | 1m | Used amount |
Table. File Storage performance items

Object Storage

| Performance item group name | Performance item name | Collection unit | Collection cycle | Description |
| --- | --- | --- | --- | --- |
| Request | Requests [Delete] | cnt | 1m | Number of HTTP DELETE requests executed on objects in the bucket |
| Request | Requests [Download Avg] | bytes | 1m | Average download usage per bucket |
| Request | Requests [Get] | cnt | 1m | Number of HTTP GET requests executed on objects in the bucket |
| Request | Requests [Head] | cnt | 1m | Number of HTTP HEAD requests executed on objects in the bucket |
| Request | Requests [List] | cnt | 1m | Number of LIST requests executed on objects in the bucket |
| Request | Requests [Post] | cnt | 1m | Number of HTTP POST requests executed on objects in the bucket |
| Request | Requests [Put] | cnt | 1m | Number of HTTP PUT requests executed on objects in the bucket |
| Request | Requests [Total] | cnt | 1m | Total number of HTTP requests executed on the bucket |
| Request | Requests [Upload Avg] | bytes | 1m | Average upload usage per bucket |
| Usage | Bucket Used | bytes | 1m | Amount of data stored in the bucket (in bytes) |
| Usage | Objects | cnt | 1m | Number of objects stored in the bucket |
Table. Object Storage performance items

Database type

PostgreSQL(DBaaS)

| Performance item group name | Performance item name | Collection unit | Collection cycle | Description |
| --- | --- | --- | --- | --- |
| Activelock | Active Locks | cnt | 1m | Number of active locks |
| Activelock | Active Locks [Access Exclusive] | cnt | 1m | Number of access exclusive locks |
| Activelock | Active Locks [Access Share] | cnt | 1m | Number of access share locks |
| Activelock | Active Locks [Total] | cnt | 1m | Total number of active locks |
| Activelock | Exclusive Locks | cnt | 1m | Number of exclusive locks |
| Activelock | Row Exclusive Locks | cnt | 1m | Number of row exclusive locks |
| Activelock | Row Share Locks | cnt | 1m | Number of row share locks |
| Activelock | Share Locks | cnt | 1m | Number of share locks |
| Activelock | Share Row Exclusive Locks | cnt | 1m | Number of share row exclusive locks |
| Activelock | Share Update Exclusive Locks | cnt | 1m | Number of share update exclusive locks |
| ActiveSession | Active Sessions | cnt | 1m | Number of active sessions |
| ActiveSession | Active Sessions [Total] | cnt | 1m | Total number of active sessions |
| ActiveSession | Idle In Transaction Sessions | cnt | 1m | Number of idle in transaction sessions |
| ActiveSession | Idle In Transaction Sessions [Total] | cnt | 1m | Total number of idle in transaction sessions |
| ActiveSession | Idle Sessions | cnt | 1m | Number of idle sessions |
| ActiveSession | Idle Sessions [Total] | cnt | 1m | Total number of idle sessions |
| ActiveSession | Waiting Sessions | cnt | 1m | Number of waiting sessions |
| ActiveSession | Waiting Sessions [Total] | cnt | 1m | Total number of waiting sessions |
| Connection | Connection Usage | % | 1m | DB connection usage rate |
| Connection | Connection Usage [Total] | % | 1m | Total DB connection usage rate |
| DB Age | DB Age Max | age | 1m | Database age (frozen XID) value |
| Lock | Wait Locks | cnt | 1m | Number of sessions waiting for locks (per DB) |
| Lock | Wait Locks [Long Total] | cnt | 1m | Total number of sessions waiting for locks for more than 300 seconds |
| Lock | Wait Locks [Long] | cnt | 1m | Number of sessions waiting for locks for more than 300 seconds |
| Lock | Wait Locks [Total] | cnt | 1m | Total number of sessions waiting for locks |
| Long Transaction | Transaction Time Max [Long] | sec | 1m | Longest transaction time (in seconds) |
| Long Transaction | Transaction Time Max Total [Long] | sec | 1m | Longest transaction time (in seconds) |
| Replica | Apply Lag Time | sec | 1m | Apply lag time |
| Replica | Check No Replication | cnt | 1m | Check no replication value |
| Replica | Check Replication | state | 1m | Check replication state value |
| Slowquery | Slowqueries | cnt | 1m | Number of slow queries (more than 5 minutes) |
| State | Instance State [PID] | PID | 1m | Postgres process PID |
| Tablespace | Tablespace Used | bytes | 1m | Tablespace usage |
| Tablespace | Tablespace Used [Total] | bytes | 1m | Total tablespace usage |
| Tablespace | Tablespace Used Bytes [MB] | bytes | 1m | Filesystem directory usage (in MB) |
| Tablespace | Tablespaces [Total] | cnt | 1m | Total number of tablespaces |
Table. PostgreSQL(DBaaS) performance items
Note
Refer to Virtual Server performance items for DB instance performance items.

MariaDB(DBaaS)

| Performance item group name | Performance item name | Collection unit | Collection cycle | Description |
| --- | --- | --- | --- | --- |
| Activelock | Active Locks | cnt | 1m | Number of active locks |
| Activesssion | Active Sessions | cnt | 1m | Number of connected threads |
| Activesssion | Connection Usage [Total] | % | 1m | DB connection usage rate |
| Activesssion | Connections | cnt | 1m | Number of connections |
| Activesssion | Connections [MAX] | cnt | 1m | Maximum number of connected threads |
| Datafile | Binary Log Used [MB] | bytes | 1m | Binary log usage (in MB) |
| Datafile | Data Directory Used [MB] | bytes | 1m | Datadir usage (in MB) |
| Datafile | Open Files | cnt | 1m | Number of open files |
| Datafile | Open Files [MAX] | cnt | 1m | Maximum number of open files |
| Datafile | Open Files Usage | % | 1m | Open file usage rate |
| Datafile | Relay Log Used [MB] | bytes | 1m | Relay log usage (in MB) |
| State | Instance State [PID] | PID | 1m | Mariadbd process PID (or mysqld process PID for versions prior to 10.5.2) |
| State | Safe PID | PID | 1m | Mariadbd_safe process PID (or mysqld_safe process PID for versions prior to 10.5.2) |
| State | Slave Behind Master seconds | sec | 1m | Time difference between master and slave (in seconds) |
| Tablespace | Tablespace Used | bytes | 1m | Tablespace usage |
| Tablespace | Tablespace Used [Total] | bytes | 1m | Total tablespace usage |
| Transaction | Running Threads | cnt | 1m | Number of running threads |
| Transaction | Slowqueries | cnt | 1m | Number of slow queries (more than 10 seconds) |
| Transaction | Slowqueries [Total] | cnt | 1m | Total number of slow queries |
| Transaction | Transaction Time [Long] | sec | 1m | Longest transaction time (in seconds) |
| Transaction | Wait Locks | cnt | 1m | Number of sessions waiting for locks for more than 60 seconds |
Table. MariaDB(DBaaS) performance items
Note
Refer to Virtual Server performance items for DB instance performance items.

MySQL(DBaaS)

| Performance item group name | Performance item name | Collection unit | Collection cycle | Description |
| --- | --- | --- | --- | --- |
| Activelock | Active Locks | cnt | 1m | Number of active locks |
| Activesssion | Active Sessions | cnt | 1m | Number of connected threads |
| Activesssion | Connection Usage [Total] | % | 1m | DB connection usage rate |
| Activesssion | Connections | cnt | 1m | Number of connections |
| Activesssion | Connections [MAX] | cnt | 1m | Maximum number of connected threads |
| Datafile | Binary Log Used [MB] | bytes | 1m | Binary log usage (in MB) |
| Datafile | Data Directory Used [MB] | bytes | 1m | Datadir usage (in MB) |
| Datafile | Open Files | cnt | 1m | Number of open files |
| Datafile | Open Files [MAX] | cnt | 1m | Maximum number of open files |
| Datafile | Open Files Usage | % | 1m | Open file usage rate |
| Datafile | Relay Log Used [MB] | bytes | 1m | Relay log usage (in MB) |
| State | Instance State [PID] | PID | 1m | Mysqld process PID |
| State | Safe PID | PID | 1m | Safe program PID |
| State | Slave Behind Master seconds | sec | 1m | Time difference between master and slave (in seconds) |
| Tablespace | Tablespace Used | bytes | 1m | Tablespace usage |
| Tablespace | Tablespace Used [Total] | bytes | 1m | Total tablespace usage |
| Transaction | Running Threads | cnt | 1m | Number of running threads |
| Transaction | Slowqueries | cnt | 1m | Number of slow queries (more than 10 seconds) |
| Transaction | Slowqueries [Total] | cnt | 1m | Total number of slow queries |
| Transaction | Transaction Time [Long] | sec | 1m | Longest transaction time (in seconds) |
| Transaction | Wait Locks | cnt | 1m | Number of sessions waiting for locks for more than 60 seconds |
Table. MySQL(DBaaS) performance items
Note
Refer to Virtual Server performance items for DB instance performance items.

CacheStore(DBaaS)

| Performance item group name | Performance item name | Collection unit | Collection cycle | Description |
| --- | --- | --- | --- | --- |
| CacheStore | Active Defragmentation Keys [Hits] | cnt | 1m | Number of keys defragmented |
| CacheStore | Active Defragmentation Keys [Miss] | cnt | 1m | Number of keys skipped during defragmentation |
| CacheStore | Active Defragmentationd [Hits] | cnt | 1m | Number of values reallocated during defragmentation |
| CacheStore | Active Defragmentations [Miss] | cnt | 1m | Number of value reallocations aborted during defragmentation |
| CacheStore | Allocated Bytes [OS] | bytes | 1m | Bytes allocated by CacheStore and recognized by the operating system (resident set size) |
| CacheStore | Allocated Bytes [Redis] | bytes | 1m | Total bytes allocated by CacheStore |
| CacheStore | AOF Buffer Size | bytes | 1m | AOF buffer size |
| CacheStore | AOF File Size [Current] | bytes | 1m | Current AOF file size |
| CacheStore | AOF File Size [Lastest Startup] | bytes | 1m | AOF file size at the last startup or rewrite |
| CacheStore | AOF Rewrite Buffer Size | bytes | 1m | AOF rewrite buffer size |
| CacheStore | AOF Rewrite Current Time | sec | 1m | Time spent on the current AOF rewrite process |
| CacheStore | AOF Rewrite Last Time | sec | 1m | Time spent on the last AOF rewrite process |
| CacheStore | Calls | cnt | 1m | Number of commands executed (not rejected) |
| CacheStore | Calls [Failed] | cnt | 1m | Number of failed commands (CacheStore 6.2-rc2) |
| CacheStore | Calls [Rejected] | cnt | 1m | Number of rejected commands (CacheStore 6.2-rc2) |
| CacheStore | Changes [Last Saved] | cnt | 1m | Number of changes since the last dump |
| CacheStore | Client Output Buffer [MAX] | cnt | 1m | Longest output list among current client connections |
| CacheStore | Client Input Buffer [MAX] | cnt | 1m | Largest input buffer among current client connections (CacheStore 5.0) |
| CacheStore | Clients [Sentinel] | cnt | 1m | Number of client connections (sentinel) |
| CacheStore | Connected Slaves | cnt | 1m | Number of connected slaves |
| CacheStore | Connections [Blocked] | cnt | 1m | Number of clients waiting for blocking calls (BLPOP, BRPOP, BRPOPLPUSH) |
| CacheStore | Connections [Current] | cnt | 1m | Number of client connections (excluding slave connections) |
| CacheStore | Copy On Write Allocated Size [AOF] | bytes | 1m | COW allocation size (in bytes) during the last AOF rewrite operation |
| CacheStore | Copy On Write Allocated Size [RDB] | bytes | 1m | COW allocation size (in bytes) during the last RDB save operation |
| CacheStore | CPU Time [Average] | cnt | 1m | Average CPU time consumed per command execution |
| CacheStore | CPU Time [Total] | usec | 1m | Total CPU time consumed by commands |
| CacheStore | CPU Usage [System Process] | % | 1m | System CPU usage by background processes |
| CacheStore | CPU Usage [System] | % | 1m | System CPU usage by the CacheStore server |
| CacheStore | CPU Usage [User Process] | % | 1m | User CPU usage by background processes |
| CacheStore | CPU Usage [User] | % | 1m | User CPU usage by the CacheStore server |
| CacheStore | Dataset Used | bytes | 1m | Dataset size (in bytes) |
| CacheStore | Disk Used | bytes | 1m | Datadir usage |
| CacheStore | Evicted Keys | cnt | 1m | Number of evicted keys due to maxmemory limit |
| CacheStore | Fsyncs [Delayed] | cnt | 1m | Delayed fsync counter |
| CacheStore | Fsyncs [Pending] | cnt | 1m | Number of fsync operations pending in the background I/O queue |
| CacheStore | Full Resyncs | cnt | 1m | Number of full resynchronizations with slaves |
| CacheStore | Keys [Expired] | cnt | 1m | Total number of key expiration events |
| CacheStore | Keys [Keyspace] | cnt | 1m | Number of keys in the keyspace |
| CacheStore | Lastest Fork Duration Time | usec | 1m | Time taken by the last fork operation (in microseconds) |
| CacheStore | Lookup Keys [Hit] | cnt | 1m | Number of successful key lookups in the main dictionary |
| CacheStore | Lookup Keys [Miss] | cnt | 1m | Number of failed key lookups in the main dictionary |
| CacheStore | Lua Engine Memory Used | bytes | 1m | Memory used by the Lua engine |
| CacheStore | Master Last Interaction Time Ago | sec | 1m | Time elapsed since the last interaction with the master (in seconds) |
| CacheStore | Master Last Interaction Time Ago [Sync] | sec | 1m | Time elapsed since the last transfer I/O with the master during a SYNC operation (in seconds) |
| CacheStore | Master Offset | pid | 1m | Current replication offset of the server |
| CacheStore | Master Second Offset | pid | 1m | Offset of the replication ID that will be accepted |
| CacheStore | Master Sync Left Bytes | bytes | 1m | Number of bytes remaining to be synchronized |
| CacheStore | Memory Fragmentation Rate | % | 1m | Ratio of used_memory_rss to used_memory |
| CacheStore | Memory Fragmentation Rate [Allocator] | % | 1m | Fragmentation ratio |
| CacheStore | Memory Fragmentation Used | bytes | 1m | Difference between used_memory_rss and used_memory (in bytes) |
| CacheStore | Memory Fragmentation Used [Allocator] | bytes | 1m | Resident bytes |
| CacheStore | Memory Max Value | bytes | 1m | Memory limit |
| CacheStore | Memory Resident [Allocator] | bytes | 1m | Resident memory |
| CacheStore | Memory RSS Rate [Allocator] | % | 1m | Resident ratio |
| CacheStore | Memory Used [Active] | bytes | 1m | Active memory |
| CacheStore | Memory Used [Allocated] | bytes | 1m | Allocated memory |
| CacheStore | Memory Used [Resident] | bytes | 1m | Resident bytes |
| CacheStore | Network In Bytes [Total] | bytes | 1m | Total network input (in bytes) |
| CacheStore | Network Out Bytes [Total] | bytes | 1m | Total network output (in bytes) |
| CacheStore | Network Read Rate | cnt | 1m | Network read rate (in KB/sec) |
| CacheStore | Network Write Rate | cnt | 1m | Network write rate (in KB/sec) |
| CacheStore | Partial Resync Requests [Accepted] | cnt | 1m | Number of accepted partial resynchronization requests |
| CacheStore | Partial Resync Requests [Denied] | cnt | 1m | Number of denied partial resynchronization requests |
| CacheStore | Peak Memory Consumed | bytes | 1m | Maximum memory consumed by CacheStore |
| CacheStore | Processed Commands | cnt | 1m | Number of commands processed per second |
| CacheStore | Processed Commands [Total] | cnt | 1m | Total number of commands processed |
| CacheStore | Pub/Sub Channels | cnt | 1m | Global number of pub/sub channels with client subscriptions |
| CacheStore | Pub/Sub Patterns | cnt | 1m | Global number of pub/sub patterns with client subscriptions |
| CacheStore | RDB Saved Duration Time [Current] | sec | 1m | Time taken by the current RDB save operation (in seconds) |
| CacheStore | RDB Saved Duration Time [Last] | sec | 1m | Time taken by the last RDB save operation (in seconds) |
| CacheStore | Received Connections [Total] | cnt | 1m | Total number of connections received |
| CacheStore | Rejected Connections [Total] | cnt | 1m | Total number of connections rejected |
| CacheStore | Replication Backlog Actove Count | cnt | 1m | Replication backlog active flag |
| CacheStore | Replication Backlog Master Offset | cnt | 1m | Master offset of the replication backlog buffer |
| CacheStore | Replication Backlog Size | bytes | 1m | Size of the replication backlog buffer (in bytes) |
| CacheStore | Replication Backlog Size [Total] | bytes | 1m | Total size of the replication backlog buffer (in bytes) |
| CacheStore | Slave Priority | cnt | 1m | Priority of the instance as a failover target |
| CacheStore | Slave Replication Offset | pid | 1m | Replication offset of the slave instance |
| CacheStore | Slow Operations | cnt | 1m | Number of slow operations |
| CacheStore | Sockets [MIGRATE] | cnt | 1m | Number of sockets opened for migration |
| CacheStore | Tracked Keys [Expiry] | cnt | 1m | Number of keys being tracked for expiry (only for writable slaves) |
| State | Instance State [PID] | PID | 1m | PID of the redis-server process |
| State | Sentinel State [PID] | PID | 1m | PID of the sentinel process |
Table. CacheStore (DBaaS) Performance Items
Note
Refer to the performance items of the Virtual Server for the performance items of the DB instance.
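
The CacheStore table describes Memory Fragmentation Rate as the ratio of used_memory_rss to used_memory, and Memory Fragmentation Used as their difference. The sketch below shows how the two values relate, assuming both inputs are byte counts taken from the same sample. The function name `fragmentation_metrics` is hypothetical, and whether the service reports the rate as a plain ratio or multiplied by 100 is not stated in this document.

```python
from typing import Tuple


def fragmentation_metrics(used_memory_rss: int, used_memory: int) -> Tuple[float, int]:
    """Return (ratio, difference in bytes) of used_memory_rss versus used_memory."""
    if used_memory <= 0:
        raise ValueError("used_memory must be positive")
    ratio = used_memory_rss / used_memory        # Memory Fragmentation Rate (as a plain ratio)
    difference = used_memory_rss - used_memory   # Memory Fragmentation Used (bytes)
    return ratio, difference


# Example: 1,500,000 resident bytes against 1,000,000 allocated bytes.
print(fragmentation_metrics(1_500_000, 1_000_000))  # (1.5, 500000)
```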

EPAS

| Performance Item Group Name | Performance Item Name | Collection Unit | Collection Cycle | Description |
| --- | --- | --- | --- | --- |
| Activelock | Access Exclusive Locks | cnt | 1m | Number of access exclusive locks |
| Activelock | Access Share Locks | cnt | 1m | Number of access share locks |
| Activelock | Active Locks | cnt | 1m | Number of active locks |
| Activelock | Active Locks [Total] | cnt | 1m | Total number of active locks |
| Activelock | Exclusive Locks | cnt | 1m | Number of exclusive locks |
| Activelock | Row Exclusive Locks | cnt | 1m | Number of row exclusive locks |
| Activelock | Row Share Locks | cnt | 1m | Number of row share locks |
| Activelock | Share Locks | cnt | 1m | Number of share locks |
| Activelock | Share Row Exclusive Locks | cnt | 1m | Number of share row exclusive locks |
| Activelock | Share Update Exclusive Locks | cnt | 1m | Number of share update exclusive locks |
| Activesession | Active Sessions | cnt | 1m | Number of active sessions |
| Activesession | Active Sessions [Total] | cnt | 1m | Total number of active sessions |
| Activesession | Idel In Transaction Sessions | cnt | 1m | Number of idle in transaction sessions |
| Activesession | Idle In Transaction Sessions [Total] | cnt | 1m | Total number of idle in transaction sessions |
| Activesession | Idle Sessions | cnt | 1m | Number of idle sessions |
| Activesession | Idle Sessions [Total] | cnt | 1m | Total number of idle sessions |
| Activesession | Waiting Sessions | cnt | 1m | Number of waiting sessions |
| Activesession | Waiting Sessions [Total] | cnt | 1m | Total number of waiting sessions |
| Connection | Connection Usage | % | 1m | DB connection usage rate (%) |
| Connection | Connection Usage [Total] | % | 1m | Total DB connection usage rate (%) |
| Connection | Connection Usage Per DB | % | 1m | DB connection usage rate per DB (%) |
| DB Age | DB Age Max | age | 1m | Database age (frozen XID) value |
| Lock | Wait Locks | cnt | 1m | Number of sessions waiting for locks |
| Lock | Wait Locks [Long Total] | cnt | 1m | Total number of sessions waiting for locks for a long time |
| Lock | Wait Locks [Long] | cnt | 1m | Number of sessions waiting for locks for a long time |
| Lock | Wait Locks [Total] | cnt | 1m | Total number of sessions waiting for locks |
| Lock | Wait Locks Per DB [Total] | cnt | 1m | Total number of sessions waiting for locks per DB |
| Long Transaction | Transaction Time Max [Long] | sec | 1m | Maximum transaction time (in minutes) |
| Long Transaction | Transaction Time Max Total [Long] | sec | 1m | Maximum transaction time (in minutes) |
| Replica | Apply Lag Time | sec | 1m | Apply lag time |
| Replica | Check No Replication | cnt | 1m | Check no replication value |
| Replica | Check Replication | state | 1m | Check replication state value |
| Slowquery | Slowqueries | cnt | 1m | Number of slow queries |
| State | Instance state [PID] | PID | 1m | PID of the edb-postgres process |
| Tablespace | Tablespace Used Bytes [MB] | bytes | 1m | Filesystem directory usage (in MB) |
| Tablespace | Tablespace [Total] | cnt | 1m | Total number of tablespaces |
| Tablespace | Tablespace Used | bytes | 1m | Used tablespace size |
| Tablespace | Tablespace Used [Total] | bytes | 1m | Total used tablespace size |
Table. EPAS Performance Items

Microsoft SQL Server

| Performance Item Group Name | Performance Item Name | Collection Unit | Collection Cycle | Description |
| --- | --- | --- | --- | --- |
| Activelock | Active Locks | cnt | 1m | Number of active locks |
| Activesssion | Active Sessions | cnt | 1m | Number of active sessions |
| Activetransaction | Active Transactions [Total] | cnt | 1m | Total number of active transactions |
| Connection | Connected Users | cnt | 1m | Number of users connected to the system |
| Datafile | Datavolume Size [Free] | bytes | 1m | Available space |
| Datafile | DBFiles [Not Online] | cnt | 1m | Number of data files that are not online |
| Datafile | Tablespace Used | bytes | 1m | Used data volume size |
| Lock | Lock Processes [Blocked] | cnt | 1m | Number of SQL processes blocked by other processes |
| Lock | Lock Waits [Per Second] | cnt | 1m | Number of lock waits per second |
| Slowquery | Blocking Session ID | ID | 1m | ID of the session blocking the query |
| Slowquery | Slowqueries | cnt | 1m | Number of slow queries |
| Slowquery | Slowquery CPU Time | ms | 1m | CPU time taken by slow queries |
| Slowquery | Slowquery Execute Context ID | ID | 1m | ID of the execution context of slow queries |
| Slowquery | Slowquery Memory Usage | bytes | 1m | Memory usage of slow queries |
| Slowquery | Slowquery Session ID | ID | 1m | ID of the session executing slow queries |
| Slowquery | Slowquery Wait Duration Time | ms | 1m | Wait duration time of slow queries |
| State | Instance State [Cluster] | state | 1m | State of the MSSQL cluster |
| State | Instance State [PID] | PID | 1m | PID of the sqlservr.exe process |
| State | Page IO Latch Wait Time | ms | 1m | Average wait time for page IO latches |
| Transaction | Transaction Time [MAX] | cnt | 1m | Maximum transaction time |
Table. Microsoft SQL Server performance items

Data Analytics type

Event Streams

| Performance Item Group Name | Performance Item Name | Collection Unit | Collection Cycle | Description |
| --- | --- | --- | --- | --- |
| Broker | Connections [Zookeeper Client] | cnt | 1m | Number of ZooKeeper connections |
| Broker | Failed [Client Fetch Request] | cnt | 1m | Number of failed client fetch requests |
| Broker | Failed [Produce Request] | cnt | 1m | Number of failed produce requests |
| Broker | Incomming Messages | cnt | 1m | Number of incoming messages |
| Broker | Leader Elections | cnt | 1m | Number of leader elections |
| Broker | Leader Elections [Unclean] | cnt | 1m | Number of unclean leader elections |
| Broker | Log Flushes | cnt | 1m | Number of log flushes |
| Broker | Network In Bytes | bytes | 1m | Total network input (in bytes) |
| Broker | Network Out Bytes | bytes | 1m | Total network output (in bytes) |
| Broker | Rejected Bytes | bytes | 1m | Total rejected bytes |
| Broker | Request Queue Length | cnt | 1m | Request queue length |
| Broker | Zookeeper Sessions [Closed] | cnt | 1m | Number of closed ZooKeeper sessions |
| Broker | Zookeeper Sessions [Expired] | cnt | 1m | Number of expired ZooKeeper sessions |
| Broker | Zookeeper Sessions [Readonly] | cnt | 1m | Number of read-only ZooKeeper sessions |
| Broker | Incomming Messages Rate [Topic] | cnt | 1m | Incoming message rate per topic |
| Broker | Incomming Byte Rate [Second] | bytes | 1m | Incoming byte rate per second |
| Broker | Outgoing Byte Rate [Second] | bytes | 1m | Outgoing byte rate per second |
| Broker | Rejected Byte Rate [Second] | bytes | 1m | Rejected byte rate per second |
| Disk | Disk Used | bytes | 1m | Datadir usage |
| State | AKHQ State [PID] | PID | 1m | PID of the akhq process |
| State | Instance State [PID] | PID | 1m | PID of the kafka process |
| State | Zookeeper State [PID] | PID | 1m | PID of the zookeeper process |
Table. Event Streams performance items

Search Engine

| Performance Item Group Name | Performance Item Name | Collection Unit | Collection Cycle | Description |
| --- | --- | --- | --- | --- |
| Cluster | Shards | cnt | 1m | Number of shards in the cluster |
| Cluster | Shards [Primary] | cnt | 1m | Number of primary shards in the cluster |
| Cluster | Index [Total] | cnt | 1m | Total number of indices in the cluster |
| Cluster | License Expiry Date [ms] | ms | 1m | License expiry date (in milliseconds) |
| Cluster | License Status | state | 1m | License status |
| Cluster | License Type | type | 1m | License type |
| FileSystem | Disk Usage | bytes | 1m | Datadir usage |
| Node | Documents [Deleted] | cnt | 1m | Total number of deleted documents |
| Node | Documents [Existing] | cnt | 1m | Total number of existing documents |
| Node | Filesystem Bytes [Available] | bytes | 1m | Available filesystem bytes |
| Node | Filesystem Bytes [Free] | bytes | 1m | Free filesystem bytes |
| Node | Filesystem Bytes [Total] | bytes | 1m | Total filesystem bytes |
| Node | JVM Heap Used [Init] | bytes | 1m | Initial JVM heap usage (in bytes) |
| Node | JVM Heap Used [MAX] | bytes | 1m | Maximum JVM heap usage (in bytes) |
| Node | JVM Non Heap Used [Init] | bytes | 1m | Initial JVM non-heap usage (in bytes) |
| Node | JVM Non Heap Used [MAX] | bytes | 1m | Maximum JVM non-heap usage (in bytes) |
| Node | Segments | cnt | 1m | Total number of segments |
| Node | Segments Bytes | bytes | 1m | Total size of segments (in bytes) |
| Node | Store Bytes | bytes | 1m | Total size of the store (in bytes) |
| State | Instance state [PID] | PID | 1m | PID of the Elasticsearch process |
| Task | Queue Time | ms | 1m | Queue time |
| Kibana | Kibana state [PID] | PID | 1m | PID of the Kibana process |
| Kibana | Kibana Connections | cnt | 1m | Number of connections |
| Kibana | Kibana Memory Heap Allocated [Limit] | bytes | 1m | Maximum allocated heap size (in bytes) |
| Kibana | Kibana Memory Heap Allocated [Total] | bytes | 1m | Total allocated heap size (in bytes) |
| Kibana | Kibana Memory Heap Used | bytes | 1m | Used heap size (in bytes) |
| Kibana | Kibana Process Uptime | ms | 1m | Process uptime |
| Kibana | Kibana Requests [Disconnected] | cnt | 1m | Number of disconnected requests |
| Kibana | Kibana Requests [Total] | cnt | 1m | Total number of requests |
| Kibana | Kibana Response Time [Avg] | ms | 1m | Average response time |
| Kibana | Kibana Response Time [MAX] | ms | 1m | Maximum response time |
Table. Search Engine performance items

Container type

Kubernetes Engine

| Performance Item Group Name | Performance Item Name | Collection Unit | Collection Cycle | Description |
| --- | --- | --- | --- | --- |
| Cluster | Cluster Namespaces [Active] | cnt | 5m | Number of active namespaces |
| Cluster | Cluster Namespaces [Total] | cnt | 5m | Total number of namespaces |
| Cluster | Cluster Nodes [Ready] | cnt | 5m | Number of ready nodes |
| Cluster | Cluster Nodes [Total] | cnt | 5m | Total number of nodes |
| Cluster | Cluster Pods [Failed] | cnt | 5m | Number of failed pods |
| Cluster | Cluster Pods [Pending] | cnt | 5m | Number of pending pods |
| Cluster | Cluster Pods [Running] | cnt | 5m | Number of running pods |
| Cluster | Cluster Pods [Succeeded] | cnt | 5m | Number of succeeded pods |
| Cluster | Cluster Pods [Unknown] | cnt | 5m | Number of unknown pods |
| Cluster | Instance State | state | 5m | Cluster state |
| Namespace | Namespace Pods [Failed] | cnt | 5m | Number of failed pods in the namespace |
| Namespace | Namespace Pods [Pending] | cnt | 5m | Number of pending pods in the namespace |
| Namespace | Namespace Pods [Running] | cnt | 5m | Number of running pods in the namespace |
| Namespace | Namespace Pods [Succeeded] | cnt | 5m | Number of succeeded pods in the namespace |
| Namespace | Namespace Pods [Unknown] | cnt | 5m | Number of unknown pods in the namespace |
| Namespace | Namespace GPU Clock Frequency | MHz | 5m | GPU clock frequency |
| Namespace | Namespace GPU Memory Usage | % | 5m | GPU memory usage |
| Node | Node CPU Size [Allocatable] | cnt | 5m | Allocatable CPU size |
| Node | Node CPU Size [Capacity] | cnt | 5m | CPU capacity |
| Node | Node CPU Usage | % | 5m | CPU usage |
| Node | Node CPU Usage [Request] | % | 5m | CPU request ratio |
| Node | Node CPU Used | state | 5m | CPU utilization |
| Node | Node Filesystem Usage | % | 5m | Filesystem usage |
| Node | Node Memory Size [Allocatable] | bytes | 5m | Allocatable memory size |
| Node | Node Memory Size [Capacity] | bytes | 5m | Memory capacity |
| Node | Node Memory Usage | % | 5m | Memory usage |
| Node | Node Memory Usage [Request] | % | 5m | Memory request ratio |
| Node | Node Memory Workingset | bytes | 5m | Node memory working set |
| Node | Node Network In Bytes | bytes | 5m | Node network RX bytes |
| Node | Node Network Out Bytes | bytes | 5m | Node network TX bytes |
| Node | Node Network Total Bytes | bytes | 5m | Node network total bytes |
| Node | Node Pods [Failed] | cnt | 5m | Number of failed pods in the node |
| Node | Node Pods [Pending] | cnt | 5m | Number of pending pods in the node |
| Node | Node Pods [Running] | cnt | 5m | Number of running pods in the node |
| Node | Node Pods [Succeeded] | cnt | 5m | Number of succeeded pods in the node |
| Node | Node Pods [Unknown] | cnt | 5m | Number of unknown pods in the node |
| Pod | Pod CPU Usage [Limit] | % | 5m | Pod CPU usage limit ratio |
| Pod | Pod CPU Usage [Request] | % | 5m | Pod CPU request ratio |
| Pod | Pod CPU Usage | mc | 5m | Pod CPU usage |
| Pod | Pod Memory Usage [Limit] | % | 5m | Pod memory usage limit ratio |
| Pod | Pod Memory Usage [Request] | % | 5m | Pod memory request ratio |
| Pod | Pod Memory Usage | bytes | 5m | Pod memory usage |
| Pod | Pod Network In Bytes | bytes | 5m | Pod network RX bytes |
| Pod | Pod Network Out Bytes | bytes | 5m | Pod network TX bytes |
| Pod | Pod Network Total Bytes | bytes | 5m | Pod network total bytes |
| Pod | Pod Restart Containers | cnt | 5m | Number of container restarts in the pod |
| Workload | Workload Pods [Running] | cnt | 5m | - |
Table. Kubernetes Engine performance items

Container Registry

| Performance Item Group Name | Performance Item Name | Collection Unit | Collection Cycle | Description |
| --- | --- | --- | --- | --- |
| Container Registry | Image Pulls [Denied] | cnt | 1m | Number of denied image tag (digest) pulls |
| Container Registry | Image Pushs [Allowed] | cnt | 1m | Number of allowed image tag (digest) pushes |
| Container Registry | Image Pushs [Denied] | cnt | 1m | Number of denied image tag (digest) pushes |
| Container Registry | Image Scans [Allowed] | cnt | 1m | Number of allowed image tag (digest) scans |
| Container Registry | Image Scans [Denied] | cnt | 1m | Number of denied image tag (digest) scans |
| Container Registry | Image Tags [Deleted] | cnt | 1m | Number of deleted image tags (digests) |
| Container Registry | Images [Created] | cnt | 1m | Number of created images |
| Container Registry | Images [Deleted] | cnt | 1m | Number of deleted images |
| Container Registry | Logins [Allowed] | cnt | 1m | Number of allowed registry logins |
| Container Registry | Logins [Denied] | cnt | 1m | Number of denied registry logins |
| Container Registry | Repositories [Created] | cnt | 1m | Number of created repositories |
| Container Registry | Repositories [Deleted] | cnt | 1m | Number of deleted repositories |
| State | Instance State | state | 1m | Status check |
Table. Container Registry performance items

Networking Type

Internet Gateway

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
Internet GatewayNetwork In Total Bytes [Internet Delta]bytes5mCumulative traffic from Internet Gateway to VPC over 5 minutes (Internet)
※ Average traffic bps conversion formula: cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
Internet GatewayNetwork In Total Bytes [Internet]bytes5mRX bytes total
Internet GatewayNetwork Out Total Bytes [Internet Delta]bytes5mCumulative traffic from VPC to Internet Gateway over 5 minutes (Internet)
※ Average traffic bps conversion formula: cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
Internet GatewayNetwork Out Total Bytes [Internet]bytes5mTX bytes total
Table. Internet Gateway performance items
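
The bps conversion formula noted for the [Delta] items can be applied as in the following minimal sketch. The function name average_bps and the sample value are illustrative assumptions, not part of the service; the 300-second default corresponds to the 5m collection cycle.

```python
def average_bps(cumulative_bytes: int, interval_seconds: int = 300) -> float:
    """Convert cumulative traffic (bytes) over one collection cycle
    into average traffic in bits per second."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    # cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
    return cumulative_bytes / interval_seconds * 8


# Hypothetical sample: 37,500,000 bytes collected over one 5-minute cycle
print(average_bps(37_500_000))  # 1000000.0 (about 1 Mbps)
```

The same conversion applies to the [Delta] items of Load Balancer (OLD), Load Balancer Listener (OLD), and Direct Connect, because they use the same 5-minute collection cycle.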

Load Balancer (OLD)

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
Load BalancerCurrent Connectioncnt5mCurrent number of connections
Load BalancerTotal Connectioncnt5mTotal number of connections
Load BalancerTotal Connection [Delta]cnt5mTotal number of connections (delta value)
Load BalancerNetwork In Bytesbytes5mIn bytes
Load BalancerNetwork In Bytes [Delta]bytes5mCumulative traffic from client to Load Balancer over 5 minutes
※ Average traffic bps conversion formula: cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
Load BalancerNetwork Out Bytesbytes5mOut bytes
Load BalancerNetwork Out Bytes [Delta]bytes5mCumulative traffic from Load Balancer to client over 5 minutes
※ Average traffic bps conversion formula: cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
Load BalancerInstance Statestate5mLoad Balancer status
Table. Load Balancer performance items

Load Balancer Listener (OLD)

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
ListenerConnections [Current]cnt5mCurrent number of connections
ListenerConnections [Total Delta]cnt5mTotal number of connections (delta value)
ListenerConnections [Total]cnt5mTotal number of connections
ListenerInstance Statestate5mLB Listener status
ListenerNetwork In Bytesbytes5mIn bytes
ListenerNetwork In Bytes [Delta]bytes5mCumulative traffic from client to Load Balancer over 5 minutes
※ Average traffic bps conversion formula: cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
ListenerNetwork Out Bytesbytes5mOut bytes
ListenerNetwork Out Bytes [Delta]bytes5mCumulative traffic from Load Balancer to client over 5 minutes
※ Average traffic bps conversion formula: cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
Table. Load Balancer Listener performance items

Direct Connect

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
Direct ConnectNetwork In Bytesbytes5mCumulative traffic from Direct Connect to VPC
Direct ConnectNetwork In Bytes [Delta]bytes5mCumulative traffic from Direct Connect to VPC over 5 minutes
※ Average traffic bps conversion formula: cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
Direct ConnectNetwork Out Bytesbytes5mCumulative traffic from VPC to Direct Connect
Direct ConnectNetwork Out Bytes [Delta]bytes5mCumulative traffic from VPC to Direct Connect over 5 minutes
※ Average traffic bps conversion formula: cumulative traffic (bytes) / 300 (seconds) * 8 (bits)
Table. Direct Connect performance items

Load Balancer

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
StateInstance Statestate5mLB status
Load BalancerCurrent Connectioncnt5mCurrent number of connections
Load BalancerTotal L4 Connectioncnt5mTotal number of L4 connections
Load BalancerTotal L7 Connectioncnt5mTotal number of L7 connections
Load BalancerTotal TCP Connectioncnt5mTotal number of TCP connections
Load BalancerTotal Connectioncnt5mTotal number of connections
Load BalancerBytes processed in forward directionbytes5mForward network bytes
Load BalancerPackets processed in forward directioncnt5mForward network packets
Load BalancerBytes processed in reverse directionbytes5mReverse network bytes
Load BalancerPackets processed in reverse directioncnt5mReverse network packets
Load BalancerTotal failure actionscnt5mTotal number of failure actions
Load BalancerCurrent Requestcnt5mCurrent number of requests
Load BalancerCurrent responsecnt5mCurrent number of responses
Load BalancerTotal Requestcnt5mTotal number of requests
Load BalancerTotal Request Successcnt5mTotal number of successful requests
Load BalancerPeak Connectioncnt5mPeak number of connections
Load BalancerCurrent Connection Rate%5mCurrent SSL connection rate
Load BalancerLast response timems5mLast response time
Load BalancerFastest response timems5mFastest response time
Load BalancerSlowest response timems5mSlowest response time
Load BalancerCurrent SSL Connectioncnt5mCurrent number of SSL connections
Load BalancerTotal SSL Connectioncnt5mTotal number of SSL connections
Table. Load Balancer performance items

Load Balancer Listener

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
StateInstance Statestate5mLB status
Load BalancerCurrent Connectioncnt5mCurrent number of connections
Load BalancerTotal L4 Connectioncnt5mTotal number of L4 connections
Load BalancerTotal L7 Connectioncnt5mTotal number of L7 connections
Load BalancerTotal TCP Connectioncnt5mTotal number of TCP connections
Load BalancerTotal Connectioncnt5mTotal number of connections
Load BalancerBytes processed in forward directionbytes5mForward network bytes
Load BalancerPackets processed in forward directioncnt5mForward network packets
Load BalancerBytes processed in reverse directionbytes5mReverse network bytes
Load BalancerPackets processed in reverse directioncnt5mReverse network packets
Load BalancerTotal failure actionscnt5mTotal number of failure actions
Load BalancerCurrent Requestcnt5mCurrent number of requests
Load BalancerCurrent responsecnt5mCurrent number of responses
Load BalancerTotal Requestcnt5mTotal number of requests
Load BalancerTotal Request Successcnt5mTotal number of successful requests
Load BalancerPeak Connectioncnt5mPeak number of connections
Load BalancerCurrent Connection Rate%5mCurrent SSL connection rate
Load BalancerLast response timems5mLast response time
Load BalancerFastest response timems5mFastest response time
Load BalancerSlowest response timems5mSlowest response time
Load BalancerCurrent SSL Connectioncnt5mCurrent number of SSL connections
Load BalancerTotal SSL Connectioncnt5mTotal number of SSL connections
Table. Load Balancer Listener performance items

Load Balancer Server Group

Performance Item Group NamePerformance Item NameCollection UnitCollection CycleDescription
Server GroupInstance Statestate5mLB Server Group status
Server GroupPeak Connectioncnt5mServer group peak number of connections
Server GroupHealthy hostcnt5mServer group number of healthy hosts
Server GroupUnhealthy hostcnt5mServer group number of unhealthy hosts
Server GroupRequest Countcnt5mNumber of requests
Server GroupResponse Countcnt5mNumber of responses
Server Group2xx Response Countcnt5mNumber of 2xx responses
Server Group3xx Response Countcnt5mNumber of 3xx responses
Server Group4xx Response Countcnt5mNumber of 4xx responses
Server Group5xx Response Countcnt5mNumber of 5xx responses
Table. Load Balancer Server Group performance items

2.9 - Appendix C. Service-specific status check

Compute type

Virtual Server

Performance Item NameDescriptionValue
Instance State [Basic]Instance StatusNOSTATE, RUNNING, BLOCKED, PAUSED, SHUTDOWN, SHUTOFF, CRASHED, PMSUSPENDED, LAST
Fig. Virtual Server Status Check

GPU Server

Performance Item NameDescriptionValue
Instance State [Basic]Instance StatusNOSTATE, RUNNING, BLOCKED, PAUSED, SHUTDOWN, SHUTOFF, CRASHED, PMSUSPENDED, LAST
Fig. GPU Server Status Check

Bare Metal Server

Performance Item NameDescriptionValue
N/AN/AN/A
Fig. Bare Metal Server Status Check
Caution
Bare Metal Server does not provide status information through Cloud Monitoring.

Multi-node GPU Cluster [Cluster Fabric]

Performance Item NameDescriptionValue
N/AN/AN/A
Fig. Multi-node GPU Cluster [Cluster Fabric] status check
Caution
Multi-node GPU Cluster [Cluster Fabric] does not provide status information through Cloud Monitoring.

Multi-node GPU Cluster [Node]

Performance Item NameDescriptionValue
N/AN/AN/A
Fig. Multi-node GPU Cluster [Node] Status Check
Caution
Multi-node GPU Cluster [Node] does not provide status information through Cloud Monitoring.

Storage type

File Storage

Performance Item NameDescriptionValue
Instance StateFile Storage volume status* 1: Online
* 0: Any other status (Offline)
Fig. File Storage status check

Object Storage

Performance Item NameDescriptionValue
N/AN/AN/A
Table. Object Storage status check
Caution
Object Storage does not provide status information through Cloud Monitoring.

Block Storage(BM)

Performance Item NameDescriptionValue
Instance StateBlock Storage volume status* 1: running (normal)
* 0: down (abnormal)
Fig. Block Storage(BM) Status Check

Block Storage(VM)

Performance Item NameDescriptionValue
Instance StateBlock Storage volume status* 1: running (normal)
* 0: down (abnormal)
Table. Block Storage(VM) Status Check

Database type

PostgreSQL(DBaaS)

Performance Item NameDescriptionValue
Instance State [PID]postgres process PID* PID: when the postgres process exists
* -1: when the process does not exist
Fig. PostgreSQL(DBaaS) Status Check
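
The [PID] status items in this appendix report the process PID while the process exists and -1 when it does not. A minimal sketch of how a collected value could be interpreted is shown below. The function names are hypothetical, and treating any non-positive value as "not running" is an assumption made for this example.

```python
PROCESS_MISSING = -1  # value reported when the process does not exist


def process_is_running(state_value: int) -> bool:
    """Return True when the status value is a PID, False when it is -1.

    Assumption: any value that is not a positive PID counts as not running.
    """
    return state_value > 0


def all_processes_running(state_values: dict) -> bool:
    """Check several [PID] items at once (for services with more than one process)."""
    return all(process_is_running(v) for v in state_values.values())


print(process_is_running(4312))             # True
print(process_is_running(PROCESS_MISSING))  # False
print(all_processes_running({"Instance State [PID]": 4312, "Safe PID": -1}))  # False
```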

MariaDB(DBaaS)

Performance Item NameDescriptionValue
Safe PIDmariadb_safe process PID* PID: when the mariadb_safe process exists
* -1: when the process does not exist
Instance State [PID]mariadb process PID* PID: when the mariadb process exists
* -1: when the process does not exist
Fig. MariaDB(DBaaS) Status Check

MySQL(DBaaS)

Performance Item NameDescriptionValue
Instance State [PID]mysqld process PID* PID: when the mysqld process exists
* -1: when the process does not exist
Fig. MySQL(DBaaS) Status Check

Microsoft SQL Server(DBaaS)

Performance Item NameDescriptionValue
Instance State [Cluster]MSSQL cluster configuration status* PID: when the mssql process exists
* -1: when the process does not exist
Instance State [PID]sqlservr.exe process PID* On Microsoft SQL Server, the secondary server also runs with a PID, so the status cannot be determined from the PID alone
Fig. Microsoft SQL Server(DBaaS) status check

EPAS(DBaaS)

Performance Item NameDescriptionValue
Instance State [PID]Postgres process PID* PID: When the postgres process exists
* -1: When the process does not exist
Fig. EPAS(DBaaS) Status Check

CacheStore(DBaaS)

Redis

Performance Item NameDescriptionValue
Instance State [PID]Redis-server process PID* -1: when the process does not exist
Sentinel State [PID]Sentinel process PID* -1: when the process does not exist
Fig. Redis Status Check

Valkey

Performance Item NameDescriptionValue
Instance State [PID]Valkey-server process PID* -1: when the process does not exist
Sentinel State [PID]Sentinel process PID* -1: when the process does not exist
Table. Valkey Status Check

Data Analytics type

Event Streams

Performance Item NameDescriptionValue
AKHQ State [PID]akhq process PID* PID: when the akhq process exists
* -1: when the process does not exist
Instance State [PID]kafka process PID* PID: when the kafka process exists
* -1: when the process does not exist
Zookeeper State [Pid]zookeeper process PID* PID: when the zookeeper process exists
* -1: when the process does not exist
Fig. Event Streams Status Check

Search Engine

Performance Item NameDescriptionValue
Instance State [PID]Elasticsearch process PID* PID: When the Elasticsearch process exists
* -1: When the process does not exist
Kibana State [PID]Kibana process PID* PID: When the Kibana process exists
* -1: When the process does not exist
Fig. Search Engine Status Check

Elasticsearch

Performance Item NameDescriptionValue
Instance State [PID]Elasticsearch process PID* -1: when the process does not exist
Kibana State [PID]Dashboard process PID* -1: when the process does not exist
Fig. Elasticsearch Status Check

Opensearch

Performance Item NameDescriptionValue
Instance State [PID]Opensearch process PID* -1: when the process does not exist
Dashboard State [PID]Dashboard process PID* -1: when the process does not exist
Table. Opensearch status check

Vertica(DBaaS)

Performance Item NameDescriptionValue
Instance State [PID]Vertica Process PID* -1: when the process does not exist
Fig. Vertica(DBaaS) status check

Container type

Kubernetes Engine

Performance Item NameDescriptionValue
Instance StateCluster Status* 1: Status check query sum(up{job="kubernetes-apiservers"}) returns a value greater than 0
* 0: Status check query sum(up{job="kubernetes-apiservers"}) returns a value less than or equal to 0
Fig. Kubernetes Engine status check
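
The rule above can be sketched as follows, assuming the numeric result of the status check query sum(up{job="kubernetes-apiservers"}) has already been obtained. The function name cluster_state and the sample values are hypothetical; how the query is executed is outside the scope of this sketch.

```python
def cluster_state(apiserver_up_sum: float) -> int:
    """Map the status check query result to the Instance State value.

    1: the query returns a value greater than 0
    0: the query returns a value less than or equal to 0
    """
    return 1 if apiserver_up_sum > 0 else 0


print(cluster_state(3))  # 1 (at least one API server target reports up)
print(cluster_state(0))  # 0 (no API server target reports up)
```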

Container Registry

Performance Item NameDescriptionValue
Instance StateContainer Registry Status* 1: running (normal)
* 0: down (abnormal)
Fig. Container Registry Status Check

Networking type

Internet Gateway

Performance Item NameDescriptionValue
N/AN/AN/A
Fig. Internet Gateway Status Check
Caution
Internet Gateway does not provide status information through Cloud Monitoring.

Load Balancer(OLD)

Performance Item NameDescriptionValue
Instance StateLoad Balancer statusDetermined by provisioning_status in API call result
* 1: ACTIVE
* 0: ETC
Fig. Load Balancer(OLD) Status Check

Load Balancer Listener(OLD)

Performance Item NameDescriptionValue
Instance StateLoad Balancer Listener statusDetermined by provisioning_status in API call results
* 1: ACTIVE
* 0: ETC
Fig. Load Balancer Listener(OLD) Status Check

Load Balancer

Performance Item NameDescriptionValue
Instance StateLoad Balancer statusDetermined by provisioning_status in API call result
* 1: ACTIVE
* 0: ETC
Fig. Load Balancer Status Check

Load Balancer Listener

Performance Item NameDescriptionValue
Instance StateLoad Balancer Listener status, determined by provisioning_status in API call results
* 1: ACTIVE
* 0: ETC
Fig. Load Balancer Listener Status Check

Load Balancer Server Group

Performance Item NameDescriptionValue
Instance StateStatus of Load Balancer Server Group, determined by provisioning_status in API call results
* 1: ACTIVE
* 0: ETC
Fig. Load Balancer Server Group Status Check
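
For the Load Balancer items, Instance State is derived from provisioning_status in the API call result. The sketch below illustrates that mapping. The function name is hypothetical, the exact-match, case-sensitive comparison is an assumption, and the non-ACTIVE sample values are placeholders rather than a list of statuses the API actually returns.

```python
def lb_instance_state(provisioning_status: str) -> int:
    """Map provisioning_status to the Instance State value.

    1: ACTIVE
    0: ETC (any other value)
    """
    return 1 if provisioning_status == "ACTIVE" else 0


print(lb_instance_state("ACTIVE"))  # 1
print(lb_instance_state("ERROR"))   # 0 (placeholder for any non-ACTIVE value)
```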

Direct Connect

Performance Item NameDescriptionValue
N/AN/AN/A
Fig. Direct Connect Status Check
Caution
Direct Connect does not provide status information through Cloud Monitoring.

Cloud WAN

Performance Item NameDescriptionValue
Instance StateAttachment connection status* 0: down
* 1: up
* 2: testing
* 3: unknown
Fig. Cloud WAN Status Check
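
The four Cloud WAN status codes can be decoded as in the following minimal sketch. The AttachmentState and describe_attachment_state names are hypothetical, and the handling of values outside 0 to 3 is an assumption added for this example.

```python
from enum import IntEnum


class AttachmentState(IntEnum):
    """Attachment connection status codes listed in the table above."""
    DOWN = 0
    UP = 1
    TESTING = 2
    UNKNOWN = 3


def describe_attachment_state(value: int) -> str:
    """Return the status name for a code, or a marker for unlisted codes."""
    try:
        return AttachmentState(value).name.lower()
    except ValueError:
        return f"unrecognized ({value})"


print(describe_attachment_state(1))  # up
print(describe_attachment_state(2))  # testing
print(describe_attachment_state(9))  # unrecognized (9)
```

Unlike the 1/0 status items of other services, a value of 0 here means down and only 1 means the attachment is up, so an event condition should compare against the specific code rather than treat every non-zero value as normal.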

Global CDN

Performance Item NameDescriptionValue
Instance StateGlobal CDN Status* 1: running (normal)
* 0: down (abnormal)
Fig. Global CDN Status Check

3 - API Reference

API Reference

4 - Release Note

Cloud Monitoring

2025.07.01
FEATURE Cloud Monitoring Integration Service Added
  • In July 2025, services linked with Cloud Monitoring were added.
    • Newly linked services: Compute(Multi-node GPU Cluster [Cluster Fabric], Multi-node GPU Cluster [Node]), Storage(Block Storage(BM), Block Storage(VM)), Networking(Cloud WAN, Global CDN), Database(Valkey), Data Analytics(Opensearch, Vertica(DBaaS))
2025.02.27
FEATURE Cloud Monitoring Integration Service Added
  • In February 2025, services linked with Cloud Monitoring were added.
    • Newly linked services: Container(Container Registry), Database(EPAS, Microsoft SQL Server), Data Analytics(Event Streams, Search Engine), Networking(Load Balancer, Load Balancer Listener, Load Balancer Server Group, VPN)
2024.10.01
NEW Cloud Monitoring Service Official Version Release
  • The Cloud Monitoring service has been released. It collects usage and change information from infrastructure resources in operation and supports a stable cloud operating environment by generating events and sending notifications when a set threshold is exceeded.