Help Center > > User Guide> O&M> O&M

O&M

Updated at: Oct 09, 2019 GMT+08:00

The O&M page provides full-link, multi-layer, and one-stop O&M for resources, applications, and user experience. Specifically, this page displays 12 types of cards: application performance monitoring, user experience monitoring, infrastructure monitoring, application monitoring, statistics information, host monitoring (CPU and memory), service monitoring (CPU and memory), instance monitoring (CPU and memory), host monitoring (disk), host monitoring (network), cluster monitoring (CPU and memory), and cluster monitoring (disk) cards.

Application Performance Monitoring Card

Figure 1 Application performance monitoring

This card mainly displays application performance data, including the Application Performance Index (Apdex) circle graph and metric trend graph. The data is collected and processed by Application Performance Management (APM). If there is no data in this card and is displayed when you move the mouse cursor to this card, click the button to enable APM.

  • Apdex circle graph. Each segment in the graph indicates an application. The applications monitored are those using APM probes. (For details, see APM-Getting Started.) The color of each segment indicates the user satisfaction with application performance (red: poor user experience, 0 ≤ Apdex < 0.3; yellow: fair user experience, 0.3 ≤ Apdex < 0.75; green: good user experience, 0.75 ≤ Apdex ≤ 1). The proportion of each segment is related to the Apdex value of the application. The larger the Apdex value, the higher the proportion, and the higher the user satisfaction with the application performance. In the preceding circle graph, there are three segments, indicating three applications. One application has poor user experience, one fair user experience, and one good user experience. The proportion of the green segment is the largest, so the corresponding application has the largest Apdex value and highest user satisfaction.
  • Throughput and latency trend graphs in the last 1 hour, with a statistical period of 5 minutes. The values above the trend graphs respectively indicate the average throughput and latency of all applications in the last one hour.
  • Top 3 applications with high throughput and top 3 applications with long latency in the last one hour.

Infrastructure Monitoring Card

Figure 2 Infrastructure monitoring

This card displays infrastructure metrics. You can select a cluster to view information about one or all clusters. In the preceding figure, all clusters are selected. Information includes:

  • Host running status, CPU usage, physical memory usage, and disk usage.
  • Trend graph of network traffic data in the last one hour. The values of each point in the graph respectively indicate the total downlink and uplink traffic of all clusters in one minute. The values displayed above the trend graph respectively indicate the total downlink and uplink traffic of all clusters at the latest time point.
  • Trend graph of CPU usage and memory usage in the last one hour. The values of each point in the graph respectively indicate the average CPU usage and memory usage of all clusters in one minute. The values displayed above the trend graph respectively indicate the average CPU usage and memory usage of all clusters at the latest time point.

Application Monitoring Card

Figure 3 Application monitoring

This card displays application metrics.

  1. Running status of applications, services, containers, and instances.
  2. Select an application. The following information is displayed:
    • Trend graph of network traffic data in the last one hour. The values of each point in the graph respectively indicate the total downlink and uplink traffic of the selected application in one minute. The values displayed above the trend graph respectively indicate the downlink and uplink traffic of the selected application at the latest time point.
    • Trend graph of CPU usage and memory usage in the last one hour. The values of each point in the graph respectively indicate the CPU usage and memory usage of the selected application in one minute. The values displayed above the graph indicate the CPU usage and memory usage of the selected application at the latest time point.

Statistics Information Card

Figure 4 Statistics information

This card displays the trend graph of alarms, log quotas, threshold rules, log usage, and host usage. The trend graphs of log and host usage respectively display data in the last 15 days.

Host Monitoring (CPU and Memory) Card

Figure 5 Host monitoring (CPU and memory)

The information includes:

  • Top 5 hosts with high CPU and memory usage in the last one minute.
  • Trend graph of the CPU and memory usage of the selected host in the last one hour. The values of each point in the graph respectively indicate the CPU and memory usage of the host in the last one minute.
  • CPU and memory usage of the selected host at the latest time point, which is above the graph.

Service Monitoring (CPU and Memory) Card

Figure 6 Service monitoring (CPU and memory)

The information includes:

  • Top 5 services with high CPU and memory usage in the last one minute.
  • Trend graph of the CPU and memory usage of the selected service in the last one hour. The values of each point in the graph respectively indicate the CPU and memory usage of the service in the last one minute.
  • CPU and memory usage of the selected service at the latest time point, which is above the trend graph.
  • option, which can be selected to hide system services.

Instance Monitoring (CPU and Memory) Card

Figure 7 Instance monitoring (CPU and memory)

The information includes:

  • Top 5 instances with high CPU and memory usage in the last one minute.
  • Trend graph of the CPU and memory usage of the selected instance in the last one hour. The values of each point in the graph respectively indicate the CPU and memory usage of the instance in the last one minute.
  • CPU and memory usage of the selected instance at the latest time point, which is above the trend graph.
  • option, which can be selected to hide system instances.

Host Monitoring (Disk) Card

Figure 8 Host monitoring (disk)

The information includes:

  • Top 5 hosts with high disk read/write rate in the last one minute.
  • Trend graph of the disk read/write rate of the selected host in the last one hour. The values of each point in the graph respectively indicate the disk read/write rate of the selected host in the last one minute.
  • Disk read/write rate of the selected host at the latest time point, which is above the trend graph.

Host Monitoring (Network) Card

Figure 9 Host monitoring (network)

The information includes:

  • Top 5 hosts with high uplink/downlink network traffic in the last one minute.
  • Trend graph of the uplink/downlink network traffic of the selected host in the last one hour. The values of each point in the graph respectively indicate the uplink/downlink network traffic of the selected host in the last one minute.
  • Uplink/downlink network traffic of the selected host at the latest time point, which is above the trend.

Cluster Monitoring (CPU and Memory) Card

Figure 10 Cluster monitoring (CPU and memory)

The information includes:

  • Top 5 clusters with high CPU and memory usage in the last one minute.
  • Trend graph of the CPU and memory usage of the selected cluster in the last one hour. The values of each point in the graph respectively indicate the CPU and memory usage of the cluster in the last one minute.
  • CPU and memory usage of the selected cluster at the latest time point, which is above the trend graph.

Cluster Monitoring (Disk) Card

Figure 11 Cluster monitoring (disk)

The information includes:

  • Top 5 clusters with high disk usage in the last one minute.
  • Trend graph of the disk usage of the selected cluster in the last one hour. The value of each point in the graph indicates the disk usage of the cluster in the last one minute.
  • Disk usage of the selected cluster at the latest time point, which is above the trend graph.

Related Operations

You can perform the operations described in Table 1.

Table 1 Related operations

Operation

Description

Adding a card to Favorites

If you do not want to view a card, click in the upper right corner of the card and choose Add to Favorites. After a card is added to Favorites, it is hidden from the O&M page. If you want to view the card later, obtain it from Favorites.

Adding the top 5 resource graph in the card to the dashboard

Click in the upper right corner of the card and choose Add to Dashboard.

NOTE:
  • This operation cannot be performed on the application performance monitoring card, user experience monitoring card, infrastructure monitoring card, application monitoring card, or statistics information card.
  • There are multiple metrics in the top 5 resource graph in the card. If you specify a metric for sorting and then add the graph to the dashboard, the metric data of the top 5 resources will be displayed on the dashboard.
  • Differences between the top 5 resource graph in the card and that created in the dashboard:

    1. Statistical ranges are different. For the former, resources in all clusters are covered. For the latter, only resources in a cluster are covered.

    2. Metric ranges are different. For the former, only the metrics displayed in the graph can be added to the dashboard. These metrics are fixed. For the latter, many metrics are available and can be customized.

    3. For the former, only top 5 resources can be viewed. For the latter, both top 5 and top 15 resources can be viewed.

    4. For the former, data cannot be edited after being added to the dashboard. For the latter, data can be edited.

Zooming in a metric graph

Click in the upper right corner of the metric graph.

Drilling down blue texts

Click the blue texts, such as Host, Application, or Service to drill down to the details page.

Did you find this page helpful?

Submit successfully!

Thank you for your feedback. Your feedback helps make our documentation better.

Failed to submit the feedback. Please try again later.

Which of the following issues have you encountered?







Please complete at least one feedback item.

Content most length 200 character

Content is empty.

OK Cancel