IDENTIFYING SLOW NODES IN DISTRIBUTED APPLICATIONS
Inventors
Gregg Bernard Lesartre, Aaron J. Hoelscher, Duncan Roweth
Abstract
One aspect of the instant disclosure provides a method and system for identifying slow nodes among a plurality of nodes executing a distributed application. During operation, in response to receiving a trigger signal at a node, the system may monitor traffic to or from the node by measuring durations of one or more non-paused idle periods. In response to determining that a duration of a non-paused idle period falls within a predetermined idle-period duration range, the system may increment a corresponding counter. The system may generate a histogram for the node based on counter values corresponding to a plurality of idle-period duration ranges and identify one or more slow nodes based on histograms associated with the plurality of nodes.
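The counting and comparison steps described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation. The bin edges, the microsecond unit, the names IdleHistogram and find_slow_nodes, and the comparison rule (flagging a node whose share of long idle periods exceeds the median share across nodes by a fixed margin) are all assumptions made for illustration; the abstract does not specify the duration ranges or how the histograms are compared.

```python
from bisect import bisect_right
from statistics import median

# Hypothetical lower edges of the idle-period duration ranges, in microseconds.
# Bins: [1, 10), [10, 100), [100, 1000), [1000, inf). Not taken from the source.
BIN_EDGES_US = [1, 10, 100, 1000]


class IdleHistogram:
    """One counter per idle-period duration range for a single node."""

    def __init__(self, edges=BIN_EDGES_US):
        self.edges = list(edges)
        self.counters = [0] * len(self.edges)

    def record(self, duration_us, paused=False):
        """Increment the counter whose range contains a non-paused idle period."""
        # Paused periods are excluded; periods shorter than the first edge
        # fall outside every range and are ignored.
        if paused or duration_us < self.edges[0]:
            return
        index = bisect_right(self.edges, duration_us) - 1
        self.counters[index] += 1


def long_idle_fraction(hist, first_long_bin):
    """Share of recorded idle periods that fall in the 'long' bins."""
    total = sum(hist.counters)
    if total == 0:
        return 0.0
    return sum(hist.counters[first_long_bin:]) / total


def find_slow_nodes(histograms, first_long_bin=2, margin=0.25):
    """Assumed rule: flag nodes whose long-idle share exceeds the median by margin."""
    fractions = {
        node: long_idle_fraction(hist, first_long_bin)
        for node, hist in histograms.items()
    }
    typical = median(fractions.values())
    return sorted(node for node, f in fractions.items() if f - typical > margin)


def build_histogram(durations_us):
    """Helper for the usage example: fill a histogram from measured durations."""
    hist = IdleHistogram()
    for duration in durations_us:
        hist.record(duration)
    return hist
```

In use, each node would fill its own histogram after the trigger signal, and a collector would then compare them, for example find_slow_nodes({"a": build_histogram([2, 3, 5, 20]), "b": build_histogram([4, 6, 8, 30]), "c": build_histogram([5, 500, 1500, 2000])}). Comparing against the median rather than a fixed threshold is one possible design choice; it keeps the rule relative to how the other nodes of the same application behave.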
CPC Classifications
Filing Date
2024-09-26
Application No.
18897686