Inspur software tool for HPC cluster optimization and diagnosis TEYE V2.0

TEYE is a HPC application feature analysis software, to extract the resource utilization information of large scale HPC cluster system, and feedback application feature in real time.  TEYE exploits the potential computing capability of the application for bottleneck diagnosis, algorithm and parallel efficiency improvement.   System resource optimization and computing performance improvement are top priority for TEYE.

TEYE trilogy
System Parameters monitoring- Analysis & Diagnosis – System Optimization

System Parameters monitoring – comprehensive analysis

Limited system resources are occupied by TEYE installation and usage.  There are more than 40Microarchitecture parameters (CPU, MEM, network and, file system), which could be accurately monitored and extracted by TEYE.   These large amounts of raw data are provided for post analysis, diagnosis and optimization.       

CPU Microarchitecture
•    CPU:Usr%, sys%, idle%, iowait%
•    Float:X87 GFLOPS, SP/DP SSE scalar/packed GFLOPS, SP/DP AVX scalar/packed GFLOPS
•    Vectorization:SP/DP SSE VEC, SP/DP AVX VEC
•    Instruction efficiency:CPI

MEM:
•    MEM volume:Total, used, cached, buffered
•    MEM access:Memory reading bandwidth、Memory writing bandwidth
•    PCI-Express acess:PCIE reading-bandwidth、PCIE writing-bandwidth

 

Network:
•    Equipment support:Gigabit, Infiniband
•    Protocol support:TCP/IP, UDP, RDMA, IPoIB
•    Traffic monitoring:Ethernet-received/ transmitted、IB-received/ transmitted
•    Package monitoring:Ethernet/IB  package receive/transmitted

File system:
•    Local disk:local reading/writing、local block data reading/writing
•    NFS:NFS client reading/writing、NFS server reading/writing

Analysis & Diagnosis – Raw data visualization
Based on knowledge and experiences on HPC application, Inspur TEYE offers more readable and flexible data graphics from raw data extraction.

Multiple analysis methods

System-level: analysingthe running status of each node, to find out the key parameters.
Node-level :  analysing the running status of parameters, to find out the bottleneck and output the analysis chart report.
Parameters-level: comparing with the same parameters from different nodes, analysing the load balance of nodes, and eliminates the bottleneck of nodes.


        System-level                                                  Node-level                                  Parameters-level

Various graphic reports

Distribution of data:
Displaying featured data and probability distribution of interval, so as to understand the feature of the application and the optimization direction
Hot spot analysis:
Automatic analysis and identification of the points where performance data changes violently, analysing the feature of application at this point, providing the optimization.
Radar map of features:
According to the requirements of main indicators, generated radar map of features, finding the key parameters and performance bottlenecks

        Hot spot analysis                                        Distribution of data                       Radar map

 

Optimization – efficient hardware & software configuration

Inspur application tuning Engineer combines graphic data with industry experience, to configure the most suitable solution for end user requirements, minimize the gap between hardware platform and application software, and maximize cluster performance and resource utilization.

  • Hardware optimization
    According to the analysis result, to supplement the bottleneck resources, and gain the best performance with the minimum investment.
  • Software optimization:
    Through coding level optimization, to parallelise an application with existing infrastructure.

Core value
In order to improve the parallel efficiency of application, optimize cluster performance, and minimize power consumption.  TEYE outputs various graphic charts with comprehensive cluster Microarchitecture parameters, for application analysis and diagnosis, and further system configuration.
TEYE has been become an efficient tool of analysis and diagnosis for HPC administrators, application developer and system optimizer in a large scale HPC environment.

 

 

Home Products Industry Application Partners News About us
©1996 - 2014 INSPUR Co., Ltd. 鲁ICP备05019369号