Wednesday, March 29, 2017

Netezza: nzhealthcheck System Health Check tool




With Netezza NPS release 7.2  the new version of nzhealthcheck 2.3.1.2 tool has introduced a number of changes:

1)  creates 26 views and 1 table in a target database. (See list of table and views here

          TIP: Change target database by setting NZ_DATABASE variable if needed. 

2)  loads data into the target table leaving nzlog and data files behing that clutter /tmp directory. (See session example

          TIP: Schedule periodic /tmp directory cleanup :

          find /tmp -maxdepth 1 -name "tmp.*" -mtime +2 -print 2>/dev/null | xargs rm -rf 
    



Rules document verified by  nzhealthcheck 2.3.1.2 :




nzhealthcheck usage examples

/nz/support/bin/adm/nzhealthcheck/nzhealthcheck --help
Usage: nzhealthcheck [OPTION...] [sysinfo] [minisysinfo]

Netezza System Health Check
        Netezza System Health Check is a tool that scans the Netezza appliance
        for hardware and software issues. The results appear on-screen in a text
        report that lists the issues found and describes how that issue impacts
        NPS operations. The report also offers guidance to fix the issue.

Notes:
        To run the tool, log in as the nz user and run the nzhealthcheck command.
        No parameters are required. The Netezza system can be in any operational
        state, although some issues, especially related to SPU components, are
        found only when the system is in the Online state.

  sysinfo                    Produce Sysinfo report
  minisysinfo                Produce MiniSysinfo report

  -a, --detail               Show all rule results
  -S, --standAlone           Run nzhealthcheck in stand-alone mode
  -?, --help                 Give this help list
      --usage                Give a short usage message
  -V, -v, --version          Print version

 /nz/support/bin/adm/nzhealthcheck/nzhealthcheck --version
2.3.1.2


 /nz/support/bin/adm/nzhealthcheck/nzhealthcheck --detail
Please run with -S as this is nzhealthcheck standalone version

 /nz/support/bin/adm/nzhealthcheck/nzhealthcheck -S --detail

Netezza System Health Check 2.3.1.2
Collecting monitoring data
Please provide root password when prompted to enable host monitoring or hit ^D to skip

Password:

Evaluating troubleshooting rules
... full output in attachment ... 


/nz/support/bin/adm/nzhealthcheck/nzhealthcheck -S minisysinfo

Netezza System Health Check 2.3.1.2
 + Product     : IBM PureData System for Analytics N1001-005
 + Model       : P50X_A
 + HPF         : 5.3.4
 + FDT         : 4.1.1.1
 + NPS         : 7.2.1.3-P4 [Build 49731]
 + NPS State   : online
 + MTM(s)      : MTM has not been set
 + NzId        :
 + NZ Owner    : nz
 + OS          : Red Hat Enterprise Linux Server release 5.10 (Tikanga)
 + Kernel      : 2.6.18-371.9.1.el5
 + HealthCheck : 2.3.1.2 [20170217043257]
 + Hostname    : svr
 + NPS  Up Time: 6 days, 21 hrs, 22 mins, 15 secs
 + Host Up Time: Host1 : 581 days 8 hours 19 minutes 27 seconds
 + Host Up Time: Host2 : 581 days 8 hours 17 minutes 34 seconds

All done



/nz/support/bin/adm/nzhealthcheck/nzhealthcheck -S sysinfo
Netezza System Health Check 2.3.1.2

Collecting monitoring data

Please provide root password when prompted to enable host monitoring or hit ^D to skip

Password:

Preparing SysInfo Report

- Frontend Hosts Utilization and Statistics
--- Host DIMMs
--- Host Fans
--- Host Power
--- Host Power Supply
--- Host SAS Controllers
--- Host SAS Controllers Batteries
--- Host Disks
--- Host CPU Utilization
--- Host Memory Utilization
--- Host Network Interfaces Configuration
--- Host Network Interfaces - RX Stats
--- Host Network Interfaces - TX Stats
--- Host Cluster State
--- crmnode
--- drbd
--- Host Filesystem Utilization
--- Host Timeshift
--- Host Uptime
--- Host System
- Open files on host
- Chassis Network Subsystem State and Usage Statistics
--- Ports Configuration
--- Ports RX Stats
--- Ports TX Stats
--- Management Ports
- Management Network Subsystem State and Usage Statistics
--- Ports Configuration
--- Ports RX Stats
--- Ports TX Stats
--- Management Ports
- Chassis Components Configuration and State
--- Overall System Status
--- Blades
--- MMs
--- Fans
--- Blowers
--- PWRs
- Blade detailed configuration and usage statistics
--- CPU Information
--- Memory Usage (all values in kB)
--- Network Interfaces Configuration
--- Network Interfaces - RX Stats
--- Network Interfaces - TX Stats
--- SDR (reported via ipmitool)
- RPC and Outlets Configuration
- NPS SPUs
- NPS DACs
- NPS FPGAs
- Section Disk Enclosures
--- Disk Enclosure Slots
--- Disk Enclosure Fans
--- Disk Enclosure Management Modules
--- Disk Enclosure Power Units
--- Disk Enclosure Voltage
--- Disk Enclosure Temperature
- Disks in Enclosures
- Disks Logical Partitions
- SAS PHYs of ESMs
- SAS PHYs of HBAs
- SAS PHYs
- Switches Zonemaps
- SAS PHYs of Switches
- SAS PHYs Switches Errors
- SPU disk paths
--- Paths
- SAS Switch errors
- NPS version and state
- NPS Inventory (nzhw)
- NPS Data Slices (nzds)
- /nz/kit/bin/nzds -regenstatus results
- NPS Catalog Size
- NPS Catalog Files larger that 1M
- /nzlocal/scripts/hpf_health results
- Environment variables
- /opt/nz/fdt/sys_rev_check results
- /nz/kit/bin/nzstats results
- /opt/nz-hwsupport/pts/pts-check.pl results
- /nzlocal/scripts/hpf_ping.sh results
- Results of concheck
- Disk error correction delay
- Number of operations transferred by disks compared to other disks present in appliance
- Disk resets
- CallHome information

Sysinfo Report stored in /nz/support-IBM_Netezza-7.2.1.3.P4-170217-0432/bin/adm/nzhealthcheck/bin/..//Netezza_System_Health_Check_Sysinfo_Report.log

All done


Views and a table create by nzhealthcheck 2.3.1.2.  TIP: Create reminder comments to differentiate these objects from the rest of the clutter. 

--- Table ---
  • V_DISK_DOM
--- Views ---
  • V_DFPGA_ERRORS
  • V_DFPGA_ERRORS_BY_DATASLICE
  • V_DFPGA_ERRORS_BY_DISK
  • V_DFPGA_ERRORS_BY_ENCL
  • V_DFPGA_ERRORS_BY_ERROR
  • V_DFPGA_ERRORS_BY_LBA
  • V_DFPGA_ERRORS_BY_LOCATION
  • V_DFPGA_ERRORS_BY_SPA
  • V_DFPGA_ERRORS_BY_SPU
  • V_DFPGA_ERRORS_BY_SPU_DATASLICE
  • V_DFPGA_ERRORS_BY_TABLE
  • V_DFPGA_ERRORS_RETURNED
  • V_DIR_BAD_DATA
  • V_DIR_DUPLICATE
  • V_DIR_DUPLICATE_KEY
  • V_DISK_SMARTATTR_BYNAME
  • V_DISK_SMARTATTR_DETAIL
  • V_PART_BAD_DATA
  • V_PART_DUPLICATE
  • V_PART_DUPLICATE_KEY
  • V_SCSI_DISK
  • V_SPU_ORPHAN_TABLE
  • V_ZMAP_BAD_DATA
  • V_ZMAP_DUPLICATE
  • V_ZMAP_DUPLICATE_KEY
  • V_ZMAP_ORPHAN_EXTENT

DDL definition: 



Rules can be manually modified to suffice the individual requirements. See Rules PDF document attached above. 

Location: /nz/support-IBM_Netezza-7.2.0.5.P1-150715-2158/bin/adm/nzhealthcheck/





REFERENCE

 System Health Check tool  (PureData System for Analytics 7.2.1)


No comments:

Post a Comment