How to Perform a Detail Exadata Health Check

Exadata is a significant investment for any customer and one should make sure to maximize investment by configuring Exadata machine as per best practices and utilize all the features of engineered systems. Oracle has provided an array of tools for Exadata machine, but we see a gap between standard Exadata configuration vs a truly optimize Exadata machine. Exachk is a great tool provided by Oracle to validate Exadata configuration and Oracle best practices, but it’s designed as a standard tool for all Exadata machines. Exachk is not specific to a particular type of workload or application and doesn’t investigate enchantment opportunities to achieve extreme performance from Exadata machine.

That is why you should perform a detail Exadata health check of your Exadata machine which goes above and beyond Exachk validation and Oracle Enterprise Manager monitoring capabilities. The goal of this health check is to maximize the Exadata investment and reduce the number of incidents which can impact the availability of critical applications. Here is list of task you should perform to perform a detail Exadata Health check

  1. Review Exachk report to evaluate Exadata configuration, MAA Best practices, and database critical issues.
  2. Review various types of Exadata logs including Exawatcher, alert, trace, CRS, ASM, listener.
  3. Review Flash cache contents, verify smart flash log feature and check write-back cache functionality.
  4. Review Exadata feature usage like HCC Compression, Smart Scan, offloading, Storage Indexes
  5. Review Maximum Availability Architecture including backup of critical configuration files
  6. Review and validate Oracle Enterprise Manager Configuration of Exadata plugin.
  7. Review resource utilization at storage & database level and provide recommendations.
  8. Review AWR reports for contention and slow running processes.
  9. Review database parameter settings as per Oracle best practices including hidden parameters.
  10. Review log retention policy to optimize storage utilization and maintain historical data for troubleshooting any future issues.