Advice about monitoring disk health

From DISI
Revision as of 18:17, 13 March 2025 by Frodo

We run ZFS scrubs regularly: every 3 months or more often.

We also have the following advice courtesy of Andrew Davis via Wynton.

More important than ZFS scrubs is monitoring SMART. For SATA drives, that means watching Reallocated_Sector_Ct, Current_Pending_Sector, and Offline_Uncorrectable. If the raw value of any of those goes above 0, replace the drive.
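The SATA check above could be automated along these lines. This is a minimal Python sketch, not something the original advice prescribes: it assumes the usual column layout of `smartctl -A` on ATA drives (raw value in the tenth column), and the names `WATCHED`, `failing_attributes` and `check_drive` are made up for illustration.

```python
import subprocess

# The three attributes named in the advice above.
WATCHED = ("Reallocated_Sector_Ct", "Current_Pending_Sector", "Offline_Uncorrectable")


def failing_attributes(smartctl_output):
    """Return {attribute_name: raw_value} for watched attributes with raw value above 0."""
    bad = {}
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Assumed row layout: ID, name, flag, value, worst, thresh,
        # type, updated, when_failed, raw_value
        if len(fields) >= 10 and fields[1] in WATCHED:
            try:
                raw = int(fields[9])
            except ValueError:
                continue  # raw value not a plain integer; skip rather than guess
            if raw > 0:
                bad[fields[1]] = raw
    return bad


def check_drive(device):
    """Run `smartctl -A` on one device and return its failing watched attributes."""
    # smartctl reports conditions through its exit status bits, so a nonzero
    # exit code is not treated as an error here.
    result = subprocess.run(["smartctl", "-A", device], capture_output=True, text=True)
    return failing_attributes(result.stdout)
```

A non-empty result from `check_drive` for a given device (the device path is whatever your system uses) would mean the drive meets the "replace it" criterion above.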

SAS is a bit more challenging, since SAS drives don't expose those raw values. For those we run routine SMART self-tests, a short one weekly and a long one monthly:

  • short (weekly)
      foreach file (/dev/da? /dev/da?? /dev/da???)
          smartctl -t short $file
      end
  • long (monthly)
      foreach file (/dev/da? /dev/da?? /dev/da???)
          smartctl -t long $file
      end
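
For hosts where csh is not the scripting language of choice, the same loops could be written as a small Python sketch. It is only an illustration under stated assumptions: the helper names `whole_disks` and `start_self_tests` are invented here, and the sketch deliberately keeps only whole-disk nodes such as da0 or da12, whereas a glob like /dev/da??? can also match partition nodes such as da0p1. Scheduling (weekly for short, monthly for long) is left to cron or a similar tool.

```python
import os
import re
import subprocess

# Whole-disk device names only: "da" followed by digits (da0, da12, ...).
DA_DISK = re.compile(r"da\d+")


def whole_disks(names):
    """Keep only whole-disk da device names, sorted by unit number."""
    disks = [name for name in names if DA_DISK.fullmatch(name)]
    return sorted(disks, key=lambda name: int(name[2:]))


def start_self_tests(test_type, dev_dir="/dev"):
    """Start a SMART self-test ("short" or "long") on every da disk in dev_dir."""
    if test_type not in ("short", "long"):
        raise ValueError("test_type must be 'short' or 'long'")
    for name in whole_disks(os.listdir(dev_dir)):
        device = os.path.join(dev_dir, name)
        # Same command as in the csh loops: smartctl -t <type> <device>
        subprocess.run(["smartctl", "-t", test_type, device])
```

Calling `start_self_tests("short")` mirrors the weekly loop and `start_self_tests("long")` the monthly one. The tests run in the background on the drive; their outcome can be read afterwards with `smartctl -l selftest` on the same device.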