|
|
 |
|
| SMART way to prevent data loss - Part 2. |
Self-Monitoring, Analysis and Reporting Technology systems (SMART) are built in to most modern ATA and SCSI hard disks. SMART disk drives internally monitor their own health and performance. SMART technology features include a set of attributes, which determine reliability-prediction parameters of drive and should not be exceeded under normal operation. Each attribute has an identification number ID. Self-Monitoring, Analysis and Reporting Technology systems (SMART) are built in to most modern ATA and SCSI hard disks. SMART disk drives internally monitor their own health and performance. SMART technology features include a set of attributes, which determine reliability-prediction parameters of drive and should not be exceeded under normal operation. Each attribute has an identification number (ID).
Some types of reliability parameters are:
|
| - Distance between the heads and the disk platters; |
| - Faulty sectors; |
| - Recalibration; |
| - Drive spin-up time; |
| - Drive temperature; |
| - Characteristics of the media; |
| - Motor and servomechanisms. |
Attribute value is a positive integral number, usually in the range from 1 to 253. Initially, all attributes have maximum values. A value of 100 or 200 will often be chosen as the "normal" value. Some attributes are considered life-critical and others are just "informative". In case of hard drive wearing or when some components of the disk are about to fail, attributes indicate decreasing amount of values. Consequently, high values determine high reliability of the drive and low values - low reliability or high possibility of drive failure. Specific threshold is assigned to each attribute. Once the value drops below this threshold, SMART considers the disk to be faulty, which means it becomes very dangerous to store data on this drive.
The following list describes some critical hard drive attributes.
|
| ID |
Attribute name |
Description |
| 01 |
Read Error Rate |
Indicates the rate of hardware read errors that occurred when reading data from a disk surface. Lower values indicate a problem with either disk surface or read/write heads. |
| 05 |
Reallocated Sectors Count |
Count of reallocated sectors. When the hard drive finds a read/write/verification error, it marks this sector as "reallocated" and transfers data to a special reserved area. The more sectors that are reallocated, the more read/write speed will decrease. |
| 11 |
Recalibration Retries |
This attribute indicates the number of times recalibration was requested (under the condition that the first attempt was unsuccessful). A decrease of this attribute value is a sign of problems in the hard disk mechanical subsystem. |
| 194 |
Temperature |
Current internal temperature. |
| 196 |
Reallocation Event Count |
Count of reallocation operations. The raw value of this attribute shows the total number of attempts to transfer data from reallocated sectors to a spare area. |
| 197 |
Current Pending Sector Count |
Number of "unstable" sectors. When unstable sectors are read successfully, the value is decreased. If errors occur when reading a sector, the drive will attempt to recover the data, transfer it to the reserved area and mark the sector as remapped. |
| 198 |
Uncorrectable Sector Count |
The total number of uncorrectable errors when reading/writing a sector. A rise in the value of this attribute indicates defects of the disk surface and/or problems in the mechanical subsystem. |
| 220 |
Disk Shift |
Distance the disk has shifted relative to the spindle (usually due to shock). Unit of measure is unknown. |
Currently, the SMART system can detect about 70% of all hard drive errors. Its main shortcoming is that it doesn't provide a direct mechanism for informing the OS or the user if problems are found. In fact, because disk SMART status is frequently not monitored, many disk problems go undetected until they lead to a catastrophic failure.
Monitoring a drive's behavior, SMART has the purpose of warning a user about the threat of drive collapse while time remains to take preventive action, such as back up the data to a replacement device. So why not use SMART monitor programs freely available on Internet to cut these problems off at the pass?
|
Previous Page
|
|
|
|
|