5.2 CompTIA A+ · Core 1 (220-1201) · Domain 5 — Hardware & Network Troubleshooting

Troubleshooting Drive
and RAID Issues

Objective 5.2 Domain weight: 28% 13 testable symptoms

OVERVIEWIntroduction to Storage Troubleshooting

Storage drives — hard disk drives (HDDs), solid-state drives (SSDs), and NVMe drives — are among the most failure-prone components in a computer. Unlike CPU or RAM failures which can sometimes be resolved by reseating or replacing a module, a drive failure often means permanent, unrecoverable data loss. This makes storage troubleshooting one of the highest-stakes disciplines in IT support.

Objective 5.2 requires you to recognize thirteen distinct symptoms of drive and RAID failure, understand their causes, know how to diagnose them, and understand the appropriate response — including when to prioritize data recovery over repair. This section covers all thirteen symptoms in depth, along with the foundational knowledge of RAID levels needed to interpret RAID-specific symptoms.

Data First — Always

The cardinal rule of storage troubleshooting: back up or recover data before performing any repair actions. Running tools like chkdsk on a physically failing drive, or attempting a rebuild on a degraded RAID without a backup, can accelerate data loss. Identify the scope of the problem first. Recover data second. Repair third.

FOUNDATIONStorage Technologies Overview

Before diagnosing symptoms, understanding the fundamental differences between storage technologies is essential — because many symptoms are technology-specific.

HDD (Hard Disk Drive) Mechanical device with spinning platters and a moving read/write head on an actuator arm. Data is read magnetically. Susceptible to physical shock, vibration, and mechanical wear. Produces audible noise during normal operation. Fails mechanically (head crash, motor failure) or magnetically (bad sectors).

SATA SSD No moving parts. Stores data in NAND flash memory cells. Silent, faster than HDD, more shock-resistant. Fails electronically — the controller chip, NAND cells, or onboard cache can fail. Does not produce warning sounds. Often fails suddenly without the gradual degradation pattern of HDDs.

NVMe SSD (M.2 / PCIe) Same NAND flash technology as SATA SSD but connected via PCIe lanes directly to the CPU, bypassing the SATA controller. Dramatically faster than SATA SSD. Failure modes are similar to SATA SSD: electronic/controller failure, NAND wear, or heat-related failure (NVMe drives can overheat in systems without proper airflow).

RAID Array Multiple physical drives combined by a RAID controller (hardware or software) to appear as one logical volume. Provides redundancy, performance, or both depending on RAID level. A RAID is not a backup — it protects against drive failure, not accidental deletion, ransomware, or controller failure.

Exam Focus — HDD vs SSD Symptoms

Many symptoms in this objective (grinding, clicking, audible alarms) are HDD-specific. SSDs do not produce mechanical sounds. When the exam describes audible symptoms from a drive, the drive in question is always an HDD. Silent failures (missing drive, S.M.A.R.T. errors, data corruption) can occur with either technology.

FOUNDATIONRAID Levels — Deep Reference

Several symptoms in this objective (RAID failure, array missing, audible alarms) require a solid understanding of how each RAID level works, how many drive failures it can survive, and what happens when it fails. This section provides a complete reference.

RAID Levels Compared

Level	Min Drives	Fault Tolerance	Usable Capacity	Performance	How it works
RAID 0	2	None — 0 drive failures	100% of all drives	Fastest reads/writes	Striping only. Data split across all drives. Zero redundancy. One drive fails = all data lost.
RAID 1	2	1 drive failure	50% (size of one drive)	Read improved; writes same	Full mirror. Every write goes to both drives simultaneously. Either drive can serve reads.
RAID 5	3	1 drive failure	N-1 drives	Good read; write penalty	Striping with distributed parity. Parity rotates across all drives. One failed drive can be rebuilt from parity + remaining data.
RAID 6	4	2 drive failures	N-2 drives	Good read; larger write penalty	Striping with dual distributed parity. Two separate parity sets. Can survive any two simultaneous drive failures.
RAID 10	4	1 per mirrored pair	50%	Best overall	Stripe of mirrors (RAID 1+0). Data striped across mirrored pairs. Very high fault tolerance if failures are in different pairs.

RAID Visualized

Data Layout by RAID Level

RAID 0

← Striped. Fast. Zero fault tolerance.

RAID 1

DATA

← Exact mirror. Either drive can fail.

RAID 5

← Parity rotates. 1 drive can fail.

RAID 6

← Dual parity. 2 drives can fail.

RAID 10

← Mirrored pairs, then striped.

FAILED

DEAD

← RAID 5 degraded — still running, but vulnerable.

RAID States

A RAID array can exist in several states. Understanding these states is essential for interpreting RAID-related symptoms on the exam.

Optimal / Normal All drives present and healthy. The array is operating at full performance and full redundancy.

Degraded One (or in RAID 6, two) drives have failed. The array is still functional and accessible, but redundancy is reduced or gone. Performance is degraded. A second failure at this point (in RAID 5) would cause total data loss. This state requires immediate attention.

Rebuilding A replacement drive has been inserted and the controller is regenerating the missing data from parity or mirror data. The array is accessible but performance is severely degraded during rebuilding. Rebuilds can take hours to days depending on drive size.

Failed / Offline Too many drives have failed for the RAID level to tolerate. The array is offline and data is inaccessible without recovery tools. For RAID 5, this means 2+ drive failures; for RAID 0, any single failure.

Critical Concept — RAID Is Not a Backup

RAID protects against drive hardware failure only. It does not protect against: accidental file deletion (deleted from one mirror = deleted from both), ransomware encryption (encrypts all drives in the array simultaneously), controller failure (if the RAID controller fails, all drives may be unreadable by a different controller), fire/flood/theft (all drives physically co-located), or software corruption. A proper backup strategy follows the 3-2-1 rule: 3 copies, on 2 different media types, with 1 offsite or offline.

SYMPTOM 01LED Status Indicators

LED status indicators on drives, RAID controllers, and server hardware provide a real-time visual representation of drive health. Understanding what each color and pattern means is essential for quickly assessing a situation without using software tools — particularly in server environments where drives are hot-swappable.

Common LED Patterns

LED Color / Pattern	Meaning	Action Required
Solid green	Drive healthy and operational	None — normal
Blinking green	Drive activity (reads/writes occurring)	None — normal activity
Solid amber / yellow	Drive has failed or is in a fault state	Replace the drive immediately
Blinking amber	Drive is predictively failing (S.M.A.R.T. warning) or array is rebuilding	Back up data; prepare replacement
Blinking green + amber (alternating)	RAID rebuild in progress on this drive	Do not remove the drive; wait for rebuild
No LED / dark	Drive not detected, not seated, or no power	Reseat drive; check power and data cables
Red	Drive failed (some systems use red instead of amber)	Replace the drive

Context Note

LED behavior is not fully standardized across manufacturers. Dell, HP, and Lenovo server hardware all use slightly different color schemes. The exam tests the general concept — amber/yellow = fault/warning, green = healthy — rather than any specific vendor's implementation. Always consult the system documentation for precise meanings.

Drive Activity vs. Drive Health LEDs

Many drives and systems have two separate LEDs: one for activity (read/write operations — expected to blink frequently) and one for health status. A common mistake is mistaking a solid health LED for a drive that is "on" but not being used. Distinguish between them: sustained lack of any activity LED during a period when the drive should be active (like during a system boot or large file copy) can indicate a problem.

SYMPTOM 02Grinding Noises

A grinding noise from a computer's storage system is one of the most urgent symptoms a technician can encounter. Unlike the normal operational sounds of an HDD (quiet spinning, occasional soft seek sounds), grinding indicates mechanical destruction occurring inside the drive in real time.

What Causes Grinding

Inside an HDD, the read/write heads float nanometers above the spinning platters on a cushion of air — they never physically touch during normal operation. This is called the head gap. When the head crashes — meaning it physically contacts the platter surface — the result is catastrophic. The head scrapes the magnetic coating off the platter, destroying both the head and the data.

A head crash produces a distinctive grinding or scraping sound. Other causes of grinding include:

Failed spindle motor bearing — the motor that spins the platter develops a worn bearing, causing metal-on-metal grinding as it rotates
Debris inside the drive — contamination from a dropped drive or failed head coating enters the air gap
Actuator arm failure — the arm that positions the read/write head becomes stuck or damaged

Immediate Response Protocol

Power off the system immediately. Every additional second the drive spins causes more physical damage and more data loss. Do not attempt to run diagnostics on a grinding drive. Do not attempt chkdsk. Contact a data recovery specialist if the data is critical — physical recovery requires a cleanroom environment and is expensive. If the drive is already backed up, simply replace it.

It bears repeating that SSDs and NVMe drives cannot grind — they have no moving parts. A grinding sound is unambiguously an HDD symptom. If a system appears to make a grinding sound but contains only SSDs, investigate fans and other mechanical components.

SYMPTOM 03Clicking Sounds

Clicking sounds from an HDD are a well-known warning sign and should be treated with the same urgency as grinding. The most infamous pattern — a repeated clicking with brief pauses — is colloquially known as the click of death.

Causes of HDD Clicking

Click of death The read/write head attempts to park (return to the landing zone), fails to read the servo positioning data, and resets repeatedly. Each reset cycle produces a click. This is a sign that the head assembly or the positioning servo data on the drive is damaged. Data recovery is often still possible at this stage if acted upon quickly.

Actuator seeking Normal HDDs produce soft clicking sounds as the actuator arm moves the head between tracks during heavy read/write activity. This is normal. The click of death is distinguished by its regularity — it repeats at consistent intervals (typically 2–5 seconds) even when no data access is occurring.

PCB failure A failed circuit board on the bottom of the drive can prevent it from properly initializing, causing the head to repeatedly attempt and fail to seek — producing clicking.

Insufficient power Underpowered HDDs (insufficient +12V from the PSU) may click as they attempt to spin up but fail to reach full rotational speed. This can occur when a PSU is overloaded or failing.

Response — Clicking HDD

Stop using the drive immediately. Any read/write activity on a clicking drive risks accelerating physical damage.

Attempt a data backup if the drive is still accessible. Use imaging tools like ddrescue (Linux) or Recuva to copy data off the drive. Work quickly.

Check power connections. If clicking began after adding new drives or a hardware change, verify the PSU can supply adequate power. Try a different SATA power connector.

Do not freeze the drive. The "freeze the drive" trick is a myth — it can cause condensation inside the drive and cause more damage. Do not attempt this.

If data is critical and drive is inaccessible, engage a professional data recovery service. Services like Ontrack or DriveSavers have cleanroom facilities for physical recovery.

SYMPTOM 04Bootable Device Not Found

When a system powers on and the firmware cannot locate a valid bootable device, it displays an error message before handing off to the OS. Common messages include: "No boot device found", "Boot device not found", "Reboot and select proper boot device", "NTLDR is missing", or "Operating system not found."

This symptom has a wide range of causes — from a simple misconfigured boot order to a completely failed drive — and must be diagnosed methodically.

Cause Hierarchy — Most to Least Likely

Cause	Why It Happens	How to Verify
Wrong boot device in BIOS	Boot order set to USB or optical drive first; empty drive is selected	Enter BIOS/UEFI; verify boot order lists the OS drive first
Drive not detected by BIOS	Loose cable, failed drive, or failed SATA port	Check BIOS storage screen — is the drive listed?
Corrupt bootloader	Windows boot files (BCD/MBR) are damaged; drive is healthy but unbootable	Drive visible in BIOS; boot into WinRE and run Startup Repair or `bootrec`
MBR/GPT corruption	Master Boot Record or GUID Partition Table is damaged	Boot from USB; use `diskpart` to inspect partition table
Drive hardware failure	Drive has failed and cannot be read at all	Drive absent from BIOS; no activity LED; listen for sounds
Secure Boot conflict	Non-signed bootloader blocked by UEFI Secure Boot	Disable Secure Boot in UEFI temporarily and test
SATA mode mismatch	Drive configured as AHCI in BIOS but OS installed in IDE mode (or vice versa)	BIOS SATA controller mode setting

Windows Boot Repair Tools

# Boot from Windows installation USB → Repair your computer → Command Prompt bootrec /fixmbr # Rewrites the Master Boot Record bootrec /fixboot # Rewrites the boot sector of the active partition bootrec /rebuildbcd # Scans all drives and rebuilds the Boot Configuration Data bootrec /scanos # Scans for Windows installations not in the BCD # If the above fail, try Startup Repair from WinRE (automated) # Or use diskpart to inspect partition table: diskpart DISKPART> list disk DISKPART> select disk 0 DISKPART> list partition

Exam Focus

The exam frequently presents "bootable device not found" as the result of either a misconfigured BIOS boot order (most common, simplest fix) or a corrupt bootloader. The first step is always to check the BIOS boot order. If the boot order is correct and the drive is visible in BIOS, the next step is Startup Repair from Windows Recovery Environment (WinRE).

SYMPTOM 05Data Loss / Corruption

Data loss refers to files that disappear entirely. Data corruption refers to files that are present but contain incorrect, unreadable, or garbled content. Both symptoms can originate from the same underlying causes — the difference is simply whether the file system still has a reference to the data (corruption) or has lost it entirely (loss).

Causes of Data Loss and Corruption

Physical / Hardware Causes

Bad sectors (HDD) — physical areas of the platter that can no longer reliably store data
NAND cell wear (SSD) — flash cells that have exceeded their write endurance
Sudden power loss — write operations interrupted mid-way leave files in partial state
Head crash — physical destruction of platter surface and data
Controller failure — SSD/NVMe controller fails, making data inaccessible

Logical / Software Causes

File system corruption — MFT (NTFS Master File Table) or FAT damaged
Improper ejection — removing a USB drive mid-write corrupts the file system
Malware / ransomware — deliberately corrupts or encrypts files
Accidental deletion — user error; recoverable with tools if not overwritten
Partition table corruption — entire partitions become inaccessible

Diagnosing and Recovering Data

Data Loss / Corruption Response

Stop writing to the drive. Every new write operation overwrites sectors that may contain recoverable deleted data. If data recovery is needed, do not install software, save files, or even boot from the affected drive.

Run CHKDSK for logical errors. chkdsk C: /f /r — the /f flag fixes file system errors; /r locates bad sectors and attempts to recover readable data. Only use on a drive that is not making physical failure sounds.

Check S.M.A.R.T. data. Use CrystalDiskInfo to read the drive's health. Reallocated sectors, pending sectors, or uncorrectable errors indicate physical damage.

Run SFC for OS file corruption. sfc /scannow checks and repairs corrupted Windows system files — distinct from drive-level corruption.

Use data recovery software for deleted files. Tools like Recuva (Windows) or PhotoRec (cross-platform) can recover recently deleted files if the sectors have not been overwritten.

Restore from backup. If the data cannot be recovered and a backup exists, restore from the most recent clean backup. Document what was lost between the last backup and the failure event.

SYMPTOM 06RAID Failure

RAID failure occurs when enough drives in an array have failed to exceed the array's fault tolerance. The result depends on the RAID level and the number of failed drives. A RAID failure can mean the array is degraded (still running, but at risk) or completely offline (data inaccessible).

Failure Thresholds by RAID Level

RAID Level	Degraded at	Failed (data inaccessible) at	Behavior during degraded state
RAID 0	N/A — no redundancy	Any 1 drive failure	Immediately offline; all data lost
RAID 1	1 drive failure	Both drives fail	Runs on surviving drive; can rebuild to new drive
RAID 5	1 drive failure	2+ drive failures	Continues running; second failure = total data loss
RAID 6	1–2 drive failures	3+ drive failures	Continues running through two failures; second failure still degraded
RAID 10	1 drive per pair	Both drives in same pair fail	Continues if failures are in different pairs; fails if both drives in one pair fail

RAID Rebuild Process

When a failed drive in a redundant RAID is replaced, the controller initiates a rebuild — the process of regenerating the missing data from the remaining drives' parity or mirror data onto the new drive. This process is critical to understand for the exam.

The array is accessible during rebuild but performance is severely degraded — all drives are under heavy read load
Rebuild time scales with drive size — a 4TB RAID 5 rebuild can take 24+ hours
RAID 5 is vulnerable during rebuild — if a second drive fails during the rebuild of the first, all data is lost. This is called a double-fault scenario and is a well-known risk, especially in large-capacity arrays
Do not interrupt a rebuild — removing a drive during rebuild can corrupt the array
Back up before rebuilding whenever possible — if a rebuild fails, the array can be left in an unrecoverable state

Exam Focus

The exam will test whether you understand what happens when specific drives in specific RAID levels fail. Key facts: RAID 0 has no fault tolerance — any single failure destroys all data. RAID 5 needs exactly 3+ drives and can only tolerate 1 failure. A second failure during rebuild is a common scenario question. RAID 10 can tolerate multiple failures as long as they are not both from the same mirrored pair.

SYMPTOM 07S.M.A.R.T. Failure

S.M.A.R.T. (Self-Monitoring, Analysis, and Reporting Technology) is a monitoring system built into most modern HDDs, SSDs, and NVMe drives. The drive's firmware continuously monitors dozens of internal health parameters and stores their values. These values can be read by the OS or diagnostic tools to predict and detect drive failures before they become catastrophic.

How S.M.A.R.T. Works

Each S.M.A.R.T. attribute has a current value, a worst-ever value, a threshold, and a raw value. When the current value falls below the threshold for a critical attribute, the drive reports a S.M.A.R.T. failure — a prediction that the drive is likely to fail soon. The system BIOS may display a warning at boot: "S.M.A.R.T. Failure Predicted on Hard Disk. Immediately back up your data and replace the hard disk drive."

Critical S.M.A.R.T. Attributes

Attribute ID	Name	What It Measures	Significance
0x05 (5)	Reallocated Sectors Count	Number of bad sectors that have been remapped to spare sectors	Critical — any non-zero value indicates physical platter damage; rising count = imminent failure
0xC5 (197)	Current Pending Sectors	Sectors flagged as unstable, waiting to be remapped	Critical — these sectors cannot be reliably read right now; data loss may occur on next read
0xC6 (198)	Uncorrectable Sector Count	Sectors that could not be read after multiple attempts	Critical — data in these sectors is already lost; drive replacement urgent
0x01 (1)	Read Error Rate	Frequency of errors when reading data from the platter	Important — rising rate indicates head or platter degradation
0xC2 (194)	Temperature	Current drive operating temperature in Celsius	Above 55°C consistently = thermal stress reducing drive lifespan
0x09 (9)	Power-On Hours	Total hours the drive has been powered on	Context — drives beyond 40,000–50,000 hours are aging; factor in with other attributes
0xF1 (241)	Total LBAs Written (SSD)	Total data written to the SSD over its lifetime	SSD-specific — compare to manufacturer's endurance rating (TBW)

Reading S.M.A.R.T. Data

# Windows — CrystalDiskInfo (GUI tool, most user-friendly) # Shows color-coded health: Good (blue), Caution (yellow), Bad (red) # Windows Command Line wmic diskdrive get status # Returns: OK, Pred Fail, Error, Unknown # Linux sudo smartctl -a /dev/sda # Full S.M.A.R.T. report sudo smartctl -H /dev/sda # Health status only # Example healthy output SMART overall-health self-assessment test result: PASSED # Example failing output SMART overall-health self-assessment test result: FAILED! Drive failure expected in less than 24 hours. SAVE ALL DATA.

Exam Focus

A S.M.A.R.T. failure message at POST is a prediction, not a current failure — the drive is still working but is forecasted to fail. The correct response is: immediately back up all data, then replace the drive. Do not ignore the warning. Do not simply run CHKDSK. Back up data first, then replace the drive.

SYMPTOM 08Extended Read/Write Times

When read or write operations that previously completed quickly begin taking significantly longer, the drive is experiencing one of several conditions — all of which warrant investigation. Extended read/write times are one of the earliest and most actionable warning signs of impending drive failure.

Causes by Drive Type

HDD — bad sectors When the drive encounters a bad sector, the firmware makes multiple read attempts before marking the sector as unreadable. Each retry cycle adds seconds to the read time. A file that crosses multiple bad sectors can take minutes to read instead of milliseconds. This causes "application hang" symptoms where a program appears frozen while waiting for the drive.

HDD — fragmentation Severe file system fragmentation forces the read/write head to physically seek across many different areas of the platter to read one logical file, increasing access time. Less common as a cause of suddenly extended times (fragmentation develops gradually), but relevant for HDDs that have never been defragmented.

SSD — NAND wear As SSD NAND cells reach the end of their write endurance, error correction overhead increases. The controller performs more ECC (error-correcting code) operations per read, slowing throughput. Sustained writes may also slow as the drive runs out of free cells and must perform garbage collection.

NVMe — thermal throttling NVMe drives in systems without adequate airflow (particularly thin laptops) can overheat and thermally throttle, dropping from 3,000+ MB/s to under 500 MB/s. This manifests as sudden dramatic slowdowns during sustained writes.

RAID rebuild overhead During a RAID rebuild, all drives in the array are under heavy read load simultaneously. Applications that access the array during a rebuild will experience significantly extended read/write times.

Diagnosing Extended Read/Write Times

Run a S.M.A.R.T. check. Elevated reallocated or pending sectors confirm bad sectors as the cause of slow reads on an HDD.

Run CHKDSK. chkdsk C: /r identifies and attempts to recover bad sectors. Note: this will be slow if many bad sectors exist — that slowness itself is diagnostic.

Run a drive benchmark. Tools like CrystalDiskMark compare current sequential and random read/write speeds against the drive's rated specification. A drive operating at 10% of its rated speed is clearly degraded.

Check drive temperature. For NVMe drives specifically, check temperature under sustained load. Consistent temperatures above 70°C indicate thermal throttling.

Check RAID status. If the storage is on a RAID, verify the array is not in a degraded or rebuilding state — both conditions cause dramatically extended access times.

SYMPTOM 09Low Performance IOPS

IOPS (Input/Output Operations Per Second) measures how many individual read or write operations a storage device can perform per second. It is the most important performance metric for workloads involving many small, random accesses — such as databases, virtual machines, and operating system boot drives.

Typical IOPS by Drive Type

Drive Type	Sequential Read	Typical IOPS (4K Random)	Notes
HDD (7200 RPM)	~150 MB/s	~100–200 IOPS	Mechanical seek time is the bottleneck for random access
SATA SSD	~550 MB/s	~50,000–100,000 IOPS	Flash memory eliminates seek time; dramatically faster for random I/O
NVMe SSD (Gen 3)	~3,500 MB/s	~300,000–500,000 IOPS	PCIe interface removes SATA controller bottleneck
NVMe SSD (Gen 4)	~7,000 MB/s	~800,000–1,000,000 IOPS	Current consumer flagship performance tier

Causes of Low IOPS

An HDD performing at 50 IOPS instead of its rated 150 IOPS may indicate mechanical degradation — bad sectors causing retries, or head positioning issues. An SSD at 10,000 IOPS instead of 90,000 IOPS suggests severe NAND wear, a failing controller, or thermal throttling.

In RAID configurations, low IOPS can indicate a degraded array (operating without parity protection requires extra read operations for RAID 5/6 reconstruction), or a drive in the array that is underperforming and dragging down the entire array's throughput.

Practical Note

IOPS is typically measured with tools like CrystalDiskMark (Windows) or fio (Linux). On the exam, you will not be asked to calculate IOPS — you will be expected to understand that low IOPS is a symptom of drive degradation, and that HDDs have dramatically lower IOPS than SSDs due to mechanical seek time.

SYMPTOM 10Missing Drives in OS

A drive that was previously accessible in the operating system has disappeared — it no longer shows up in File Explorer, Disk Management, or Device Manager. The drive may still be physically connected and even visible in BIOS. This symptom requires a structured diagnostic approach because the failure point can be anywhere from a loose cable to a failed file system to a completely dead drive.

Diagnostic Decision Tree

Question	Yes	No
Is the drive visible in BIOS/UEFI?	Drive hardware is alive — problem is logical or driver	Drive is completely failed, or cable/power issue
Is the drive visible in Disk Management?	Likely unformatted, wrong partition, or no drive letter assigned	Driver issue or OS-level detection failure
Does it show as "Unallocated" in Disk Management?	Partition table deleted/corrupt — use recovery software	Check for offline status; right-click → Online
Is the drive marked "Offline" in Disk Management?	Right-click → Online; may require administrator permission	Check for missing drive letter — assign one
Does S.M.A.R.T. show errors?	Physical drive failure — replace and restore from backup	Logical issue — attempt file system repair

Resolving Missing Drive in OS

Check BIOS first. Enter BIOS/UEFI and look at the storage device list. If the drive appears here, the hardware is communicating and the problem is in the OS layer.

Open Disk Management (diskmgmt.msc). Drives that are present but not assigned a letter, or marked as offline, will appear here but not in File Explorer. Right-click the drive → Online, then assign a drive letter.

Check Device Manager for errors. A yellow exclamation mark on a storage controller or disk indicates a driver problem. Update or reinstall the driver.

Check physical connections. If the drive is absent from BIOS, reseat the SATA/power cable, try a different SATA port and cable, and try a different power connector from the PSU.

Test the drive in another system. Connect the drive externally via USB-to-SATA adapter or in another machine to determine if the drive itself is faulty or the original system's port/controller is the problem.

SYMPTOM 11Array Missing

An array missing condition occurs when the RAID controller cannot locate or reconstruct the RAID array. The individual physical drives may be present, but the logical RAID volume they form is not visible to the operating system. This is a more severe condition than a single missing drive because it affects all data on the entire array.

Causes of a Missing Array

Controller failure The RAID controller (hardware or software) that manages the array has failed or been replaced. Even if the drives are intact, a different controller model may not be able to read the array's metadata configuration. Hardware RAID configurations are particularly susceptible — the array metadata is stored on the controller, not the drives.

Multiple drive failures The array has lost more drives than its fault tolerance allows. RAID 5 with 2 failed drives, or RAID 0 with any failed drive, will present as an array missing condition because the data cannot be reconstructed.

Array configuration deleted Someone accidentally deleted the RAID configuration through the RAID management utility or BIOS. The drives still contain the data stripes but the controller no longer knows how to assemble them. This is a logical, not physical, failure — data may be recoverable.

Drive inserted out of order In some RAID configurations, replacing drives in the wrong slots or booting with drives in a different order than expected can cause the controller to fail to recognize the array.

Corrupted RAID metadata The metadata sectors on each drive that describe the RAID configuration (stripe size, drive order, RAID level) have been corrupted. RAID recovery software can sometimes reconstruct this.

Do Not Initialize

When Windows Disk Management detects disks from a missing array, it may prompt you to "Initialize Disk." Do not do this. Initializing the disk will write a new partition table, overwriting the RAID metadata and potentially making data recovery impossible. Click Cancel and use RAID recovery software instead.

Recovery Path for Missing Array

Missing Array — Response Sequence

Do not write to any drive in the array. Treat all drives as read-only until recovery is complete or confirmed impossible.

Verify all drives are physically present and responding. Check BIOS/RAID BIOS utility to confirm each drive is detected. Re-seat any drives not showing up.

Check the RAID controller utility. Access the RAID management interface (during POST, or via OS utility) to see the array status. It may show the array as "offline" rather than truly missing — bringing it online may be simple.

Attempt array import. Many RAID controllers allow importing a foreign configuration — if drives were moved from another controller of the same type, the array metadata can often be imported.

Use RAID recovery software. Tools like R-Studio or UFS Explorer can scan drives, detect RAID parameters, and reconstruct the virtual array for data extraction without modifying the original drives.

SYMPTOM 12Audible Alarms

Audible alarms in the context of drive and RAID issues refer to electronic beeping sounds generated by RAID controllers, server management hardware, or UPS (Uninterruptible Power Supply) units — not the mechanical sounds from HDD heads described in Symptoms 02 and 03. These are intentional alert sounds produced by system management electronics to notify an administrator of a fault condition.

Sources and Meanings

Source	Alarm Pattern	Meaning
RAID controller	Continuous or repeating beep	Drive failure detected in array; array may be degraded or failed
Server IPMI / BMC	Intermittent beeping + LED	Hardware fault detected; check management console for specific error code
NAS device	Beeping + front panel indicator	Drive failure, RAID degraded, or network connectivity loss
UPS (APC, Eaton)	Continuous beep	Running on battery power (power outage); battery low
UPS	4 beeps repeating	Battery near end of life — replace battery
Storage enclosure	Alarm + amber LED	Drive slot has a failed drive; hot-swap replacement required

Response to RAID Audible Alarms

In a server or NAS environment, the alarm is designed to be heard and acted upon immediately. The typical response sequence is:

Audible Alarm Response — RAID Context

Identify the source. Is the alarm from the server itself, a separate UPS, or a NAS enclosure? Each requires a different response.

Check LED indicators. The amber/red LED on the failing drive bay will identify which specific drive has failed. Do not guess — verify by LED before pulling any drive.

Access the RAID management console. The RAID controller's management software (e.g., HP SSA, Dell OMSA, LSI MegaRAID Storage Manager) shows the array status, which drive failed, and the current rebuild status.

Silence the alarm if needed. Most RAID utilities allow silencing the audible alarm while leaving the visual alert active. This is appropriate in environments where the alarm is disruptive but the fault is already being addressed.

Replace the failed drive. If the array supports hot-swap, replace the failed drive while the system is running. Insert the replacement drive; the controller will automatically begin rebuilding.

Monitor the rebuild. Watch the rebuild progress. Do not power off the system during rebuild unless absolutely necessary. Verify the array returns to optimal status after rebuild completes.

SYMPTOM 13S.M.A.R.T. in RAID Environments

One subtle but important topic is how S.M.A.R.T. interacts with RAID. S.M.A.R.T. is a per-drive metric — it monitors each individual physical drive's health independently. RAID controllers vary in how they expose or suppress this data.

Hardware RAID controllers often abstract the physical drives away from the OS. The OS sees only the logical RAID volume, not the individual drives. S.M.A.R.T. data from individual drives may only be accessible through the RAID controller's management software — not through standard OS tools like CrystalDiskInfo.
Software RAID (Windows Storage Spaces, Linux mdadm) leaves the individual drives visible to the OS, so S.M.A.R.T. data is accessible normally.
Proactive monitoring matters — in a RAID 5 array, one drive may be accumulating S.M.A.R.T. errors silently while the array still appears healthy. If that drive fails and a second drive fails during the subsequent rebuild, all data is lost. Regular S.M.A.R.T. monitoring on individual RAID drives is critical preventive maintenance.

Master Reference — All 13 Symptoms

LED status indicatorsAmber/yellow = fault; green = healthy; blinking = activity or rebuild

Grinding noisesHDD head crash or motor bearing failure — power off immediately

Clicking soundsHDD click of death — head positioning failure; back up now

Bootable device not foundCheck BIOS boot order first; then bootloader; then drive hardware

Data loss / corruptionBad sectors, sudden power loss, file system error, or malware

RAID failureExceeded fault tolerance — degrade vs. failed; check RAID level

S.M.A.R.T. failurePredictive warning — back up immediately, then replace drive

Extended read/write timesBad sectors (HDD), NAND wear (SSD), or NVMe thermal throttling

Low IOPSDrive degradation; HDDs far lower than SSDs by design

Missing drives in OSCheck BIOS → Disk Management → cables → test in another system

Array missingController failure, config deleted, or too many drives failed

Audible alarmsElectronic alert from RAID controller or NAS — check LED, access mgmt console

REFERENCEDiagnostic Tools for Drive Troubleshooting

Windows Tools

CrystalDiskInfo — S.M.A.R.T. health reader; color-coded status
CrystalDiskMark — drive speed benchmark; measures IOPS
Disk Management (diskmgmt.msc) — partition and volume management
chkdsk /f /r — file system repair and bad sector scan
diskpart — partition table inspection and repair
bootrec /rebuildbcd — boot record repair
Recuva — deleted file recovery
Windows Memory Diagnostic — rules out RAM as cause

Bootable / External Tools

GParted Live — partition editor; works on non-booting systems
TestDisk — partition table and boot sector recovery
PhotoRec — file carving recovery (works even without partition table)
ddrescue (Linux) — forensic imaging of failing drives
WinRE / Windows Recovery — Startup Repair, bootrec access
Manufacturer tools — SeaTools (Seagate), Western Digital Diagnostics, Samsung Magician
R-Studio / UFS Explorer — RAID recovery and advanced data recovery

Final Exam Reminders

Grinding or clicking = HDD mechanical failure. Power off. Back up. Replace.

S.M.A.R.T. failure at POST = prediction, not current failure. Back up immediately, then replace drive.

Bootable device not found = check BIOS boot order first. Then bootloader. Then cables.

RAID 0 = any single failure = all data lost. Zero redundancy.

RAID 5 degraded = still running but one more failure = total loss. Replace failed drive urgently.

Array missing prompt to initialize = do NOT initialize. Use recovery software.

Audible alarm from RAID = check LED on drive bay, access RAID management console, hot-swap failed drive.

RAID is not a backup = it protects against hardware failure only, not deletion, ransomware, or controller failure.

Troubleshooting Driveand RAID Issues

OVERVIEWIntroduction to Storage Troubleshooting

FOUNDATIONStorage Technologies Overview

FOUNDATIONRAID Levels — Deep Reference

RAID Levels Compared

RAID Visualized

Data Layout by RAID Level

RAID States

SYMPTOM 01LED Status Indicators

Common LED Patterns

Drive Activity vs. Drive Health LEDs

SYMPTOM 02Grinding Noises

What Causes Grinding

SYMPTOM 03Clicking Sounds

Causes of HDD Clicking

SYMPTOM 04Bootable Device Not Found

Cause Hierarchy — Most to Least Likely

Windows Boot Repair Tools

SYMPTOM 05Data Loss / Corruption

Causes of Data Loss and Corruption

Physical / Hardware Causes

Logical / Software Causes

Diagnosing and Recovering Data

SYMPTOM 06RAID Failure

Failure Thresholds by RAID Level

RAID Rebuild Process

SYMPTOM 07S.M.A.R.T. Failure

How S.M.A.R.T. Works

Critical S.M.A.R.T. Attributes

Reading S.M.A.R.T. Data

SYMPTOM 08Extended Read/Write Times

Causes by Drive Type

SYMPTOM 09Low Performance IOPS

Typical IOPS by Drive Type

Causes of Low IOPS

SYMPTOM 10Missing Drives in OS

Diagnostic Decision Tree

SYMPTOM 11Array Missing

Causes of a Missing Array

Recovery Path for Missing Array

SYMPTOM 12Audible Alarms

Sources and Meanings

Response to RAID Audible Alarms

SYMPTOM 13S.M.A.R.T. in RAID Environments

Master Reference — All 13 Symptoms

REFERENCEDiagnostic Tools for Drive Troubleshooting

Windows Tools

Bootable / External Tools

Troubleshooting Drive
and RAID Issues