
Question: test HDD files

Isn't this just fragmentation?


Seriously?
yes

Does 1 year without use cause loss of magnetization, mechanical problems, or oxidation?

Do HDDs also use flash memory to retain firmware and BIOS? Is this a problem since flash memory is not recommended for archiving?

Is it necessary to test annually or every two months to check for any type of corruption?

Is a 2.5" HDD more likely to die when stored or in constant operation?
 
Not for one set of archive zips that are written in one go to an empty drive...
I can't say for sure, but I'd expect ECC action to be transparent, except when it needs to retry reading sectors.
If you see a slowdown in the future, it would be interesting and useful to check the fragmentation there, and compare pre and post the SMART stats for reallocated or pending sectors.
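If you want to script that SMART snapshot, here's a rough sketch with smartmontools - assuming smartctl is installed and the drive shows up as /dev/sda, which you'd adjust for your own setup:

```python
#!/usr/bin/env python3
"""Rough sketch: dump the SMART counters worth watching before/after a long
read pass (reallocated + pending sectors). Assumes smartmontools is installed
and the drive is visible as /dev/sda - adjust the device path for your setup."""

import subprocess

DEVICE = "/dev/sda"                      # hypothetical device path
WATCHED = {"Reallocated_Sector_Ct", "Current_Pending_Sector",
           "Offline_Uncorrectable", "Reported_Uncorrect"}

def smart_snapshot(device: str) -> dict[str, str]:
    """Return {attribute_name: raw_value} for the attributes we care about."""
    out = subprocess.run(["smartctl", "-A", device],
                         capture_output=True, text=True, check=False).stdout
    snapshot = {}
    for line in out.splitlines():
        parts = line.split()
        # ATA attribute rows look like: ID# NAME FLAG VALUE WORST THRESH ... RAW_VALUE
        if len(parts) >= 10 and parts[1] in WATCHED:
            snapshot[parts[1]] = parts[-1]   # raw value is the last column
    return snapshot

if __name__ == "__main__":
    before = smart_snapshot(DEVICE)
    print("Save this, run the full read test, then compare:", before)
```

Run it once before the read test and once after; if Reallocated_Sector_Ct or Current_Pending_Sector moved, the drive is quietly remapping.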

This was the drive needing to take a bit longer to read, and the ECC engine deciding what the proper data output is.
...long-term storage tests on new SSD drives after say 6-12 months
But flash is a different beast than HDDs.
I don't recall seeing anyone doing retention tests. More difficult to test, and requires long term commitment. But that would be interesting indeed.
 
I think it goes together: more reliable = more cycles. Presumably NOR, and maybe SLC rather than MLC.
For example, MX25L6433F might be a common BIOS flash chip (I'm unsure, but I think it's the ballpark): 20 year retention, 100K cycles.

Good to see it's that good - I never realised they had hit such high numbers since the early days. Probably necessary considering the amount of attribute data they need to process / handle and store.
 
@Vincero
I don't think there's much writing going on. I could be wrong, but I assume once per boot, or at worst just every now and then throughout the day.
AFAIK, unlike NAND, NOR can be written to a byte at a time, so there's less "write amplification".
While erasure still happens on a block level, I assume it would be handled intelligently - such as using up all available space and only then erasing.
 
I can't say for sure, but I'd expect ECC action to be transparent, except when it needs to retry reading sectors.
If you see a slowdown in the future, it would be interesting and useful to check the fragmentation there, and compare pre and post the SMART stats for reallocated or pending sectors.


Flash is a different beast.
I don't recall seeing anyone doing retention tests. More difficult to test, and requires long term commitment. But that would be interesting indeed.

That was from a flash drive - with no fragmentation (checked with contig command just to make sure). TLC based.

And yeah, for sure the ECC actions are transparent, but if a retry is needed there is an instant performance hit (obviously more so for a spinning HDD, where the drive electronics may need to wait for the data to come back around again - people forget that modern drives *may* still use interleaving, vs it being set/controlled from the BIOS).

* Corrected with respect to @nageme's comments
 
Right. I thought we're talking about HDDs, which is the thread's topic.
And people were saying stuff like the following, which doesn't seem likely:
the surface of unused HDDs can get demagnetised, leading to data corruption. It's probably enough to power them on every couple of months

SSDs are a different discussion, and I wouldn't use them for archival storage.

people forget that ALL modern drives now use interleaving
As in 1980s floppy style? I don't think so.
Data is stored linearly (except remapped sectors, if areas got physically corrupt over time).
 
Right. I thought we're talking about HDDs, which is the thread's topic.
And people were saying:


SSDs are a different discussion, and I wouldn't use them for archival storage.
That's fair enough - but I wouldn't treat the methodology any differently be it SSD or HDD.

The causes of problems are different but the things to look for are the same, e.g. bit-rot can happen, and performance dips are not a good thing and are a sign of trouble - either way, SMART attributes should also be checked.
I do have an archive 8TB HDD but I literally just updated that so a) am not expecting to see any difference, b) it's slower to go through because HDD, and c) more hassle to pull and deal with.
Just so happen to have an SSD that's been in cold storage for >6 months and to a certain extent it highlights a not ideal scenario.

The issue I've shown can happen (and has happened before for me) with HDDs.... one reason I was not too sad when Samsung exited the HDD business / sold it to Seagate. Although, to their credit, they did RMA the drive (because it reallocated sectors retrieving one file and corrupted it - apart from that one bit the drive was fine so likely a media defect).

I've also seen server-grade (IBM U320) drives lose their contents after years in storage - wasn't an issue (fortunately) as the contents were never needed.
 
Right. I thought we're talking about HDDs, which is the thread's topic.
And people were saying stuff like the following, which doesn't seem likely:


SSDs are a different discussion, and I wouldn't use them for archival storage.
I was talking about HDDs (read my post again).
 
As in 1980s floppy style? I don't think so.
Data is stored linearly (except remapped sectors, if areas got physically corrupt over time).
As in 1980s MFM/RLL style?? Yes, but it's all transparent.
In many cases it's 1:1 (i.e. no interleave), but some drives vary how it's used.
Some datasheets still make reference to it, e.g. Seagate Exos SAS drives (although they only state minimum and not maximum).
I'm making the assumption that if it wasn't needed at all they would state no interleaving or just 1:1, not mention a 'minimum'.

I remember reading that some drives can use different interleaving at different points on the disk to optimise the drive electronics' read speed (as HDDs don't vary rotation speed). I doubt today's drives would operate at anything worse than 1:2.
 
Some datasheets still make reference to it, e.g. Seagate Exos SAS drives (although they only state minimum and not maximum).
I'm making the assumption that if it wasn't needed at all they would state no interleaving or just 1:1, not mention a 'minimum'.
Huh. Curious.
Still, I think there's nothing but 1:1, even in "exotic" Enterprise drives.
Did you see anywhere explicitly mentioning more than 1?
And does "minimum" make sense at all, since how can it be less than 1?

I suspect it's bad wording and a leftover text template from bygone days.
For example, in the Product Manual of a 1996 Seagate 2.1GB SCSI drive, Hawk 2XL:
4.2.3 Generalized performance characteristics
Minimum Sector Interleave (all Hawk 2XL models) 1 to 1

And it also mentions speeds as "MByte/sec divided by (Interleave Factor)".

But even for that old drive it also says:
3.1 Standard features
...
* 1:1 Interleave

I was talking about HDDs (read my post again).
I know.
It wasn't a reply to you, but rather a conversation line with Vincero where I said this thread is about HDDs, and your post was an example.
 
Huh. Curious.
Still, I think there's nothing but 1:1, even in "exotic" Enterprise drives.
Did you see anywhere explicitly mentioning more than 1?
And does "minimum" make sense at all, since how can it be less than 1?

I suspect it's bad wording and a leftover text template from bygone days.
Possibly / Probably... But the ambiguity of the wording leaves the implication that either the drives can implement a varying approach, or it's reserved for extremely data-dense drives and high spin speeds.
The data sheets no longer list this item for each drive model. Or maybe the makers just reserve the right to use it where they want, as part of their bag of tricks to get the most out of a model.

Certainly, I would struggle to imagine interleaving skipping sectors in a reliable way when used with SMR or other potentially overlapping data areas.
 
Does 1 year without use cause loss of magnetization, mechanical problems, or oxidation?

Do HDDs also use flash memory to retain firmware and BIOS? Is this a problem since flash memory is not recommended for archiving?

Is it necessary to test annually or every two months to check for any type of corruption?

Is a 2.5" HDD more likely to die when stored or in constant operation?
 
Bit rot is very real after several years. If you really want to archive important data for a very long time, buy another drive and copy one to the other to write the data fresh, and keep the old one until you overwrite it. I'd do this once a year personally.
I had older HDDs on a shelf (anti-static bag, cardboard box) and a lot of files were corrupted, even on IronWolfs, within 2-3 years.
 
Is it necessary to test annually or every two months to check for any type of corruption?
If you are concerned you could check the integrity of your files by doing a binary compare on each file using a tool such as Beyond Compare with each of your drives.

Depending on how your files are organized, compare from the source, or designate one of your backups as the primary and compare that against each copy. Since they are USB drives, make sure you have enough USB ports on your motherboard to host all the drives you will be comparing at once, to minimize the chance of issues with the USB devices - I found using USB hubs can be problematic with USB drives and a high amount of I/O.

Run multiple compares at the same time to save time. Since you infrequently access the drives, I would say the time investment of letting the compare run for a day or two (depending on how much data you need to validate) would put your mind at ease about the integrity of your files between devices, and exercise your drives sufficiently to know if they are near physical failure, so you won't be surprised later.

If you find an inconsistency with a file, you can then compare it against all 4 of your devices to determine the correct copy, and repair or replace the drive with confidence that you have an undamaged copy.
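If anyone wants to script the same idea instead of using Beyond Compare, here's a rough Python sketch of a full byte-by-byte pass between two drives (the drive letters and folder names are just examples):

```python
"""Rough sketch of a 'binary compare' pass between two backup drives.
Drive letters are hypothetical - point PRIMARY/COPY at your own mounts."""

import filecmp
from pathlib import Path

PRIMARY = Path("E:/archive")   # the copy you treat as the reference
COPY = Path("F:/archive")      # one of the other backup drives

def compare_trees(primary: Path, copy: Path) -> None:
    for src in primary.rglob("*"):
        if not src.is_file():
            continue
        dst = copy / src.relative_to(primary)
        if not dst.exists():
            print(f"MISSING on copy: {dst}")
        elif not filecmp.cmp(src, dst, shallow=False):   # full byte-by-byte read
            print(f"MISMATCH: {src}  <->  {dst}")

if __name__ == "__main__":
    compare_trees(PRIMARY, COPY)
    print("Done - any MISMATCH means check that file against your other copies.")
```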

This is of course only one way to deal with verifying multiple copies, as there are other strategies, but this is simple and cost friendly. I do this with my backups every now and then from my NAS (operating on RAID6 for redundancy, with periodic scrubbing to check for bit rot) to ensure my raw backups (not managed by a system that is designed to validate file integrity) are intact. This of course places a high degree of trust in my NAS to ensure files are not corrupted. In my case, if I find a problem I have several means to determine how to recover a file, since I have both kinds of backups (automatically verified via the NAS backup software, and manually verified with my raw backups and Beyond Compare).
Is a 2.5" HDD more likely to die when stored or in constant operation?
"If you don't use it you lose it" vs. "wear and tear" in either case hardware degrades over time. You have 4 copies (4 drives) of the same files which I assume is your mitigation if you should discover a drive fails or if you discover you have a corrupted file.
 
Your files were corrupted because you didn't use the HDD for 1 year?

I don't have much technical knowledge. I just connect the HDD + USB 3.0 case to the PC and try to perform some test on all the files compressed in rar, 7z and zip to find out if they are intact. When a compressed file is created, a hash or code is generated that remains saved inside the file; if something gets corrupted, the code changes and some software says that the file is corrupted?
 
The last HDD that I had a problem with was an IBM Deskstar: they got labelled "Deathstar" because of how horribly they would fail with the click of doom. Hitachi bought the line and it was a complete 180. Anyways, I notice you have multiple threads relating to basically the same problem and I believe you are overthinking it. Stop it
 
Your files were corrupted because you didn't use the HDD for 1 year?
I've never encountered file corruption from long-term idle drives so I can't really answer your concern. I have the same concern with SSD-based drives as well, but I don't use those for long-term storage due to the cost and capacity restraints. You have 4 HDD devices, so you could reserve one device for such a test, but HDD storage is reliable as long as you don't buy junk drives. The last time I was aware I even had a problem with HDDs was either immediate device failure, or way back in the days of RLL/MFM drives.

Using the Beyond Compare method I described did enable me to detect many years ago when a forced Windows 10 update crippled my RAID6 driver stability on my file server and I found it began writing zeros to my backups during file copy. Just goes to show reliability of your source device is paramount and being able to detect problems is also important so you ensure your backups are reliable in case you do need to perform a recovery.
I don't have much technical knowledge. I just connect the HDD + USB 3.0 case to the PC and try to perform some test on all the files compressed in rar, 7z and zip to find out if they are intact. When a compressed file is created, a hash or code is generated that remains saved inside the file; if something gets corrupted, the code changes and some software says that the file is corrupted?
That is one way to produce backups and check for corruption but relies more heavily on having multiple copies for recovery because if you have a problem, trying to partially recover files from a corrupted compressed and/or encrypted archive might be difficult or impossible. It is much easier and generally faster to copy around larger compressed files to USB than deal with the filesystem overhead of thousands of small files.
 
My concern is to turn on the HDD after an interval (two months, a year) and test if all the files that are compressed in RAR, ZIP and 7Z are still readable without corruption, and also to prevent the HDD from demagnetizing.

Does WinRAR have any function to test these compressed files for corruption?
 
Does WinRAR have any function to test these compressed files for corruption?
Again, you cannot test for corruption without anything to compare WITH. You need either a copy of the files or a checksum to compare against. Nothing will tell you if the files are corrupted or not in a vacuum.
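To make "a checksum to compare against" concrete, here's a minimal sketch that writes a SHA-256 manifest while the drive is known-good and re-checks it later - the manifest filename and paths are just examples:

```python
"""Minimal sketch: write a SHA-256 manifest for a drive while it's known-good,
then re-run later in 'verify' mode to spot silent corruption.
Paths/filenames here are just examples."""

import hashlib
import sys
from pathlib import Path

def sha256_of(path: Path) -> str:
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1024 * 1024), b""):
            h.update(chunk)
    return h.hexdigest()

def build(root: Path, manifest: Path) -> None:
    with manifest.open("w", encoding="utf-8") as out:
        for p in sorted(root.rglob("*")):
            if p.is_file() and p != manifest:
                out.write(f"{sha256_of(p)}  {p.relative_to(root)}\n")

def verify(root: Path, manifest: Path) -> None:
    for line in manifest.read_text(encoding="utf-8").splitlines():
        digest, rel = line.split("  ", 1)
        p = root / rel
        if not p.exists():
            print(f"MISSING  {rel}")
        elif sha256_of(p) != digest:
            print(f"CHANGED  {rel}")

if __name__ == "__main__":
    # usage: checksums.py build|verify E:/
    mode, root = sys.argv[1], Path(sys.argv[2])
    (build if mode == "build" else verify)(root, root / "manifest.sha256")
```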
 
My concern is to turn on the HDD after an interval (two months, a year) and test if all the files that are compressed in RAR, ZIP and 7Z are still readable without corruption, and also to prevent the HDD from demagnetizing.
Generally I don't think you need to be concerned about demagnetizing as a specific use case.

Minimally:
1) keep a sufficient number of backup copies for recovery
2) test your archives periodically by unzipping them somewhere
Does WinRAR have any function to test these compressed files for corruption?
The best way to test your archives is to unzip your archives. If a problem occurs the archive software will tell you. Some archive software has a function to validate the archives based on their internal checksums.
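For plain .zip files the check can even be scripted with nothing but Python's standard library - a rough sketch that walks a folder and CRC-tests every zip without extracting anything (the folder path is an example; for .rar/.7z you'd lean on the archiver itself):

```python
"""Rough sketch: CRC-test every .zip under a folder without extracting.
zipfile.ZipFile.testzip() reads all members and returns the first bad name (or None)."""

import zipfile
from pathlib import Path

ARCHIVE_ROOT = Path("E:/archive")        # hypothetical backup folder

for zip_path in ARCHIVE_ROOT.rglob("*.zip"):
    try:
        with zipfile.ZipFile(zip_path) as zf:
            bad = zf.testzip()           # None means every CRC matched
        print(f"{'OK ' if bad is None else 'BAD'} {zip_path}" +
              ("" if bad is None else f"  (first bad member: {bad})"))
    except zipfile.BadZipFile:
        print(f"BAD {zip_path}  (not readable as a zip at all)")
```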
 
The last HDD that I had a problem with was an IBM Deskstar: they got labelled "Deathstar" because of how horribly they would fail with the click of doom. Hitachi bought the line and it was a complete 180. Anyways, I notice you have multiple threads relating to basically the same problem and I believe you are overthinking it. Stop it
AHH yes, the (in)famous glass platter drives...

Hard for the drive to read data tracks when they are literally gone...
 
AHH yes, the (in)famous glass platter drives...

Hard for the drive to read data tracks when they are literally gone...
Yup, all over the place in the drive, like brake dust in a drum or from crummy pads on disc brakes. Hitachi was awesome - too bad WD scooped them up...
 
If you are concerned you could check the integrity of your files by doing a binary compare on each file using a tool such as Beyond Compare with each of your drives.
Full binary compare is very important. My primitive backup strategy is to do a full backup every two months. Then I delete the files from the previous backup set if they are binary identical to those in the new backup set. The program I use is CloneSpy. It's a painfully slow process, sure. But I consider it a pretty good mitigation against many causes of data loss, including:
- data transmission errors when copying (if a file exists both in the new and the old backup sets)
- bit rot of the original working files, not just backups, even if the corruption isn't ever noticed.
On top of that, this offers some amount of versioning, as non-identical files are kept forever.
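The pruning step itself is simple enough to script if CloneSpy ever feels too slow - a rough sketch of the idea, matching by content hash rather than a true byte compare (folder names are hypothetical):

```python
"""Rough sketch of the CloneSpy-style pruning step: delete a file from the old
backup set if a byte-identical copy exists anywhere in the new set.
Folder names are hypothetical; matching is by content hash, not by path."""

import hashlib
from pathlib import Path

OLD_SET = Path("E:/backup-2025-01")
NEW_SET = Path("E:/backup-2025-03")

def sha256_of(path: Path) -> str:
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1024 * 1024), b""):
            h.update(chunk)
    return h.hexdigest()

# Hash everything in the new set once...
new_hashes = {sha256_of(p) for p in NEW_SET.rglob("*") if p.is_file()}

# ...then drop old files whose exact content survived into the new set.
for old_file in list(OLD_SET.rglob("*")):
    if old_file.is_file() and sha256_of(old_file) in new_hashes:
        old_file.unlink()        # identical copy is kept in the new set
        print(f"deleted duplicate: {old_file}")
```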

The best way to test your archives is to unzip your archives. If a problem occurs the archive software will tell you. Some archive software has a function to validate the archives based on their internal checksums.
7zip can validate checksums of zip, 7z and rar files without unzipping. All those formats have at least CRC-32 internal checksums.
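For a mixed pile of rar/7z/zip, the command-line 7-Zip does it in one pass with `7z t` - here's a rough sketch that loops it over a folder, assuming 7z is on your PATH and using an example folder path:

```python
"""Rough sketch: run '7z t' (test) over every archive under a folder and flag
failures. Assumes the 7-Zip command-line tool (7z / 7z.exe) is on your PATH.
'7z t' verifies the archives' internal checksums without extracting to disk."""

import subprocess
from pathlib import Path

ARCHIVE_ROOT = Path("E:/archive")        # hypothetical backup folder

for archive in ARCHIVE_ROOT.rglob("*"):
    if archive.suffix.lower() not in {".zip", ".7z", ".rar"}:
        continue
    result = subprocess.run(["7z", "t", str(archive)],
                            capture_output=True, text=True)
    # 7z exits with 0 when everything tested OK, non-zero on errors
    print(f"{'OK ' if result.returncode == 0 else 'BAD'} {archive}")
```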
 
Full binary compare is very important.
Hashes are enough, and more practical.
If someone's uneasy with just CRC32, then MD5, SHA1, or others.

7zip can validate checksums of zip, 7z and rar files without unzipping. All those formats have at least CRC-32 internal checksums.
As can WinRAR, and practically any other compressor in the last 30-40 years (including DOS ones such as ARJ, PKZIP, LHarc, and even ARC).
 
AHH yes, the (in)famous glass platter drives...

Hard for the drive to read data tracks when they are literally gone...

Intriguing - I would have expected the outer coating to hold up better, as the heads fly higher there. Or maybe the heads were bouncing when de-parking.
 