"A relevant observation from our Operations team on the Seagate drives is that they generally signal their impending failure via their SMART stats. Since we monitor several SMART stats, we are often warned of trouble before a pending failure and can take appropriate action. Drive failures from the other manufacturers appear to be less predictable via SMART stats."
~10 years ago, I remember google research put out a highly cited paper wherein they found that SMART stats were not a particularly strong indicator of impending drive failure (50% of drives had no SMART indications of problem before failure). http://research.google.com/pubs/pub32774.html
Has this now changed (at least for Seagate)?
Reliability/longevity is nice but a signal of impending failure is far more valuable from an operations point of view.
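For anyone wanting to watch for the same signals on their own machines, here is a minimal sketch built on smartmontools' smartctl, assuming a Linux box with a drive at /dev/sda. The five attribute IDs are the ones Backblaze has said they monitor; the "any nonzero raw value is a warning" rule is a simplification of mine, not their actual policy.

    import subprocess

    # SMART attributes Backblaze has cited as failure predictors.
    WATCHED = {
        5:   "Reallocated_Sector_Ct",
        187: "Reported_Uncorrect",
        188: "Command_Timeout",
        197: "Current_Pending_Sector",
        198: "Offline_Uncorrectable",
    }

    def check(device="/dev/sda"):
        out = subprocess.run(["smartctl", "-A", device],
                             capture_output=True, text=True).stdout
        for line in out.splitlines():
            fields = line.split()
            # Attribute rows look like: ID# NAME FLAG VALUE WORST THRESH
            # TYPE UPDATED WHEN_FAILED RAW_VALUE, so fields[9] is the raw value.
            if len(fields) >= 10 and fields[0].isdigit():
                attr = int(fields[0])
                if attr in WATCHED and fields[9].isdigit() and int(fields[9]) > 0:
                    print(f"{device}: {WATCHED[attr]} raw value is {fields[9]}")

    check()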
Hi! Yev from Backblaze here -> Yes, we only report the stats of what we have in our environment. As much as we'd love to have a test of SSDs in a pod (augmented for SSDs of course) they're just not feasible from a cost per GB perspective. Hopefully sometime though :)
Input/Output rate, bandwidth and IO roundtrip delay.
* even the slowest SSDs have significantly higher I/O rates than the best mechanical drives, and the comparison between best-in-class mechanical and enterprise-class PCIe SSDs is just ridiculous: a 15K SAS drive will do 200 IOPS, a high end SSD will do a million
* 15K SAS drives will top out around 250MB/s on bulk sequential reads (that's a best-case scenario), high-end PCIe SSD are in the 2.5GB/s range
* HDDs have a latency of 10~20ms, SSDs have a latency of 100~200µs (RAM has a latency of ~100ns)
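A quick sanity check on those numbers: at queue depth 1, IOPS is just the reciprocal of per-I/O latency (the latencies below are the rough figures from the list above, treated as assumptions).

    # At queue depth 1, IOPS = 1 / latency. An HDD can't hide seek latency
    # behind parallelism (one head assembly); SSDs reach ~1M IOPS by serving
    # many requests concurrently at high queue depths.
    cases = {
        "15K SAS HDD, ~5 ms seek+rotate": 5e-3,
        "SATA SSD, ~100 us": 100e-6,
        "PCIe SSD, ~20 us": 20e-6,
    }
    for name, latency_s in cases.items():
        print(f"{name}: ~{1 / latency_s:,.0f} IOPS at queue depth 1")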
Have you productized these learnings in a powertop-like tool for Linux?
Smartmontools are not intuitive enough for the layman to use in any meaningful way, and Backblaze has really built some serious learning here that could be of use to everyone.
I suspect rotating drives have a variety of failure modes, some of which can be predicted by SMART and others which likely cannot.
Each new model is probably bound to have a different Pareto distribution of failure modes.
Now, if only Seagate had human-readable SMART values.
(I say this as I've recently built a FreeNAS box with a combination of Seagate NAS and WD Red HDDs - the WDs make it easy to look at the SMART stats and know what's going on. The Seagate ones, not so much.)
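For what it's worth, the decoding commonly reported in forums for Seagate's error-rate attributes (1, Raw_Read_Error_Rate, and 7, Seek_Error_Rate) is that the 48-bit raw value packs an error count in the high bits and a total operation count in the low 32 bits, which is why the raw numbers look alarmingly huge. That's community folklore rather than a documented format, so treat this sketch as an assumption:

    def decode_seagate_raw(raw: int) -> tuple[int, int]:
        # Reported interpretation: high 16 bits = errors, low 32 bits = total
        # operations. Unverified against any Seagate documentation.
        errors = raw >> 32
        operations = raw & 0xFFFFFFFF
        return errors, operations

    # Hypothetical raw value as smartctl might display it:
    errors, ops = decode_seagate_raw(0x00000D2F12A4)
    print(f"{errors} errors in {ops:,} operations")  # 0 errors in 221,188,772 operations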
HGST, formerly Hitachi Global Storage Technologies, part of WD as of 2012. The cool thing about them was that their consumer Deskstars were at least as reliable as enterprise disks by other manufacturers. I still have 12-year-old PCs here with HGST 80 and 160 GB drives that were subject to daily use and a lot of inappropriate handling. The Deskstars don't mind.
Very unfortunately, HGST has apparently scaled back Deskstar sales and development significantly since the acquisition. I guess it has to do with WD selling off some of HGST's 3.5" assets to Toshiba in order to appease competition authorities. See also https://news.ycombinator.com/item?id=10057519
"In May 2012, WD divested to Toshiba assets that enabled Toshiba to manufacture and sell 3.5-inch hard drives for the desktop and consumer electronics markets to address the requirements of regulatory agencies."
Added link to an older comment of mine that addresses the HGST/Toshiba thing. To me, it looks like newer Toshiba 3.5" models are based on Fujitsu tech (if the enclosure design is any indication). Also, Toshiba might abandon their HDD business completely. [1]
Fujitsu :( I worked at a Fujitsu wholesale distributor around 1999; EVERY SINGLE drive sold between 1999-2001 died within 3 years (PB15/PB16). Those were great drives, cheap, silent, fast, and they smelled great fresh from the factory due to pine sap rosin.
Allegedly the Cirrus Logic controller had a manufacturing defect and died due to heat. Myself, I always suspected that very peculiar and strong-smelling rosin flux. The PCB was drenched in it, and this type of flux is usually highly activated and requires cleaning; otherwise the acid will eat the solder joints and copper away, especially in humid and hot environments.
I had one fail, got a replacement, and with the replacement and its replacements continued that cycle until a new generation of drives came out, at which point I sold the stupid thing on eBay. It was unreal.
Of course, the reality is China wanted a piece of WD, and used the merger as leverage to get it. I would expect that by 2017, HGST drives will be just as shit as WD's. Which is unfortunate, because the Japanese designed one hell of a hard drive.
Thank you for sharing this! I actually was naive enough to believe that WD would continue to let HGST operate as a separate entity.
There is probably some good news in the article though, for what it's worth: "At that time John Coyne ran WD. He since retired, with HGST boss Steve Milligan taking on his job." ... "In other news Western Digital has announced a new executive management team, and it looks almost like HGST executed a reverse take-over of Western Digital." ... "A person who was close to the corporate action in Western Digital and HGST said: 'All key positions are with HGST people; it's a reverse buyout. First HGST took Coyne's money to buy themselves (probably with a clause that Milligan is becoming CEO) and then they watched WD dismantling itself.'"
The 400GB Hitachi Deskstar in my old old Dell Desktop (Dimension 9100) so think early 2005, was still going strong before the power supply in that desktop died in 2013 or so. It had about 20 bad sectors according to SMART, but it still was chugging along.
Quantum was the premier maker of SCSI drives back in the day. They were beating IBM and IBM needed more capacity so IBM bought them. Then IBM sold to Hitachi, who sold the drive business to Western Digital who sold the drive business to Toshiba.
I believe these Deskstar types and derivatives are the same essential mechanism and processes as those old quantum drives (probably especially in terms of the QA processes). The heads and the technology have improved to give better capacity of course, but I've been buying and relying on these drives for ~25 years at this point.
I'm not surprised to see them showing up well on these charts.
Not quite the full picture. And there's bad news: Newer Toshiba desktop drives look more like the server stuff that they have been manufacturing since acquiring Fujitsu's HDD business. Plus, Toshiba might give up the HDD business entirely. [1]
Seems like the WD effect has appeared on their 8TB drives :-/ My NAS has 5x 4TB HGST drives I bought due to previous Backblaze reports, and I'm waiting to figure out which 8TB drives I should buy - the Seagate Archive had really bad real-world usage reviews and now HGST seems to be slipping as well :-(
If you look at the number of drives and the length of time they've had the drives, you quickly realize that you need to take it with a grain of salt. They even state in the article they don't have enough data to come to any conclusion on those drives.
I don't really understand their methodology for computing failure rate. The page says they calculate the rate on a per annum basis as:
([# drives] * [# failures]) / [operating time across all drives]
Wat? The numerator and denominator seem unrelated. What is being measured here?
To me, it would make more sense to look at time to failure. Together with data on the age of the drive and the proportion of failures each year one could create an empirical distribution to characterise the likelihood of failure in each year of service. That would give a concrete basis from which to compare failure rates across different models.
Are you referring to the "(100*drive-failures)/(drive-hours/24/365)"? There's no multiplication of total # of drives and # of failures in there.
It's all just a scaling: you have a number of broken drives in a corner of the datacenter in the wire bucket that says "broke during 2015", you count them, divide by total hours of that type of disk running (since they may have been brought in commission at different points), and then scale it so you get it in percent-per-year, not likelihood-per-hour.
It smells of someone explaining code, rather than illustrating an important engineering formula, but there's nothing wrong with the rescaling calculation per se.
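For concreteness, here is that rescaling as a minimal Python sketch (the function name is mine; the formula is the one quoted above):

    def annualized_failure_rate(failures: int, drive_hours: float) -> float:
        # Backblaze's published formula: failures per drive-year, in percent.
        drive_years = drive_hours / 24 / 365
        return 100 * failures / drive_years

    # 100 drives running all of 2015, 5 failures:
    print(annualized_failure_rate(5, 100 * 24 * 365))  # -> 5.0 (% per drive-year)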
> Are you referring to the "(100*drive-failures)/(drive-hours/24/365)"? There's no multiplication of total # of drives and # of failures in there.
Perhaps the problem is the specific example given. 100 is the size of the drive fleet and also the multiplier required to convert to percentages. Let's assume you are right and the 100 in the equation is not #drives.
Even so, I find the approach questionable. If the point is to calculate the proportion of failures then that (overly simplistic) calculation is: [# failures] / [# drives] = 5 / 100 = 5%.
But this isn't what's calculated. Instead the author calculates the proportion of drive-years per annum affected by failure. For the 100 drives in the example the cumulative number of operational hours given in 2015 is 750K hours (out of a possible 876K hours, had the drives been operating 100% of the time).
That's a problem because 750K / 876K = 85.6% of total time, i.e. 85.6 drive-years for the fleet.
5 / 85.6 = 5.84% "failure rate", which seems to me an overstatement.
The problem gets worse as the number of operational hours decreases. Imagine for a moment the 100 drives only operated 50% of the time in 2015. We have:
(100 * 5) / ((876K * 0.5) / 24 / 365) = 10% "failure rate". This despite only 5% of the drives having failed.
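Reusing the annualized_failure_rate sketch from above, the disagreement is easy to reproduce with the example's numbers (100 drives, 5 failures):

    print(annualized_failure_rate(5, 876_000))        # 100% uptime  -> 5.0%
    print(annualized_failure_rate(5, 750_000))        # 85.6% uptime -> ~5.84%
    print(annualized_failure_rate(5, 876_000 * 0.5))  # 50% uptime   -> 10.0%
    # Per drive-year of operation, 10% is correct; as a fraction of drives
    # owned, it's 5%. Which one "overstates" depends on the question asked.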
Survival data is tricky to model. They have just done an overall average number of failures per running hour. There do seem to be a lot of factors not taken into account here, such as drive age (in both running time and elapsed time), etc.
With right-censored data (as this is), if you measure age at death then you're only modelling already-failed drives, so you'll under-represent the good drives.
It would be good to see some statistics done so we can see confidence intervals around a hazard rate at different ages.
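For anyone who wants to try exactly that, a minimal right-censored survival sketch using the lifelines package (the toy table and column names are made up; real inputs would be per-drive ages and failure flags):

    import pandas as pd
    from lifelines import KaplanMeierFitter

    # Per-drive table: age in days, and 1 = failed, 0 = still running (censored).
    df = pd.DataFrame({
        "age_days": [30, 400, 800, 1200, 1500, 1500],
        "failed":   [1,   1,   0,    1,    0,    0],
    })

    kmf = KaplanMeierFitter()
    kmf.fit(durations=df["age_days"], event_observed=df["failed"])
    print(kmf.survival_function_)    # estimated P(drive survives past age t)
    print(kmf.confidence_interval_)  # the confidence bands asked for above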
Yep, it is not science: they don't even consider a null hypothesis.
It is raw data that they provide in an excellent fashion.
You can consume it raw and trust that their high drive counts make the numbers statistically meaningful, or you can use their data for real science. Either way, it is a great deal that they take the time to share it.
A useful additional metric is the age of the drive at failure.
This would determine whether the failure rate is constant over the life of the drive (meaning random failure) or age related (infant mortality or old age).
25 drives that fail after 1 week plus 25 that fail after 50 weeks is different to 50 drives that fail one per week.
Luckily they open source the operational and SMART status for all of their drives[0]. This means that you can do this additional analysis (and more). Which is awesome.
Brilliant, thanks for letting me know. I'm studying Logistics Engineering and reliability analysis is part of my degree. This will make a great case study :)
To save you some time, they did this analysis previously. I can't find it, but the summary was that the failures happen at the two ends and not a lot in the middle. ie. a bunch die early (infant mortality) and the rest die pretty late (old age).
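A rough sketch of that age-at-failure analysis against the published daily CSVs (their schema includes date, serial_number, model, failure, and smart_9_raw = power-on hours; the file name is illustrative, and you would concatenate every daily file to get the complete failure set):

    import pandas as pd

    df = pd.read_csv("2015-12-31.csv",
                     usecols=["model", "failure", "smart_9_raw"])
    failed = df[df["failure"] == 1].dropna(subset=["smart_9_raw"])

    # Bucket age-at-failure into whole years of power-on time; the bathtub
    # shape (many early deaths, many late ones) shows up in this histogram.
    failed = failed.assign(
        age_years=(failed["smart_9_raw"] / 24 / 365).astype(int))
    print(failed.groupby("age_years").size())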
The whole article reads like the excuses of someone with a vested interest in discrediting evidence of their favorite brand's poor performance. I don't think the takeaway from the data provided by Backblaze is "I can expect to get a failure rate of exactly 1.231971% if I buy brand X's hard drives." The end-user-useful conclusions are things like "HGST's drives are the best," and "6TB drives are less reliable than 4TB drives right now."
Sure, all of the factors listed in the criticism may play a role in the failure rates (except the external enclosure bit, since A. the majority of the "shucked" drives were 3TB, and B. they've outgrown that practice). But they only have the weakest of justifications for believing that those factors vary systematically across the manufacturers. And indeed, even if those factors did vary systematically, we'd still get the right answer if we had made the more general conclusions. For example, if the vibrations in the Seagate-only enclosures are greater than the vibrations in the HGST-only enclosures, that can only be because the HGST drives are better and vibrate less. Or alternatively, maybe the pods all vibrate the same, but HGST is better because it is more resistant to vibrations.
True, and what those criticisms actually show is that Backblaze's data is highly relevant for the average consumer.
I regularly buy external HDDs, rip them out and put them into desktops and laptops, put them back in different enclosures, and so on. As a result, my HDDs experience a lot of movement and extreme temperatures (e.g. being left in the trunk of a car on a hot summer day). It's good to know which models are the most likely to survive such abuse in the long term.
Thanks for this. Although it makes some important points, it reads as if the author is annoyed that Backblaze's data from tens of thousands of drives is getting so much press, compared to the rather useless single-drive reviews published by sites like TweakTown.
It's also a bit disingenuous to criticize Backblaze's methodology when you know that a 'comprehensive study' under more controlled conditions will NEVER actually happen with the necessary sample size to draw conclusions.
Stress testing is a valid methodology for determining reliability - e.g. car makers crash their cars into walls at high speed to make sure they are safe, or use a robot to push the brake pedal a million times to see when it fails - so they hardly deserve criticism for pushing the drives hard. More information for the consumer is a good thing.
I read the same thing a year ago and I came away actually upset at the tweaktown article. Off the top of my head, I remember some of the complaints being that the drives were subject to abnormal amounts of heat and that the drives were consumer-level drives.
I remember a study Google did on hard drive reliability and it seemed to show that heat had little to no effect. I also don't regard consumer-level as being a bad thing. As a consumer, I kind of want to know which drives are built to take abuse better. All drives fail; which drives fail more and at what cost?
The tweaktown article did talk about temperature. I think you were right to feel they were being silly with that. Temperature MAY correlate with failure but Backblaze found it did not do so within the ranges they actually see in their environment. Something about which it appears they would have more than enough data to be able to compute.
Google's study some time ago found that temperature either didn't correlate with failures or, in the ranges they ran their machines, had an inverse correlation with failure. It would appear to be one problem disk manufacturers have largely surmounted.
The Tweaktown article is a straight hit piece. I'm not really sure what the motivation would be. The writing is so sloppy and negative that it's hardly compelling.
I didn't buy their argument that Backblaze's early drive failures (first week or what have you) can be explained by their purchasing methods. My understanding is that they still see these well after they stopped buying from Costco etc.
I've noticed this to an extreme degree on HN lately to the point that I upvote posts I disagree with because they present a valid point. The only reason I can see for it is that they have a different opinion from most HN readers.
Not just the Bay Area either. They coined the phrase "hard drive shucking", 'cause they were buying consumer external USB drives, then digging the drive out and throwing away the enclosures.
That story is linked down near the end of the article:
Yev from Backblaze here -> Not ALL the hard drives, but yes. In fact we explicitly told anyone that was out buying hard drives for us to leave some on the shelves for the average consumers going into the stores; hopefully they listened.
HGST and its cousin HDS don't get nearly the recognition they deserve in the North American Enterprise storage market. Their products have, in my experience, always offered phenomenal value and rock-solid reliability at very reasonable prices. HDS arrays in particular are pretty great at outperforming 'big name' storage vendors at far lower prices.
I think they changed brands too many times to keep their enterprise reputation intact. Few people even remember that HGST used to be Hitachi which used to be IBM.
On the other hand, those who do remember IBM hard drives probably remember them as the Deathstar, so HGST might not want to be associated with their old home so much.
What would be really helpful is if they could simply put some amazon links on this report to the drives with the best reliability according to their tests.
People will always be sour and accuse you of things.
But actually I don't see how this makes them biased in any way. All drives essentially sell for the same amount (and Amazon pays a percentage of that), so if you trust the info as being accurate (and why wouldn't it be?) then how could it be biased, given there is such little latitude in pricing?
And who is going to accuse them anyway? People who read HN? If so, so what?
The data presented is a nice shortcut answering the question of "which drive should I buy" without having to read all of the charts and most importantly think.
Lastly, you don't have to buy from amazon just because they give you a link but it does make it easier to see a price and compare to whatever vendor you might typically use (or provide several links to different vendors).
Brian from Backblaze here. If Backblaze would become an Amazon affiliate, IF you clicked our link and then purchased a hard drive, Backblaze would get about 3% "kickback" from Amazon! (That's the way the Amazon Affiliate program works, you provide a link and you get 3% kickbacks.) The problem is we would look like we are "pushing" drives to get the 3% kickback and it damages our credibility and reputation.
As a backup company, we hold ALL our customers data, so our reputation is incredibly important to us. People MUST trust us as impartial and trustworthy and not sleazy or we would go out of business quickly.
> The problem is we would look like we are "pushing" drives to get the 3% kickback and it damages our credibility and reputation.
1) So what does it look like now with what you are doing? For example, you are offering free, credible information about drive reliability, which cuts against what you actually do, which is make worrying about drive reliability irrelevant. While I am sure that the following is not the case, I could easily say that you are doing this to make people think drives aren't reliable and hence they need Backblaze! Wow, look at drive failure, I shouldn't DIY this! (Do I think that is your strategy? To repeat, I don't.)
2) Note that http://www.dpreview.com was purchased by Amazon and it has only grown larger and more reputable (in terms of the reviews) since then. And they openly link to Amazon and they could easily be accused of a tremendous bias but apparently they either aren't worried about that or the effect is nominal.
3) I can fully understand, as a business decision, why you might not want to "cheesy" up (my words) your site with amazon links or perhaps you might feel the 3% is not consequential enough to do so. It is certainly a judgement call. However don't assume that everyone that would be a potential user of your company really would think that way because I can assure you that isn't the case.
> we hold ALL our customers data, so our reputation is incredibly important to us.
The fact that you are earning money from affiliate links does not mean you are not reputable and doesn't give me any less confidence that my data will be safe. It's a non issue (for that reason). You have a right to earn money in any reasonable fashion. Affiliate links are an accepted way to earn money (we aren't talking about selling customer data). If anything I think almost the opposite. I want to know that you are making money and robust in business practices so you have the funds to insure your operation will continue for the foreseeable future.
> perhaps you might feel the 3% is not consequential enough to do so
We struggle with it internally, I assure you we doubt ourselves all the time. :-) Some companies have an "informal fun loving" outward appearance, like if you purchase from Zappos they send emails like "the magic elves are making your shoes, we will send them along very soon..." But bankers tend to wear suits and ties and appear "very serious" in their communications even while frittering away your money on sub prime mortgages.
Anyway, the point is I'll forward your note along and heck, maybe next quarter our drive stats blog post will have Amazon links and we'll make a little extra money. :-)
However it's important that you wrap this in the proper words [1] not just plop the links on the page.
You need to explain the links but without apologizing for putting them there. You can even say perhaps that you were asked to do this (because you were). And don't chicken out and say you are donating the $$ to charity or anything like that.
Depending on how you write this, you will minimize the whiny blowback (if any). That said, running a business is not running a popularity contest to the tune of the most vocal commenters on HN or reddit or wherever.
If you are not doing so already you might want to issue traditional press releases with your results as well.
Of course if you do the links (and I would try this for more than one quarter) if it works or if it doesn't work you can then do a blog post on that!
[1] In the business I am in we charge for a service that our other competitors give away for free. By wrapping it in the proper words we often get a thank you instead of a complaint.
Considering a 4 TB hard drive has to track 32,000,000,000,000 individual bits, allowing reading and writing repeatedly of each one, on platters that are spinning 120x per second, spaced a hair's width from their heads... I think it's actually incredible.
As for SSDs, we keep wishing that we could switch to them, but they're still 10x more expensive on a $/TB basis. That may change in the next few years, and if it does, we'll look forward to sharing data on SSD usage at scale as well.
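For the skeptical, the arithmetic behind that bit count (assuming decimal terabytes and a 7200 RPM spindle):

    bits = 4e12 * 8   # 4 TB -> 3.2e13, i.e. 32 trillion bits
    revs = 7200 / 60  # 7200 RPM -> 120 revolutions per second
    print(f"{bits:.1e} bits, {revs:.0f} revolutions/second")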
I guess my point about hard drives is most people never back them up and kind of always expect them to hold up over 5-10 years. They have years of photos, videos and documents stored on them. Then there are friends savvy enough to set up a RAID system, and invariably the RAID hardware fails before the drive does and they can't get a replacement.
Thanks again for sharing the drive reliability statistics.
Is it possible for this data to ever be useful? Given the time necessary to acquire the data, and the rate at which improvements are made to drives, can't we assume that drives purchased today probably won't behave in exactly the same manner as drives purchased a year ago?
I don't mean to insult, just to ponder the relevance of such long-term studies on tech that changes so quickly.
My takeaway in the long run is in trying to narrow the list of HD manufacturers I am comfortable purchasing. Your point has truth in it, but it also seems fair to observe that companies known for producing reliable products consistently will continue to do so, all other things equal.
When the data is consistent for several years you can already figure that Seagate is not going to improve that fast; when they do improve in the Backblaze data, you can start buying them again.
Large companies may buy Seagate due to the price advantage and the fact that their storage systems can better handle the drive failure rate.
The Seagate drives do seem to be improving in reliability though. The higher-capacity Seagate drives, which I presume are newer models, have better failure-rate numbers than the lower-capacity drives. The 4 and 6TB drives seem to have reasonable failure rates compared to the other manufacturers - only HGST is better than Seagate at 4TB, and the Seagate 6TB drive has a lower failure rate than the HGST 8TB. For >4TB drives the Seagate 6TB has the lowest failure rate.
6TB: 1.89%
4TB: 2.19%/2.99% (depending on model)
3TB: 5.1%/28.34% (depending on model)
2TB: 10.1%
1.5TB: 10.16%/23.86% (depending on model)
This is definitely useful, although maybe not for purchasing. It lets you know which SMART attributes are most useful, for example. Also, given the periodic reports, you can make judgments about how brands are trending (although you have to be careful about age factors). I think their reports showed 3TB drives are not great as well.
It'd be interesting and quite helpful to see the failure rate vs. drive age, per manufacturer.
For example, for less reliable manufacturers there might be a "if you get past the first N weeks, you are fine" pattern, or a failure cliff exactly 1 week past the warranty period, or something equally entertaining.
I've got 5 Western Digital drives which have failed out of original purchase of 6. Now I'm wondering if it's really worth it trying to go through the RMA process (I need to figure out exactly how old they are and how long the warranty is) or if I should just give up on Western Digital and go with a different manufacturer... though I am not looking forward to spending that amount of money all at once.
Great stuff. Does anybody have any stats for drives' Bit Error Rates (BER) / maximum unrecoverable read errors (URE) / non-recoverable read error rates ? By my understanding, manufacturer quoted BERs for commodity drives, often 10^14, tend to be 10^15 or higher in practice.
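For context, here is what a quoted rate of 1 error per 10^14 bits implies for one full read of a 4 TB drive, assuming independent errors (a strong assumption):

    import math

    ber = 1e-14          # quoted URE rate: 1 error per 1e14 bits read
    bits = 4e12 * 8      # one end-to-end read of a 4 TB drive

    expected = bits * ber                    # ~0.32 expected errors
    print(f"P(clean full read): {math.exp(-expected):.1%}")  # ~72.6% (Poisson)
    # At 1e-15, as suggested for real-world drives, this rises to ~96.9%.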
This information is super useful. I have an ST3000DM001 and only trust it because its smart stats are still all in the green (and of course I have local and cloud backups of anything important).
I've had it for four years now and there are no warnings of any kind yet, so I guess I got one from a good batch.
On the basis of buying a single personal hard drive this data is interesting but wouldn't have much impact on your purchasing. As usual, the advice is to have multiple backups of everything.
That's the wrong message to take away. The right message is: every manufacturer goes through periods of good and bad disks. Don't depend on any drive to be perfect.
The Seagate 3TB were awful, but their 4TB seem to be just fine.
I've had two Seagate drives "fail" recently after <2 months - the drive is fine but the (OEM) USB3 enclosure is dead. No idea who manufactures those for Seagate but I'm not impressed.
Similar experience with WD. I bought a western digital USB drive. It failed, I RMA'd it for a new one. It failed. A year later I bought a bigger one, and it failed after another year.
I cracked open the enclosures and the drives are just fine. I still use them for backups with no errors.
Honestly, not that much: to feel comfortable at home I would need 20 TB of storage (of course it's only Linux ISOs ;-). A bit more than ten thousand people like me and they would have to reorder more drives.
Would be more interesting to find out reliability figures for high-throughput data-center models of hard drives instead of backup-drive models with low access rates.
It's a common scam to sell flash drives with modified firmware that causes them to report a larger size than the underlying flash chips provide. Usually once you write past that point, the writes will loop around or drop the data entirely. It's likely that the $17 flash drive you linked to is a scam.
Beyond that, flash drives tend to have low write durability and horrible performance on large writes (because of poorly implemented garbage collection).
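Tools like f3 and H2testw catch the fake-capacity trick by filling the device with offset-dependent data and reading it all back; a minimal sketch of the idea (path and sizes are illustrative):

    BLOCK = 1024 * 1024  # 1 MiB

    def pattern(i: int) -> bytes:
        # Deterministic, offset-dependent data: if the firmware wraps writes
        # around (block N silently overwriting block 0), read-back won't match.
        return i.to_bytes(8, "little") * (BLOCK // 8)

    def fill_and_verify(path: str, blocks: int) -> None:
        with open(path, "wb") as f:
            for i in range(blocks):
                f.write(pattern(i))
        with open(path, "rb") as f:
            for i in range(blocks):
                if f.read(BLOCK) != pattern(i):
                    print(f"mismatch at block {i} - capacity is likely fake")
                    return
        print("all blocks verified")

    # e.g. fill_and_verify("/media/usbstick/test.bin", blocks=15_000)  # ~15 GB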
I have 2x ST4000DM and 1x ST4000VX in my desktop, plus one 4TB Seagate 'surveillance' drive as a USB luggable, though OS X, to which it is currently connected, doesn't want to give me the specifics (neither right-click Info nor DiskUtil).
"A relevant observation from our Operations team on the Seagate drives is that they generally signal their impending failure via their SMART stats. Since we monitor several SMART stats, we are often warned of trouble before a pending failure and can take appropriate action. Drive failures from the other manufacturers appear to be less predictable via SMART stats."
~10 years ago, I remember google research put out a highly cited paper wherein they found that SMART stats were not a particularly strong indicator of impending drive failure (50% of drives had no SMART indications of problem before failure). http://research.google.com/pubs/pub32774.html
Has this now changed (at least for Seagate)?
Reliability/longevity is nice but a signal of impending failure is far more valuable from an operations point of view.