![]() |
|
|
Thread Tools | Display Modes |
![]() |
#1 |
Badcaps Veteran
Join Date: Nov 2011
Posts: 2,301
|
![]() Not wanting to limit this to actual "SSDs" but, rather, "non-rotating storage media" (including FLASH "soldered down")...
What's been your experience with "flash" failures -- typically in tablets, phones (or even SSDs)? And, with the exception of SSDs (which can typically be replaced as an FRU), what recourse have you had to address said failure(s)? |
![]() |
![]() |
![]() |
#2 |
The Boss Stooge
Join Date: Oct 2003
City & State: Salem, MO
My Country: United States
Line Voltage: 240V @ 60Hz
I'm a: Professional Tech
Posts: 15,032
|
![]() had my first SSD die on me last week.....it was an OCZ Trion 150, 240gb. I've never had a tablet SSD fail, but I'm not a tablet junkie.....I still prefer a laptop.
__________________
<--- Badcaps.net Founder & Owner Badcaps.net Services: Premade Capacitor Kits Badcaps.net Capacitor Master List Motherboard Repair Services If you've come here in search of replacement capacitors or repair services, please use the links above. ![]() ---------------------------------------------- Badcaps.net Forum Members Folding Team http://folding.stanford.edu/ Team : 49813 Join in!! Team Stats |
![]() |
![]() |
![]() |
#3 |
Badcaps Veteran
Join Date: Nov 2011
Posts: 2,301
|
![]() By "die", did it simply stop working altogether? Or, did it's FTL prove incapable of coping with write wear-through? I.e., spinning rust should, theoretically, degrade gracefully as the grown defect table gets bigger (eventually impacting the capacity of the volume). The FTL should provide similar functionality for solid state media with the caveat that the entire medium will, eventually, fail.
|
![]() |
![]() |
![]() |
#4 |
The Boss Stooge
Join Date: Oct 2003
City & State: Salem, MO
My Country: United States
Line Voltage: 240V @ 60Hz
I'm a: Professional Tech
Posts: 15,032
|
![]() I'm not sure what caused its death...I switched it from one interface to another (ICH7 to an ICH10), and it never worked again. BIOS sees it, but it can't be read from or written to. Tried several utilities to gain access to its data, no go. I shut it off and pronounced it dead when DBAN said the time remaining to wipe it was 640 hours....
|
![]() |
![]() |
![]() |
#5 |
Great Sage 齊天大聖
Join Date: Dec 2009
City & State: Europe
My Country: some shithole run by Israeli agents
I'm a: Professional Tech
Posts: 26,642
|
![]() obviously you tried it on the original machine?
maybe a damaged sata connector. |
![]() |
![]() |
![]() |
#6 |
The Boss Stooge
Join Date: Oct 2003
City & State: Salem, MO
My Country: United States
Line Voltage: 240V @ 60Hz
I'm a: Professional Tech
Posts: 15,032
|
![]() |
![]() |
![]() |
![]() |
#7 |
Great Sage 齊天大聖
Join Date: Dec 2009
City & State: Europe
My Country: some shithole run by Israeli agents
I'm a: Professional Tech
Posts: 26,642
|
![]() fine on the drive pcb?
reason i say this, sata is a fast serial system - too damned fast. data is sent in small packets and corrupted ones are re-sent. the theory is that if the errors arent too high the transfer rates are still impressive. if you get a bad signal from a poor cable or connector the data still makes it but it's much slower - potentially so slow that more than a handfull of bytes is going to timeout. |
![]() |
![]() |
![]() |
#8 |
EVs Rule
Join Date: Apr 2011
City & State: Leeds
My Country: UK
Line Voltage: 230Vac 50Hz
I'm a: Professional Tech
Posts: 32,373
|
![]() SSDs do fail, I've had friends with Kingston drives go bad. None myself, had the odd SD card and USB drive fail though. Personally I do not trust any SSD not from a tier-one manufacturer (Samsung, Sandisk, Intel and *maybe* Toshiba), manufacturers like Kingston are frequently swapping the parts used on their SSDs so performance is inconsistent and lifespan is never guaranteed. See, for instance, the Kingston V300 debacle [1].
Newer SSDs are moving to 8-level or 16-level flash, so the per-cell density is getting really high, but the trouble is, all flash memory is damaged by erase operations. And as the cells are written more often, their leakage increases, so they hold data less reliably over time. This is one reason flash memory is terrible for archival purposes. If it is a high-density SSD, don't expect it to retain data without power for more than 10 years or so. A powered SSD is a happy SSD, because the controller can remap the drive periodically, when under little load. One other significant factor is ambient temperature. When I was employed at a large set-top box manufacturer I tested a number of flash memory devices in STBs at temperature. The unit that was running at 40C had about half the cycle life of a unit running at 25C. So, if you can position your SSD so it runs cooler, that's better. One reason I really don't like M.2 drives is because they are located so close to the hot CPU, compared to a SATA drive. Higher temperatures at high cycle counts are the worst-case for flash memory. [1] https://www.anandtech.com/show/7763/...er-micron-nand Last edited by tom66; 12-27-2018 at 12:25 PM.. |
![]() |
![]() |
![]() |
#9 |
Great Sage 齊天大聖
Join Date: Dec 2009
City & State: Europe
My Country: some shithole run by Israeli agents
I'm a: Professional Tech
Posts: 26,642
|
![]() |
![]() |
![]() |
![]() |
#10 | |
Badcaps Veteran
Join Date: Nov 2011
Posts: 2,301
|
![]() Quote:
[Coincidences always leave me suspicious...] |
|
![]() |
![]() |
![]() |
#11 | |||
Badcaps Veteran
Join Date: Nov 2011
Posts: 2,301
|
![]() Quote:
Quote:
Quote:
OTOH, I have colleagues who complain that they can't read a CD that they wrote a few months earlier (PEBKaC). An SSD can typically be removed/replaced leaving you with a usable piece of kit (sans SSD). OTOH, tablets, phones and other appliances usually have their solid state memory "soldered down". So, a failure in the media OR a failure in the FTL can result in bricking the device with no hope of salvage. |
|||
![]() |
![]() |
![]() |
#12 | |
EVs Rule
Join Date: Apr 2011
City & State: Leeds
My Country: UK
Line Voltage: 230Vac 50Hz
I'm a: Professional Tech
Posts: 32,373
|
![]() Quote:
Once a sector fails on an SSD, the drive will spend a long time attempting to recover it. This will lead to read latency climbing significantly and random sector failure will also likely cause issues with filesystems. With our STBs, when a failure occurred in the onboard eMMC, the Linux kernel spent about 20 minutes spewing out messages on dmesg/serial terminal before I had a usable terminal. And it effectively became unusuable because each sector read would be rejected after a 10sec delay from the drive controller. Maybe it's possible to configure the kernel to behave more gracefully when this goes bad but AFAIK there is no way for the kernel to know the drive is bad - it just takes forever to read from... |
|
![]() |
![]() |
![]() |
#13 |
Computer Geek
Join Date: Jan 2015
City & State: Nowhereland, Texas
My Country: USA
Line Voltage: 120/2/[email protected]
I'm a: Hardcore Geek
Posts: 1,990
|
![]() More reasons to not trust SSDs with critical data.
__________________
Don't buy those $10 PSU "specials". They fail, and they have taken whole computers with them. ![]() My computer doubles as a space heater. Windows 10? Only if you like forced, buggy updates and 24/7 telemetry. Samsung = Seagate = Seatrash = Trashgate Don't buy Seagate drives. Don't use Seagate drives. If you have any in service right now, make plans to replace them ASAP. SMR = Slow Magnetic Recording Avoid SMR, buy CMR drives instead. SMR is easily a 15+ year step BACKWARDS in HDD speed. Permanently Retired Systems: RIP Advantech UNO-3072LA (2008-2021) - Decommissioned and taken out of service permanently due to lack of software support for it. Not very likely to ever be recommissioned again. |
![]() |
![]() |
![]() |
#14 |
EVs Rule
Join Date: Apr 2011
City & State: Leeds
My Country: UK
Line Voltage: 230Vac 50Hz
I'm a: Professional Tech
Posts: 32,373
|
![]() *Another reason not to have critical data on any single medium *at all*. Backups, folks! Backups! Three backups, two different locations and at least one different type of medium. But if you're too lazy to do that, then at least use an online service e.g. BackBlaze.
|
![]() |
![]() |
![]() |
#15 | ||||
Badcaps Veteran
Join Date: Nov 2011
Posts: 2,301
|
![]() Quote:
Quote:
Environments that load apps "once" from persistent store could stumble along with the user only noticing a startup delay when the app is initially loaded. Quote:
Quote:
Of course, the flip side of this (if its indeed how these devices are designed) is that you're at the mercy of N different FTL implementations, each of which embody considerable BFM. :< |
||||
![]() |
![]() |
![]() |
#16 | |
Badcaps Veteran
Join Date: Nov 2011
Posts: 2,301
|
![]() Quote:
On-line services require a high speed connection to move copies of archives around. And/or support for more advanced protocols (e.g., rsync) to verify their integrity against local copies (and vice versa). For example, it takes a fair bit of time to copy a TB image over Gbe; imagine doing that with many TB! And, being a Cynic, I'm not sure I'd consider any of them "secure", given the number of break-ins/hacks and outright SALES of data that we hear about. |
|
![]() |
![]() |
![]() |
#17 | |||
EVs Rule
Join Date: Apr 2011
City & State: Leeds
My Country: UK
Line Voltage: 230Vac 50Hz
I'm a: Professional Tech
Posts: 32,373
|
![]() Quote:
Quote:
With our eMMC flash on our STBs the fault essentially was that the eMMC wouldn't mount correctly, but the application software didn't like this, so attempted to re-mount it frequently. Each mount attempt took far too long as it relied on a timeout, leading to the unit slowing down considerably. Quote:
|
|||
![]() |
![]() |
![]() |
#18 | ||
EVs Rule
Join Date: Apr 2011
City & State: Leeds
My Country: UK
Line Voltage: 230Vac 50Hz
I'm a: Professional Tech
Posts: 32,373
|
![]() Quote:
I don't keep the stuff on the cloud. It's only used as a backup method, the data is only there in case a failure occurs. Quote:
Without that key the data is useless, it is encrypted on my PC and if I have a drive failure they will ship me a HDD with the encrypted data on it, which I can then recover using that key. If you are so concerned about data security you can trust AES256, it *will not* be broken with current technology and is likely to remain secure for at least the next 20 years. |
||
![]() |
![]() |
![]() |
#19 | |||
Badcaps Veteran
Join Date: Nov 2011
Posts: 2,301
|
![]() Quote:
Without hacking the driver, it gives me similar diagnostics that I have available in other devices I've designed (that just use NAND/NOR directly). There, I watch the device's actual performance against it's "specified" worst case performance to detect potential failures (before they become "double failures" and, thus, less detectable -- the second failure masking the first). While I don't rely on it as a predictor of drive failure, I use it to modify the schedule for "file verification" so that the other files on the physical volume are revisited sooner in case there IS a problem brewing. Quote:
Quote:
In my case (10K's), I'd rather the cost savings AND the enhanced insight to the components' operation as I can't just swap out a drive -- nor do I have a network of retail establishments (phone vendors) that can provide replacement devices on my behalf. |
|||
![]() |
![]() |
![]() |
#20 | ||
Badcaps Veteran
Join Date: Nov 2011
Posts: 2,301
|
![]() Quote:
Quote:
I'm old enough that if something MAJOR happened (house explosion), I'd only fret over the loss of RECENT financial and medical records. And, those are periodically updated on portable media to handle the MORE likely scenario of having to evacuate (fire, flood, terror incident, etc.). Yeah, I'd miss my music archives, book library, technical library, software archive, project history logs, etc.. But, I'd also miss the various bits of equipment that I'd lost -- many of which being irreplaceable and/or essential to make use of the data (apps, source code) that was "lost". Recovering the archive from an offsite store would take months, anyway (getting a machine set up again that could access them -- and make use of them! -- and having the encryption key on my person when I abandoned the office!). Not likely to be the most pressing need I'd have! So, it's just as easy to treat them as disposable, at that point and start over. [Finances and medical, however, have no convenient "reset" and their "need" can prove to be "immediate"!] |
||
![]() |
![]() |
![]() |
Thread Tools | |
Display Modes | |
|
|