Dell: 90% of data is never read again
Posted on 8 Jul 2010 at 15:03
Steve Cassidy agrees there's no such thing as a free lunch, but has lunch with Dell's enterprise division anyway
According to Dell, 90% of company data is written once and never read again.
This arresting claim cropped up in the middle of a presentation from Dell’s Enterprise division, recently given to Jon Honeyball and me. Given our usual style of dealing with such events, the poor devils didn’t stand a chance of actually working though their prepared order of slides, and I’d have to confess that we didn’t even try to stick to the script we’d discussed in the run-up to the meeting.
The poor devils didn’t stand a chance of actually working though their prepared order of slides
But even allowing for our natural tendency towards anarchy, this statement stood right out from the other stuff in the presentation.
It’s an odd statistic. How is that data measured? 90% of all documents? 90% of stored bytes? When they said “ever again” did they mean explicitly retrieved by name, or should we include free text searches in that statistic? How long an interval needs to pass before some piece of data is clearly identified as belonging to the 90%, so that steps can be taken to reflect its reduced importance?
These questions are just the starting point for an issue that demands quite a lot of thinking. It’s a fascinating finding to be offered to you by a vendor of servers, given that so few of the devices they try to sell to smaller organisations actually reflect this “fact” in their hardware and software specification.
What’s more, when larger companies try to make use of the sort of gadgetry that is available to take account of this fact, all too often the tricks involved end up being little more than a nuisance and a source of delight to no-one at all (apart, perhaps, from those who equate control with success).
I expect that if Jon and I hadn’t derailed them so enthusiastically, Dell’s sales guys would have proceeded by trying to talk up the toolkit provided to combat the effects of this “natural law of data”: namely, the provision of hierarchical storage measures based on a larger scale, corporate grade iSCSI SAN, plus supplementary devices such as deduplicators, robot-controlled tape silos and archiving utilities.
Even though there’s been some degree of trickle-down of these categories of product into our humbler networks over the past half-decade, the impact of that 90% dead-weight rule still isn’t something that crops up when we’re considering software purchases or hardware choices. We’re offered Windows Server “Enterprise” Edition, not Windows Server “90% Unused” Edition.
Where this 90% dead-weight rule does get taken into account is perhaps in our other choices: it becomes most important when you’re figuring out which shared drive letters to present on your network, and how you’re going to divide up the company’s business among various folders and drives, what the security groups are going to be – and, most importantly, in your estimates of a reasonably representative daily pattern of work loading.
Let’s look at just one example of such a decision to highlight the way the 90% dead-weight rule can affect your thinking and purchasing.
One of my clients found itself bumping up against the storage limits of its single-box servers. It’s astonishing how often you find that servers run out of puff at around the 2 to 4TB mark when it comes to presenting shares to the LAN.
This particular client is a big fan of HP’s ProLiant DL580 G5 series servers (de facto standards don’t come much more widespread than this one) and the G5 employs HP 2.5in SAS drive units as the standard bricks for building up logical drives.
The difficult part is to identify which 10% of the data is wanted, and separate it logically from the 90% that isn't.
By wombatmobile on 10 Jul 2010
It's About Ownership of the Data
No matter what the percentage of stale data, it's impossible to put it onto cheaper storage until you know who in the business it belongs to. Just use last accessed date is completely insufficient - many automated processes will trigger a last modified, effectively hiding stale data from archiving solutions.
What's needed is intelligence on who can access data (a complete permissions picture) and who actually is accessing the data (a complete audit trail). Native auditing for Windows and Unix basically breaks the box, and while NAS devices can pretty easily collect it, you need better solutions for actually using it.
Data governance is about much more than just shuffling off the 90% of inactive data. Without a complete picture of business ownership, it's impossible to govern data effectively.
By chfbrian on 11 Jul 2010
Catch 22 and an error
The sad thing is, just when you move off "Stale" data, you suddenly need it again. You don't need it until you don't have it, and if you have it, you don't need it. Deduplication can help condense some data so you can store more on less, but it can only do so much.
Also, the HP DL580 G5 mentioned in the article can be upgraded to 16 disks with a second drive cage. That would double the figures noted above.
By Saberus on 12 Jul 2010
- Headings vs headers: how to use both in Word
- Windows Server 2012 R2: how the Datacenter edition could change SMBs
- Invoices and VAT: how to set up your documents correctly
- Nexus 5 vs Samsung Galaxy S4 Active: the best phone for avoiding screen burn
- How much is a social user worth?
- The key to choosing a secure password
- Thunderbolt Bridge: a fast Mac migration tool
- Should you advertise on Twitter?
- How to track a lost smartphone
- Self-publishing success: the best way to sell your book
- The 5 most interesting UK businesses at SXSW
- Quickest way to upload 1GB? Hop on a train
- Move over Delia: IBM Watson is cooking tonight
- Eric Schmidt on the double-edged smartphone: friend and foe
- Getty joins the race to the bottom
- Hour of Code: five steps to learn how to code
- Sony Xperia Z2 Tablet review: first look
- Sony Xperia Z2 review: first look
- Samsung Galaxy Gear 2 review: first look
- Nokia XL review: first look
- IDC: iPad intertia opens door for Windows tablets
- Office 365 goes social with "Oslo" news feed
- Windows XP: upgrading 30,000 PCs in 30 days
- LibreOffice: ignore Microsoft's "nonsense" on government's open source plans
- Intel Xeon E7 v2 servers support 6TB of RAM
- Microsoft promises video calls between Skype and Lync
- Office for iPad due before July
- Windows 7 on business PCs gets an extension
- Windows apps land on Chromebooks with VMware
- Office 365 gets two-factor authentication