Startup LogLogic does a nice job of explaining why it’s crucial to think in a nuanced way about the data your organization throws off every day, and how you can make that data searchable and useful. What they call “infrastructure data” I call one aspect of dark matter, but whatever. I like how neatly they capture the essential idea:
There are three major data sources in Enterprises today:
Public Data: all the stuff – files, documents, products that we have in the public domain. Getting at this stuff is pretty straightforward. You Yahoo! or Google it.
Unstructured Data: all the data inside the Enterprise that is more than often locked-up in applications, databases and other systems.
Infrastructure Data: all the data generated by applications, networking gear, servers, operating systems, mainframes and much more. To put it in perspective, Enterprises typically generate upwards of 40 terabytes of data in this class every year at rates exceeding 250 million messages per day.
It’s in this last category that LogLogic 3 comes in. [Emphasis mine]
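To put those quoted numbers in perspective myself, here is a quick back-of-the-envelope sketch. It is only a rough check, not anything from LogLogic: I am assuming decimal terabytes and a 365-day year, and I treat both figures as exact even though the quote gives them as lower bounds ("upwards of" and "exceeding"). The variable names are my own.

```python
# Back-of-the-envelope check on the quoted figures.
# Assumptions (mine, not LogLogic's): decimal terabytes, a 365-day year,
# and both figures taken as exact even though the quote states them as lower bounds.
BYTES_PER_TB = 10**12
bytes_per_year = 40 * BYTES_PER_TB
messages_per_day = 250_000_000

messages_per_year = messages_per_day * 365
avg_bytes_per_message = bytes_per_year / messages_per_year
messages_per_second = messages_per_day / (24 * 60 * 60)

print(f"average message size: {avg_bytes_per_message:.0f} bytes")
print(f"sustained rate: {messages_per_second:.0f} messages/second")
```

Under those assumptions the figures work out to roughly 440 bytes per message and just under 3,000 messages per second, around the clock. That is about the size of a typical log line, which fits the kind of data being described.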