Despite all the many RSS feeds to which I subscribe I still regularly scan a non-RSS scraped page I set up three years ago, one that doesn’t even have an associated RSS feed. At www.kedrosky.com/stories.shtml I have a constantly updating list of all headline stories from the NY Times and LA Times business sections, plus a few financial wires, plus some other finance/business miscellania. It is all generated by some Perl-based scraping software that I wrote years ago.
I have been storing all these stories in a database the whole time, and I’m now up to roughly a million entries. While I can search it locally, one day I’ll expose the search on the web. Call it the poor man’s financial Lexis-Nexis!