Here is what I need, so if someone has a solution I’d happily hear it: Take as input an arbitrary list of company URLs, and then deliver as output a list of locations for all the companies. I don’t want to know where they’re hosted, or other domain registration information, but headquarters office data for all companies.
Anyone had to do this before?
I run into this problem all the time and have yet to come up with a good solution. The obvious one, of course, would be to do some semi-intelligent scraping, but the problem is just infrequent enough, and the requisite code just complex enough (locale is never in the same place, nor stored in the same way) that I haven’t rolled a Perl script.