Found something interesting today, now we’ll see what it really does. In it’s most basic form it’s an RSS feed for Google to crawl, only you no longer have to wait for Google to hit a link to your blog, page, or whole site each time you make an update, instead you feed Google directly with an XML feed. I’d guess it should get your pages crawled and indexed faster. The terms of service call for not more often than once per hour to push the feed. I’ve got this blog and the CaptiveReefing forums set up to push a feed every two hours, just set up a simple cron job with a wget… we’ll see what if anything really comes if it.
No RSS feed for your site? No problem, they’ve even come up with a python script that you can add to your site to generate it for you to notify them of changes.
Here’s a link to Google Sitemaps.
Now is this really a good thing for all parties involved? A ploy to absorb as much data to warehouse as possible? We know how data hungry the big G is, their storage capabilities are tremendous with literally thousands of servers clustered together, not sure who has more processing power Google or NASA.. might actually be a close race counting teraflops. Something to research next time I get bored.