A short presentation (part 1 of 3) describing how to use the open-source tools Apache Nutch and Apache Solr to crawl the web and process the resulting data.