Real-time Streaming and Querying with Amazon Kinesis and Amazon Elastic MapReduce
Structure Data 2014: BIG DATA ANALYTICS RE-INVENTED, Ryan Waite
Part II - Basic Techniques: Search engine architecture; Web crawling basics (following links, crawl courtesy, ...); Storage; Text indexing; Querying and term-based.