Sitepen Getting There From Here
2003 scalable networking - unknown
Opinion Mining
Building OpenDNS Stats
Berkeley Performance Tuning
Web Search for a Planet: The Google Cluster Architecture
RAMCloud: Scalable Datacenter Storage Entirely in DRAM
Information extraction systems: aspects and characteristics
2005 Web Content Mining 4
Tuning web performance
Data-Intensive Text Processing with MapReduce
Wrapper induction: constructing wrappers automatically to extract information from web sources
Huffman coding
Opinion mining and summarization
Do Not Crawl in the DUST: Different URLs with Similar Text
Automatically Generating Wikipedia Articles: A Structure-Aware Approach
Introduction to InfoQ
Deploying Grid Services Using Hadoop
Incorporating site level knowledge to extract structured data from web forums - keynote