结合Lucene,看看http://www.drdobbs.com/parallel/indexing-and-searching-on-a-hadoop-distr/226300241?pgno=1
http://architects.dzone.com/articles/solr-hadoop-big-data-love
Hadoop的基本安装参考这个就行:
http://www.linuxidc.com/Linux/2011-04/35162.htm
需要注意的就是权限问题,以及SSH登陆问题
Eclipse 插件编译Hadoop:
http://www.ilablog.org/%E7%BC%96%E8%AF%91hadoop-eclipse%E6%8F%92%E4%BB%B6/