算法系列:
人工神经网络系列文章: http://www.cnblogs.com/gpcuster/archive/2008/05/22/1204705.html
http://www.codeproject.com/KB/recipes/aforge_neuro.aspx
一个很好的Machine Learning的开源工具网站
http://www.mloss.org/software/
分类算法介绍漫谈:
http://blog.pluskid.org/?tag=unsupervised-learning
支持向量机介绍的不错的文章系列:(很牛)
http://www.blogjava.net/zhenandaci/archive/2009/06/21/254519.html#283487
Chih-Jen Lin's Home Page: http://www.csie.ntu.edu.tw/~cjlin/
http://www.csie.ntu.edu.tw/~cjlin/libsvm/
Google数学之美系列文章:
http://hi.baidu.com/lockingxp/blog/category/%CA%FD%D1%A7%D6%AE%C3%C0%CF%B5%C1%D0%20by%20%CE%E2%BE%FC
Blog系列:
Web抽取有关:
fuliang: http://fuliang.javaeye.com/blog/306759
信息抽取中:http://blog.csdn.net/ictextr9
试验平台:
- IR实验系统: http://blog.so8848.com/2009/06/50714.html
- 数据库领域的软件:
ACM SIGMOD maintains a list of publicly available software as a service for the database community. To allow easier orientation about the nature of the software, we have subdivided the lists into
Packages from non-profit organizations
Non-profit organizations are for example universities or government research labs or open-source development communities.
Packages from commercial organizations
Commercial companies (incl. their research departments) which make certain packages publicly available.
http://www.sigmod.org/databaseSoftware/
3. 数据抽取:
WysiWyg Web Wrapper Factory (W4F) http://db.cis.upenn.edu/DL/WWW8/index.html
代码下载:http://www.pudn.com/downloads106/sourcecode/web/detail438422.html
The Road Runner Project Towards Automatic Data Extraction from Large Web Sites
资源URL:http://www.dia.uniroma3.it/db/roadRunner/
(WEB可视觉化用到的工具) WebBrowser(IE)和XPCOM的资源 (FireFox)
XPCOM
http://www.blogjava.net/sdyjmc/archive/2006/12/04/85409.html
http://hi.baidu.com/xuqingyang/blog/item/17d5403e5cc8a63871cf6ced.html
WebBrowser:
http://www.cnblogs.com/tuyile006/archive/2007/05/18/751455.html
开源系统研究:
分类搜索引擎: http://project.carrot2.org/
软件测试视频系列:
QTP学习视频汇总: http://space.itpub.net/14780873/viewspace-478221