Resource List
[Search Engine] Lucene+Nutch
Description: This book first describes how to configure the development platform, then covers Lucene and Nutch development in detail. <陈灵辉> Uploaded 2025-06-13 | Size: 21.93 MB | Downloads: 0
[Search Engine] luceneAndnutch
Description: Companion disc contents (source code) from the book on building a search engine with Lucene+Nutch. <snow> Uploaded 2025-06-13 | Size: 21.91 MB | Downloads: 0
[Search Engine] JTextPro-1.0.tar
Description: JTextPro: a Java-based text processing tool that includes sentence boundary detection (using a maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker <lgp> Uploaded 2025-06-13 | Size: 20.44 MB | Downloads: 0
[Search Engine] webcrawler
Description: A web crawler written in Java with fairly powerful collection features. <周Sir> Uploaded 2025-06-13 | Size: 23.44 MB | Downloads: 0
[Search Engine] heritrix1.14.4
Description: The heritrix1.14.4.zip release; downloads welcome. <观山> Uploaded 2025-06-13 | Size: 21.72 MB | Downloads: 0
[Search Engine] introduce-to--search-engine
Description: 《走进搜索引擎》 (Into the Search Engine), a classic introductory book on search engines by Liang Bin. The author is an NTU (南大) graduate and is now pursuing a Ph.D. at Tsinghua University. <杨济运> Uploaded 2025-06-13 | Size: 23.14 MB | Downloads: 0
[Search Engine] PHPSou_v1.2_GBK_20111226
Description: A search engine developed in PHP, including a spider crawling system and more; suitable for personal search. <网络> Uploaded 2025-06-13 | Size: 23.11 MB | Downloads: 0
[Search Engine] Char04
Description: Web search engine code containing various crawling algorithms and related subroutines. The code implements an eDonkey network crawling system that avoids being added to the central server's blacklist and works around the result-count limit when the crawler searches the server. Af <王乐> Uploaded 2025-06-13 | Size: 20.09 MB | Downloads: 0