Resource search list
z_mysearch
- A search engine built with Lucene 2.0 + Heritrix, implemented in Eclipse.
heritrix-1.12.1-src.tar
- This crawler works best in combination with Lucene and is very powerful.
heritrix-1.10.1
- An open-source web crawler.
heritrix-1.14.0-src
- Source code of a well-known web spider. It can download entire sites, is highly extensible, and can fetch dynamic pages.
Heritrix_configure
- How to start your first Heritrix job: the uploader's own summary of Heritrix configuration, with text and pictures.
heritrix
- A web crawler. Users can use it to fetch the resources they want from the web, and developers can extend its individual components to implement their own crawling logic.
heritrix-1.14.2-src
- heritrix-1.14.2-src is the source code of the latest version of the web crawler Heritrix. I hope it is helpful to everyone.
LUCENE2·0HERITRIX
- A search engine based on Lucene and Heritrix.
HeritrixInstallation
- A Heritrix installation guide, very helpful for people new to web crawlers.
JavaSearch
- A search engine system I wrote with Lucene and Heritrix for my graduation project. It supports fairly simple searches. I hope it is of some use to anyone who wants it.
heritrix-1.14.3-src
- A high-performance word segmentation algorithm implemented in Java. It automatically performs minimal segmentation, and users can filter by segmentation category.
z_mysearch
- An image search engine built on Lucene 2.0 + Heritrix.
heritrix-1.14.3
- Open-source web crawler code.
heritrix-1.14.0
- Very good source code for everyone to study together. Please share any materials you have. This site is quite good.
heritrix-1.14.0-src
- Very good source code for everyone to study together. Please share any materials you have. This site is quite good.
Search
- "Developing Your Own Search Engine: Lucene 2.0 + Heritrix" (source code).
Lucene2.0Heritrix
- An introduction to the web crawler Heritrix, an open-source web crawler written in Java.
luceneandheritrix
- Chinese-language help documentation for the Lucene search engine, accompanying the book "Developing Your Own Search Engine: Lucene 2.0 + Heritrix".
119128627heritrix-1.14.0-src
- heritrix-1.14.0-src is a very good resource.
heritrix-3.0.0-src
- Web crawler source code, written in Java, able to crawl web pages quickly and in large volumes.