File name: ThemeCrawler

  • Category:
  • Data mining
  • Resource attributes:
  • [Java] [Source code]
  • Upload date:
  • 2016-10-02
  • File size:
  • 1.4 MB
  • Downloads:
  • 0
  • Provider:
  • shi***

Description (all downloaded content comes from the internet; study and use it at your own discretion)

Common crawler search strategies fall into two groups: those based on the link structure of the web, and those based on content evaluation. The first determines a page's importance from the link relationships between pages and uses that importance to decide the order in which links are visited. Although this approach accounts for link structure, it ignores how relevant a page's content is to the topic, so the crawl is prone to "topic drift". The second mainly considers page content; its advantages are a clear rationale and simple computation. However, it ignores the link relationships between pages, so it is weak at predicting the value of linked pages. To address these problems, it is proposed to apply the cuckoo search algorithm to a focused (topic-specific) crawler.
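The trade-off between the two strategies can be illustrated with a small crawl frontier that blends a link-based score with a content-based score. This is only a minimal sketch and is not taken from the ThemeCrawler source: the class name FrontierSketch, the contentWeight blend factor, the keyword-overlap relevance measure, and the assumption that the link score is already normalized to [0, 1] are all hypothetical. It also does not implement cuckoo search; it only shows the kind of priority ordering that such an optimizer might be used to tune.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashSet;
import java.util.PriorityQueue;
import java.util.Set;

/** Hypothetical sketch: a crawl frontier ordered by a blended link/content score. */
public class FrontierSketch {

    /** A URL together with the priority it was given when it was discovered. */
    static final class ScoredUrl {
        final String url;
        final double score;

        ScoredUrl(String url, double score) {
            this.url = url;
            this.score = score;
        }
    }

    private final Set<String> topicTerms = new HashSet<>();
    private final double contentWeight; // assumed blend factor in [0, 1]
    private final Set<String> seen = new HashSet<>();
    // Highest score first.
    private final PriorityQueue<ScoredUrl> queue = new PriorityQueue<>(
            Comparator.comparingDouble((ScoredUrl s) -> s.score).reversed());

    public FrontierSketch(double contentWeight, String... terms) {
        this.contentWeight = contentWeight;
        for (String t : terms) {
            topicTerms.add(t.toLowerCase());
        }
    }

    /** Content strategy (crude): fraction of topic terms that occur in the text. */
    double contentScore(String text) {
        if (topicTerms.isEmpty()) {
            return 0.0;
        }
        Set<String> words = new HashSet<>(Arrays.asList(text.toLowerCase().split("\\W+")));
        int hits = 0;
        for (String t : topicTerms) {
            if (words.contains(t)) {
                hits++;
            }
        }
        return (double) hits / topicTerms.size();
    }

    /**
     * Queues a URL the first time it is seen. linkScore stands in for the
     * link-structure strategy and is assumed to be normalized to [0, 1].
     */
    public void add(String url, String anchorText, double linkScore) {
        if (!seen.add(url)) {
            return; // already queued or visited
        }
        double score = contentWeight * contentScore(anchorText)
                + (1.0 - contentWeight) * linkScore;
        queue.add(new ScoredUrl(url, score));
    }

    /** Returns the most promising URL, or null when the frontier is empty. */
    public String next() {
        ScoredUrl s = queue.poll();
        return s == null ? null : s.url;
    }

    public static void main(String[] args) {
        FrontierSketch f = new FrontierSketch(0.7, "crawler", "search");
        f.add("http://example.com/popular", "celebrity news", 0.9);
        f.add("http://example.com/on-topic", "focused crawler search strategy", 0.2);
        System.out.println(f.next());
    }
}
```

With contentWeight set to 0 the frontier follows link importance alone and would visit the popular but off-topic page first, which is the topic-drift behavior described above; with a weight of 1 it ignores link structure entirely.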

File list





ThemeCrawler\.classpath

............\.project

............\.settings\org.eclipse.jdt.core.prefs

............\bin\gui\CrawlerFrame.class

............\...\search\Crawler$1.class

............\...\......\Crawler$Task.class

............\...\......\Crawler.class

............\...\......\Download.class

............\...\......\HttpConstants.class

............\...\......\PriorityURL.class

............\...\......\RegularTest.class

............\src\gui\CrawlerFrame.java

............\...\search\Crawler.java

............\...\......\Download.java

............\...\......\HttpConstants.java

............\...\......\PriorityURL.java

............\...\......\RegularTest.java

............\substance.jar

............\bin\gui

............\...\search

............\src\gui

............\...\search

............\.settings

............\bin

............\src

ThemeCrawler


源码中国 (Source Code China) www.ymcn.org