Search
Search keyword: content
Total 5 results found.

Results 1 - 5 of 5
1. HBase-Writer 0.19.1 Released
(Weblog/General)
...set to "true" and duplicate URL records were in the HBase table, Heritrix would not download the content. Which is fine, except then you can't crawl any new records because you have to ...
Monday, 16 February 2009

2. HBase-Writer 0.19.0 Released
(Weblog/General)
... This boolean option is set to "false" by default and will crawl and write all URLs and their content to the given HBase table (as expected). But by setting this to "true", you ...
Wednesday, 11 February 2009

3. HBase-Writer 0.18.2 Released
(Weblog/General)
HBase-Writer 0.18.2 has been released. This release adds support for a maximum content size; the default maximum is 20 MB. Any crawled content item larger than 20 MB will be rejected by ...
Wednesday, 03 December 2008

4. HBase-Writer 0.18.1 Released
(Weblog/General)
...is ready to write the crawled result of individual URLs from Heritrix2, including HTTP headers and the URL content, into a given HBase table. The row key is the URL itself, and the content and h...
Tuesday, 28 October 2008

5. 404
(Static Content)
404: Not Found. Sorry, but the content you requested could not be found.
Thursday, 11 November 2004

All Content and Images Copyright © opensourcemasters.com 2007 - 2010