spacer.png, 0 kB

Login Form






Lost Password?
No account yet? Register

Tag Cloud

0190   added   available   browse   client   content   crawl   customer   data   different   download   firewall   hadoop   hbase   heritrix   heritrix2   https   jira   jxse   jxta   mining   mule   network   networks   now    peers   plugin   processor   records   release   released   server   svn   thanks   trunk   url   version   web   writer   zookeeper   2008  


spacer.png, 0 kB
HBase-Writer 0.18.2 Released PDF Print E-mail
Written by Ryan Smith   
Wednesday, 03 December 2008

HBase-Writer 0.18.2 has been released.  This release contains support for max content size, default max size is 20 MB.  Any content item crawled that is bigger than 20MB will be rejected by the writer.  This release also contains a bug fix;  If HBase throws an exception, the writer wasnt being added back to the Heritrix writerpool.  The writer is now being added back.  Thanks to Andrew Purtell at Apache for these patches. 


HBase-Writer is a processor plugin following the Heritrix2 processor API.  With HBase-Writer, you can have Heritrix2 crawl and save its results directly to a table in HBase.  The HBase-Writer plugin was based off the Heritrix-HDFS-Writer plugin.  Thanks to Questio for the support in releasing this project.

 
< Prev   Next >
spacer.png, 0 kB
spacer.png, 0 kB
 
download components joomla modules free joomla templates
All Content and Images Copyright © opensourcemasters.com 2007 - 2010