solr - Nutch (2.2.1) Inject Urls Hangs -
i'm running ubuntu 14.04, i'm tying basic nutch web crawl running no avail. following this tutorial set following building blocks:
- ubuntu 14.04
- hbase 0.90.4
- nutch 2.2.1
- solr 4.3.1
i confirm both hbase , solr running, populate urls/seed.txt file. when call;
bin/nutch inject urls i'm presented following output , seems nutch hangs.
injectorjob: starting @ 2014-06-09 23:38:49 injectorjob: injecting urldir: urls/seed.txt this stackoverflow question seems similar mine, not behind proxy answer not applicable.
any in resolving issue appreciated.
ubuntu defaults loopback ip address in hosts 127.0.1.1. hbase (according this page) requires loopback ip address 127.0.0.1.
the ubuntu /etc/hosts file default contains (with mycomputername being computer name):
127.0.0.1 localhost 127.0.1.1 mycomputername use sudo gedit /etc/hosts update hosts file follow:
127.0.0.1 localhost 127.0.0.1 mycomputername reboot ubuntu. nutch should no longer have trouble injecting urls hbase.
Comments
Post a Comment