Nutch is a well matured, production ready Web crawler. Nutch enables fine grained configuration, relying on Apache Hadoop™ data structures, which are great for batch processing.
2 years ago
4 Answers
3 Answers
3 years ago
2 Answers
39 Questions - 0 Points
42 Questions - 0 Points
43 Questions - 0 Points
41 Questions - 0 Points
45 Questions - 0 Points