2 Repositories
Java nutch Libraries
Apache Nutch is an extensible and scalable web crawler
Apache Nutch README For the latest information about Nutch, please visit our website at: https://nutch.apache.org/ and our wiki, at: https://cwiki.apa
Dec 31, 2022
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Sparkler A web crawler is a bot program that fetches resources from the web for the sake of building applications like search engines, knowledge bases
Dec 17, 2022