2 Repositories
Java scraping Libraries
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
jsoup: Java HTML Parser jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting a
Jan 4, 2023
A scalable web crawler framework for Java.
Readme in Chinese A scalable crawler framework. It covers the whole lifecycle of crawler: downloading, url management, content extraction and persiste
Jan 5, 2023