Package | Description |
---|---|
com.github.peterbencze.serritor.api | |
com.github.peterbencze.serritor.api.event | |
com.github.peterbencze.serritor.internal | |
Modifier and Type | Method and Description |
---|---|
CrawlRequest | CrawlRequest.CrawlRequestBuilder.build(): Builds the configured CrawlRequest instance. |
static CrawlRequest | CrawlRequest.createDefault(String requestUrl): Creates a crawl request with the default configuration. |
static CrawlRequest | CrawlRequest.createDefault(URI requestUrl): Creates a crawl request with the default configuration. |
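
The relationship between the two `createDefault` factories and the builder's `build()` method can be sketched as below. This is a self-contained illustration, not the library's source: `SketchCrawlRequest`, `SketchCrawlRequestBuilder`, the builder constructor, `setPriority`, and the `priority` option with its default of 0 are all hypothetical stand-ins, since the listing above documents only the method signatures.

```java
import java.net.URI;

// Stand-in for illustration only; it mirrors the listed signatures but is not the library class.
final class SketchCrawlRequest {
    private final URI requestUrl;
    private final int priority; // hypothetical option, not taken from the listing

    private SketchCrawlRequest(SketchCrawlRequestBuilder builder) {
        this.requestUrl = builder.requestUrl;
        this.priority = builder.priority;
    }

    // Both overloads go through the builder without changing any option.
    static SketchCrawlRequest createDefault(URI requestUrl) {
        return new SketchCrawlRequestBuilder(requestUrl).build();
    }

    static SketchCrawlRequest createDefault(String requestUrl) {
        return createDefault(URI.create(requestUrl));
    }

    URI getRequestUrl() { return requestUrl; }

    int getPriority() { return priority; }

    static final class SketchCrawlRequestBuilder {
        private final URI requestUrl;
        private int priority = 0; // assumed default value

        SketchCrawlRequestBuilder(URI requestUrl) { this.requestUrl = requestUrl; }

        SketchCrawlRequestBuilder setPriority(int priority) {
            this.priority = priority;
            return this;
        }

        // Builds the configured request instance.
        SketchCrawlRequest build() { return new SketchCrawlRequest(this); }
    }
}

class CrawlRequestSketchDemo {
    public static void main(String[] args) {
        SketchCrawlRequest byDefault = SketchCrawlRequest.createDefault("https://example.com/");
        SketchCrawlRequest configured =
                new SketchCrawlRequest.SketchCrawlRequestBuilder(URI.create("https://example.com/"))
                        .setPriority(5)
                        .build();
        System.out.println(byDefault.getRequestUrl() + " priority=" + byDefault.getPriority());
        System.out.println(configured.getRequestUrl() + " priority=" + configured.getPriority());
    }
}
```

In this sketch the `String` overload simply converts to a `URI` and delegates, so both factories yield requests with identical options; whether the library does the same is an assumption.
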
Modifier and Type | Method and Description |
---|---|
Set<CrawlRequest> | CrawlerConfiguration.getCrawlSeeds(): Returns the set of crawl seeds. |
Modifier and Type | Method and Description |
---|---|
CrawlerConfiguration.CrawlerConfigurationBuilder | CrawlerConfiguration.CrawlerConfigurationBuilder.addCrawlSeed(CrawlRequest request): Appends a crawl request to the set of crawl seeds. |
protected void | Crawler.crawl(CrawlRequest request): Feeds a crawl request to the crawler. |
Modifier and Type | Method and Description |
---|---|
CrawlerConfiguration.CrawlerConfigurationBuilder | CrawlerConfiguration.CrawlerConfigurationBuilder.addCrawlSeeds(List<CrawlRequest> requests): Appends a list of crawl requests to the set of crawl seeds. |
protected void | Crawler.crawl(List<CrawlRequest> requests): Feeds multiple crawl requests to the crawler. |
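
How `addCrawlSeed`, `addCrawlSeeds`, and `getCrawlSeeds` might fit together is sketched below with stand-in classes. `SeedRequest`, `SketchConfiguration`, and `SketchConfigurationBuilder` are hypothetical names, and the choices of an insertion-ordered set and an unmodifiable result are assumptions made for the example; the listing above confirms only the signatures and that seeds are held in a set.

```java
import java.net.URI;
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Stand-in request type for illustration; not the library's CrawlRequest.
final class SeedRequest {
    final URI url;

    SeedRequest(String url) { this.url = URI.create(url); }
}

// Stand-in configuration that mirrors the listed seed-related signatures.
final class SketchConfiguration {
    private final Set<SeedRequest> crawlSeeds;

    private SketchConfiguration(Set<SeedRequest> crawlSeeds) {
        // Copy so later builder changes do not leak in; read-only view is an assumption.
        this.crawlSeeds = Collections.unmodifiableSet(new LinkedHashSet<>(crawlSeeds));
    }

    // Returns the set of crawl seeds.
    Set<SeedRequest> getCrawlSeeds() { return crawlSeeds; }

    static final class SketchConfigurationBuilder {
        private final Set<SeedRequest> crawlSeeds = new LinkedHashSet<>();

        // Appends one request; returning the builder allows chaining.
        SketchConfigurationBuilder addCrawlSeed(SeedRequest request) {
            crawlSeeds.add(request);
            return this;
        }

        // Appends every request in the list by reusing the single-seed method.
        SketchConfigurationBuilder addCrawlSeeds(List<SeedRequest> requests) {
            requests.forEach(this::addCrawlSeed);
            return this;
        }

        SketchConfiguration build() { return new SketchConfiguration(crawlSeeds); }
    }
}

class CrawlSeedsSketchDemo {
    public static void main(String[] args) {
        SketchConfiguration config = new SketchConfiguration.SketchConfigurationBuilder()
                .addCrawlSeed(new SeedRequest("https://example.com/"))
                .addCrawlSeeds(Arrays.asList(
                        new SeedRequest("https://example.com/a"),
                        new SeedRequest("https://example.com/b")))
                .build();
        for (SeedRequest seed : config.getCrawlSeeds()) {
            System.out.println(seed.url);
        }
    }
}
```
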
Constructor and Description |
---|
CrawlCandidateBuilder(CrawlRequest request): Creates a CrawlCandidate.CrawlCandidateBuilder instance. |
Modifier and Type | Method and Description |
---|---|
CrawlRequest | RequestRedirectEvent.getRedirectedCrawlRequest(): Returns the crawl request for the redirected URL. |
Constructor and Description |
---|
RequestRedirectEvent(CrawlCandidate crawlCandidate, PartialCrawlResponse partialCrawlResponse, CrawlRequest redirectedCrawlRequest): Creates a RequestRedirectEvent instance. |
Modifier and Type | Method and Description |
---|---|
void | CrawlFrontier.feedRequest(CrawlRequest request, boolean isCrawlSeed): Feeds a crawl request to the frontier. |
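
The listing documents only the signature of `feedRequest`, so the following is a rough sketch of what a frontier that accepts a request plus an `isCrawlSeed` flag could look like, not a description of the library's internals. `SketchFrontier`, `Candidate`, `getNextCandidate`, and `size` are hypothetical names; the FIFO ordering and the skipping of already-seen URLs are assumptions borrowed from typical crawl frontiers, and a plain `URI` stands in for the request type.

```java
import java.net.URI;
import java.util.ArrayDeque;
import java.util.HashSet;
import java.util.Queue;
import java.util.Set;

// Stand-in frontier for illustration; behaviour beyond the signature is assumed.
final class SketchFrontier {

    // Queued entry: the URL plus whether it was fed as a crawl seed.
    static final class Candidate {
        final URI url;
        final boolean crawlSeed;

        Candidate(URI url, boolean crawlSeed) {
            this.url = url;
            this.crawlSeed = crawlSeed;
        }
    }

    private final Queue<Candidate> queue = new ArrayDeque<>();
    private final Set<URI> seen = new HashSet<>();

    // Feeds a request to the frontier; URLs fed before are silently skipped (assumption).
    void feedRequest(URI requestUrl, boolean isCrawlSeed) {
        if (seen.add(requestUrl)) {
            queue.add(new Candidate(requestUrl, isCrawlSeed));
        }
    }

    // Returns the oldest queued candidate, or null when the queue is empty.
    Candidate getNextCandidate() { return queue.poll(); }

    int size() { return queue.size(); }
}

class FrontierSketchDemo {
    public static void main(String[] args) {
        SketchFrontier frontier = new SketchFrontier();
        frontier.feedRequest(URI.create("https://example.com/"), true);
        frontier.feedRequest(URI.create("https://example.com/about"), false);
        frontier.feedRequest(URI.create("https://example.com/"), false); // already seen

        SketchFrontier.Candidate next;
        while ((next = frontier.getNextCandidate()) != null) {
            System.out.println(next.url + " seed=" + next.crawlSeed);
        }
    }
}
```
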
Copyright © 2020. All rights reserved.