You can use the `priority` parameter to control which pages FetchFox visits during a crawl, and in which order it visits them.
The `priority` parameter has four fields, each of which can be a list of URL patterns.
`only` can be used to whitelist certain URL patterns. If this field is defined, FetchFox will only visit URLs that match at least one of its patterns.
`skip` can be used to blacklist certain URL patterns. If this field is defined, URLs matching any of its patterns will be skipped, even if they also match a pattern in another field of the `priority` definition.
`high` can be used to mark certain URL patterns as high priority. If this field is defined, FetchFox will prioritize visiting URLs that match any of its patterns.
`low` can be used to mark certain URL patterns as low priority. If this field is defined, FetchFox will deprioritize visiting URLs that match any of its patterns.
You can combine all four fields in a single `priority` definition to express all of these preferences at once.
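As an illustration, a `priority` definition that uses all four fields might look like the sketch below. The field names come from the description above. The URLs, the `*` wildcard syntax, and the way the definition is wrapped in a request body are assumptions made for the example, not confirmed FetchFox behavior.

```python
import json

# Hypothetical patterns for an imaginary shop site.
# The `*` wildcard syntax is an assumption; check the pattern syntax FetchFox accepts.
priority = {
    "only": ["https://example.com/products/*"],           # visit nothing outside /products/
    "skip": ["https://example.com/products/*/reviews*"],  # never visit review pages
    "high": ["https://example.com/products/featured/*"],  # visit featured products first
    "low": ["https://example.com/products/archive/*"],    # visit archived products last
}

# Print the definition as JSON, the shape it would likely take in a request body.
print(json.dumps({"priority": priority}, indent=2))
```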
`priority` controls which pages FetchFox visits. The parameter does not affect which URLs are found in `results.hits`.
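To make the rules concrete, the following sketch mimics them locally. It is not FetchFox's implementation. It assumes shell-style `*` wildcards (via Python's `fnmatch`), treats an empty `only` list as undefined, and checks `high` before `low` when a URL matches both; none of these details are specified above. The helper names `classify` and `visit_order` and the `"normal"` class are hypothetical.

```python
from fnmatch import fnmatchcase


def matches_any(url, patterns):
    """True if the URL matches at least one wildcard pattern."""
    return any(fnmatchcase(url, pattern) for pattern in patterns)


def classify(url, priority):
    """Return 'skip', 'high', 'low', or 'normal' for a URL."""
    # `skip` wins over every other field.
    if matches_any(url, priority.get("skip", [])):
        return "skip"
    # If `only` is defined, URLs outside it are never visited.
    only = priority.get("only")
    if only and not matches_any(url, only):
        return "skip"
    # Assumption: `high` is checked before `low`.
    if matches_any(url, priority.get("high", [])):
        return "high"
    if matches_any(url, priority.get("low", [])):
        return "low"
    return "normal"


def visit_order(urls, priority):
    """Drop skipped URLs, then order the rest: high, normal, low."""
    rank = {"high": 0, "normal": 1, "low": 2}
    kept = [(url, classify(url, priority)) for url in urls]
    kept = [(url, cls) for url, cls in kept if cls != "skip"]
    # sorted() is stable, so discovery order is kept within each class.
    return [url for url, cls in sorted(kept, key=lambda item: rank[item[1]])]


# Hypothetical definition and discovered URLs, for illustration only.
priority = {
    "only": ["https://example.com/products/*"],
    "skip": ["https://example.com/products/*/reviews*"],
    "high": ["https://example.com/products/featured/*"],
    "low": ["https://example.com/products/archive/*"],
}
urls = [
    "https://example.com/products/archive/old-lamp",
    "https://example.com/about",
    "https://example.com/products/desk",
    "https://example.com/products/desk/reviews",
    "https://example.com/products/featured/chair",
]

for url in visit_order(urls, priority):
    print(classify(url, priority), url)
```

In this sketch the `about` page is dropped because it falls outside `only`, the reviews page is dropped by `skip` even though it matches `only`, and the remaining three URLs are ordered featured, regular, archived.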