Description
Add a facility to configure default or fixed fetchInterval values by MIME type. This conserves resources by matching the fetch interval to how often files of a given type are known to change, from frequently to never.
- simple configuration file with one key\tvalue pair per line
- only set fetchInterval for new documents
- keep the maximum fetchInterval fixed by the current config
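The configuration file format described above could be read roughly as follows. This is a minimal sketch, not the patch attached to this issue: the class name MimeFetchIntervals, the methods load and intervalFor, the "#" comment convention, and the assumption that values are whole seconds are all hypothetical. Malformed lines are skipped instead of failing the load.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

/** Hypothetical helper: maps MIME types to fetch intervals (assumed to be seconds). */
public class MimeFetchIntervals {
    private final Map<String, Integer> intervals = new HashMap<>();

    /** Reads "mimeType<TAB>interval" lines; blank, "#" and malformed lines are skipped. */
    public static MimeFetchIntervals load(Reader in) throws IOException {
        MimeFetchIntervals result = new MimeFetchIntervals();
        BufferedReader reader = new BufferedReader(in);
        String line;
        while ((line = reader.readLine()) != null) {
            line = line.trim();
            if (line.isEmpty() || line.startsWith("#")) {
                continue; // assumption: "#" starts a comment line
            }
            String[] parts = line.split("\t", 2);
            if (parts.length != 2) {
                continue; // no tab separator, so not a key/value pair
            }
            try {
                String mimeType = parts[0].trim().toLowerCase(Locale.ROOT);
                result.intervals.put(mimeType, Integer.parseInt(parts[1].trim()));
            } catch (NumberFormatException e) {
                // value is not an integer: ignore this entry
            }
        }
        return result;
    }

    /** Returns the configured interval for the MIME type, or defaultInterval if none is set. */
    public int intervalFor(String mimeType, int defaultInterval) {
        if (mimeType == null) {
            return defaultInterval;
        }
        Integer value = intervals.get(mimeType.toLowerCase(Locale.ROOT));
        return value != null ? value : defaultInterval;
    }

    public static void main(String[] args) throws IOException {
        String config = "# mime type\tinterval\ntext/html\t86400\napplication/pdf\t2592000\n";
        MimeFetchIntervals cfg = load(new StringReader(config));
        System.out.println(cfg.intervalFor("text/html", 604800));
        System.out.println(cfg.intervalFor("image/png", 604800));
    }
}
```

A caller would apply intervalFor only when a document is first added, passing the crawl-wide default as the fallback, so that existing entries keep whatever interval they already have.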
Attachments
Issue Links
- depends upon NUTCH-779 Mechanism for passing metadata from parse to crawldb (Closed)