| author | Philipp Tanlak <philipp.tanlak@gmail.com> | 2023-11-13 22:36:15 +0100 |
|---|---|---|
| committer | GitHub <noreply@github.com> | 2023-11-13 22:36:15 +0100 |
| commit | 190056ee8d6a4eca61d92a79cc25aad645e69d4a (patch) | |
| tree | 423cb3dfb7ca92e4c1c48c1070f553bbadc4d890 /docs/configuration/url-filter.md | |
| parent | eae10426cd805ecc0a0459b61639e48e6cd913ad (diff) | |
Move docs to flyscrape.com (#11)
Diffstat (limited to 'docs/configuration/url-filter.md')
| -rw-r--r-- | docs/configuration/url-filter.md | 42 |
1 files changed, 0 insertions, 42 deletions
diff --git a/docs/configuration/url-filter.md b/docs/configuration/url-filter.md
deleted file mode 100644
index e2feda8..0000000
--- a/docs/configuration/url-filter.md
+++ /dev/null
@@ -1,42 +0,0 @@
-# URL Filter
-
-The `allowedURLs` and `blockedURLs` config options allow you to specify a list of URL patterns (in form of regular expressions) which are accessible or blocked during scraping.
-
-```javascript
-export const options = {
-  url: "http://example.com/",
-  allowedURLs: ["/articles/.*", "/authors/.*"],
-  blockedURLs: ["/authors/admin"],
-  // ...
-};
-```
-
-### `allowedURLs`
-
-This config option controls which URLs are allowed to be visted during scraping. When no value is provided all URLs are allowed to be visited if not otherwise blocked.
-
-When a list of URL patterns is provided, only URLs matching one or more of these patterns are allowed to be visted.
-
-Example:
-
-```javascript
-export const options = {
-  url: "http://example.com/",
-  allowedURLs: ["/products/"],
-};
-```
-
-### `blockedURLs`
-
-This config option controls which URLs are blocked from being visted during scraping.
-
-When a list of URL patterns is provided, URLs matching one or more of these patterns are blocked from to be visted.
-
-Example:
-
-```javascript
-export const options = {
-  url: "http://example.com/",
-  blockedURLs: ["/restricted"],
-};
-```
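
For readers of this commit who want a concrete picture of the filtering rules in the removed page, here is a minimal sketch in plain JavaScript. It is not flyscrape's implementation: `isURLAllowed` is a hypothetical helper, the patterns are assumed to be unanchored regular expressions tested against the full URL, and a blocked pattern is assumed to take precedence over an allowed one (as the `/authors/admin` example in the removed page suggests).

```javascript
// Hypothetical helper, not part of flyscrape: decides whether a URL may be visited.
function isURLAllowed(url, { allowedURLs = [], blockedURLs = [] } = {}) {
  // True when at least one pattern (treated as an unanchored regex) matches the URL.
  const matchesAny = (patterns) =>
    patterns.some((pattern) => new RegExp(pattern).test(url));

  // Assumption: a matching blocked pattern overrides any allowed pattern.
  if (matchesAny(blockedURLs)) return false;

  // From the removed docs: with no allowedURLs, every URL that is not blocked is allowed.
  if (allowedURLs.length === 0) return true;

  // Otherwise the URL must match one or more of the allowed patterns.
  return matchesAny(allowedURLs);
}
```

Under these assumptions, a URL under `/authors/` is visited unless it also matches `/authors/admin`, and a URL that matches neither list is skipped once `allowedURLs` is non-empty.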