mirror of https://github.com/internetarchive/brozzler.git (synced 2025-06-12 09:03:20 -04:00)
fix mistake in job-conf.rst
parent 411b3f266a
commit ee8ef23f0c
2 changed files with 30 additions and 30 deletions
@@ -330,7 +330,7 @@ To generate the rule, brozzler canonicalizes the seed URL using the `urlcanon
 removes the query string if any, and finally serializes the result in SSURT
 [1]_ form. For example, a seed URL of
 ``https://www.EXAMPLE.com:443/foo//bar?a=b&c=d#fdiap`` becomes
-``com,example,www,//https:/foo/bar?a=b&c=d``.
+``com,example,www,//https:/foo/bar``.
 
 Brozzler derives its general approach to the seed surt from `heritrix
 <https://github.com/internetarchive/heritrix3>`_, but differs in a few respects.
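
To make the corrected example concrete, here is a minimal Python sketch of the transformation the paragraph describes. It is an illustration only: it uses the standard library rather than urlcanon, the function name ``seed_ssurt_sketch`` is hypothetical, and it covers only the steps visible in the example (lowercasing the host, dropping the default port, collapsing repeated slashes, removing the query string and fragment). Non-default ports and userinfo are rejected rather than guessed at.

```python
import re
from urllib.parse import urlsplit

# Default ports that the example shows being dropped (https:443).
DEFAULT_PORTS = {"http": 80, "https": 443}


def seed_ssurt_sketch(seed_url):
    """Approximate the seed-URL-to-SSURT step from the doc example.

    Hypothetical helper, not brozzler's or urlcanon's implementation.
    """
    parts = urlsplit(seed_url)
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()  # hostname is already lowercased

    # Only the simple case from the example is modeled here.
    if parts.username or parts.password:
        raise ValueError("userinfo is outside the scope of this sketch")
    if parts.port is not None and parts.port != DEFAULT_PORTS.get(scheme):
        raise ValueError("non-default ports are outside the scope of this sketch")

    # Reverse the host labels, each followed by a comma:
    # www.example.com -> com,example,www,
    reversed_host = "".join(label + "," for label in reversed(host.split(".")))

    # Collapse runs of slashes in the path; query and fragment are dropped
    # simply by not including parts.query or parts.fragment in the result.
    path = re.sub(r"/{2,}", "/", parts.path) or "/"

    return "%s//%s:%s" % (reversed_host, scheme, path)


print(seed_ssurt_sketch("https://www.EXAMPLE.com:443/foo//bar?a=b&c=d#fdiap"))
```

Under these assumptions the call prints ``com,example,www,//https:/foo/bar``, which matches the corrected line in the hunk above; the pre-fix text kept ``?a=b&c=d`` even though the same paragraph says the query string is removed.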
setup.py (2 changed lines)
@@ -32,7 +32,7 @@ def find_package_data(package):
 
 setuptools.setup(
 name='brozzler',
-version='1.5.3',
+version='1.5.4',
 description='Distributed web crawling with browsers',
 url='https://github.com/internetarchive/brozzler',
 author='Noah Levitt',