mirror of https://github.com/internetarchive/brozzler.git (synced 2025-06-06 22:29:11 -04:00)
fix mistake in job-conf.rst
parent 411b3f266a
commit ee8ef23f0c
2 changed files with 30 additions and 30 deletions
@@ -330,7 +330,7 @@ To generate the rule, brozzler canonicalizes the seed URL using the `urlcanon
 removes the query string if any, and finally serializes the result in SSURT
 [1]_ form. For example, a seed URL of
 ``https://www.EXAMPLE.com:443/foo//bar?a=b&c=d#fdiap`` becomes
-``com,example,www,//https:/foo/bar?a=b&c=d``.
+``com,example,www,//https:/foo/bar``.
 
 Brozzler derives its general approach to the seed surt from `heritrix
 <https://github.com/internetarchive/heritrix3>`_, but differs in a few respects.
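
The corrected example in the hunk above (canonicalize, drop the query string, serialize as SSURT) can be approximated with the sketch below. It is not brozzler's code and does not use `urlcanon`; it relies only on the Python standard library and covers just the steps visible in the documented example. The function name `approximate_seed_ssurt` and the `DEFAULT_PORTS` table are hypothetical, and non-default ports are deliberately rejected because their SSURT layout is not shown in this diff.

```python
import re
from urllib.parse import urlsplit

# Hypothetical helper table for this sketch: default port per scheme.
DEFAULT_PORTS = {"http": 80, "https": 443}


def approximate_seed_ssurt(seed_url: str) -> str:
    """Rough stand-in for the seed-to-SSURT steps described in job-conf.rst.

    Assumption: only http/https URLs on their default port are handled.
    """
    parts = urlsplit(seed_url)
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()

    # Out of scope for this sketch: ports other than the scheme default.
    if parts.port is not None and DEFAULT_PORTS.get(scheme) != parts.port:
        raise ValueError("non-default ports are not handled by this sketch")

    # Reverse the host labels, each followed by a comma: www.example.com
    # becomes "com,example,www,".
    reversed_host = "".join(label + "," for label in reversed(host.split(".")))

    # Collapse repeated slashes in the path, as in the documented example.
    path = re.sub(r"/{2,}", "/", parts.path) or "/"

    # The query string and fragment are dropped by simply not emitting them.
    return f"{reversed_host}//{scheme}:{path}"


if __name__ == "__main__":
    print(approximate_seed_ssurt(
        "https://www.EXAMPLE.com:443/foo//bar?a=b&c=d#fdiap"))
```

For the seed URL used in the documentation, the sketch yields the corrected value from this commit, without the query string that the old text mistakenly kept.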
setup.py (2 lines changed)
@@ -32,7 +32,7 @@ def find_package_data(package):
 
 setuptools.setup(
         name='brozzler',
-        version='1.5.3',
+        version='1.5.4',
         description='Distributed web crawling with browsers',
         url='https://github.com/internetarchive/brozzler',
         author='Noah Levitt',