mirror of
https://github.com/internetarchive/brozzler.git
synced 2025-05-02 14:46:18 -04:00
if youtube-dl fetches pages or makes HEAD requests, look at the responses to determine if the page is html and therefore needs to be browsed; if it doesn't need to be browsed, check if youtube-dl has already fetched it (GET request to final bounce of redirect chain that returned a 200); if not, simply fetch it
This commit is contained in:
parent
ed0ea24de6
commit
ecb2e44442
2 changed files with 115 additions and 17 deletions
2
setup.py
2
setup.py
|
@ -2,7 +2,7 @@ import setuptools
|
|||
import glob
|
||||
|
||||
setuptools.setup(name='brozzler',
|
||||
version='1.1.dev1',
|
||||
version='1.1.dev2',
|
||||
description='Distributed web crawling with browsers',
|
||||
url='https://github.com/nlevitt/brozzler',
|
||||
author='Noah Levitt',
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue