Commit Graph

  • 7783f92ce2 larger chrome window: 1400,900 Barbara Miller 2023-04-26 14:51:19 -07:00
  • 0d4ed6a8be
    bump version Barbara Miller 2023-03-15 15:55:08 -07:00
  • 4e65c2f046
    Merge pull request #253 from internetarchive/yt-dlp-timeout Barbara Miller 2023-03-15 15:54:19 -07:00
  • 0847d93d9e add socket_timeout opt for yt-dlp yt-dlp-timeout Barbara Miller 2023-03-15 14:15:18 -07:00
  • e4ddb79a25
    Merge efe2f628a2e7a11acafbaa040d3f7c5adc87873a into 6b97a9bfb7501cd3746a88be0654515f42d61a47 Anderson Martínez 2022-11-28 05:16:06 -08:00
  • efe2f628a2 feat: implementing browserless Anderson Martinez 2022-11-28 09:04:38 -04:00
  • 3f8e8dee92
    Merge branch 'internetarchive:master' into master Barbara Miller 2022-10-10 19:21:40 -07:00
  • 6b97a9bfb7 args.skip_browserless browserless Barbara Miller 2022-10-10 10:05:05 -07:00
  • f01f08c440 update cli.py brozzler-page too Barbara Miller 2022-10-05 13:14:06 -07:00
  • cde49e2a10 initial update chrome.py Barbara Miller 2022-10-05 11:48:26 -07:00
  • a97bd3826a initial update cli.py Barbara Miller 2022-10-05 11:47:57 -07:00
  • b965c1fdf6 run ydl after browsing page ydl-second Barbara Miller 2022-09-21 16:17:21 -07:00
  • 03a6b15717
    warcprox>=2.4.31 Barbara Miller 2022-08-19 12:50:34 -07:00
  • ddc808710b Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2022-08-18 11:44:45 -07:00
  • a4195e1a83
    bump version Barbara Miller 2022-08-12 10:41:48 -07:00
  • 50c2b424c2
    Merge pull request #248 from vbanos/stealth2 Barbara Miller 2022-08-12 10:40:34 -07:00
  • 60645f7f37
    bump version Barbara Miller 2022-08-05 15:58:55 -07:00
  • 0b60a2e2f3
    Merge pull request #249 from internetarchive/blocks-shrink Barbara Miller 2022-08-05 15:36:34 -07:00
  • 7edb0f11b0 and decode() blocks-shrink Barbara Miller 2022-08-04 16:04:37 -07:00
  • a5ee78e662 zlib compression Barbara Miller 2022-08-02 19:06:12 -07:00
  • b5b7d9d52b Add more stealth evasions Vangelis Banos 2022-07-29 11:21:08 +00:00
  • 39eb80567d
    bump version Barbara Miller 2022-06-22 16:13:59 -07:00
  • fa59a88a26
    Merge pull request #247 from internetarchive/stealth-too Barbara Miller 2022-06-22 16:13:12 -07:00
  • 218a49e824 stealth for brozzler_worker stealth-too Barbara Miller 2022-06-22 14:14:50 -07:00
  • e2b3fef028 Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2022-06-22 14:02:11 -07:00
  • de8d67e1e7
    bump version Barbara Miller 2022-06-20 13:44:42 -07:00
  • fe0aaa1ff6
    Merge pull request #246 from vbanos/stealth Barbara Miller 2022-06-20 13:43:25 -07:00
  • 7a12925004 Add stealth parameter to avoid antibot systems Vangelis Banos 2022-06-17 10:53:12 +00:00
  • 1af9cbf44b Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2022-06-09 20:18:18 -07:00
  • ddf7cb4cbc
    bump version Barbara Miller 2022-06-09 15:14:21 -07:00
  • f2d70e1e25
    Merge pull request #245 from internetarchive/yt-dlp-log Barbara Miller 2022-06-09 15:12:51 -07:00
  • 5d54823496 Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2022-06-09 10:38:42 -07:00
  • 14466a7fb3 'youtube_dl' logger yt-dlp-log Barbara Miller 2022-06-08 14:30:32 -07:00
  • 1de63f0aea
    Merge pull request #244 from internetarchive/yt-dlp-skip-live Adam Miller 2022-04-27 15:29:07 -07:00
  • 66252e17c3
    Merge pull request #243 from internetarchive/adds-hop-path-support Adam Miller 2022-04-26 12:10:43 -07:00
  • eef8a1c432
    Bump version Adam Miller 2022-04-26 09:55:08 -07:00
  • 05826942a9 Style fix Adam Miller 2022-04-20 22:49:18 +00:00
  • b693b8713f skip live streams yt-dlp-skip-live Barbara Miller 2022-04-03 17:50:27 -07:00
  • dbd1bb52e8 Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2022-04-03 16:44:02 -07:00
  • cd16985724 Refactor of hop referrer passing Adam Miller 2022-03-24 21:38:47 +00:00
  • 70bb544389
    bump version Barbara Miller 2022-03-22 13:59:48 -07:00
  • 7ee6ea50d1
    Merge pull request #242 from internetarchive/yt-dlp-03 Barbara Miller 2022-03-22 10:23:58 -07:00
  • d5e41bf9ef skip vimeo special case Barbara Miller 2022-03-22 10:00:18 -07:00
  • c52b4af608 vimeo/M3u8 handling, better logging Barbara Miller 2022-03-21 20:26:20 -07:00
  • d67a05572d prefer video+audio files, debug postprocessor hook Barbara Miller 2022-03-21 13:28:08 -07:00
  • c5eb02578a Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2022-03-11 16:50:21 -08:00
  • f4a9e77b06 Catching edge cases that were avoiding setting hop path information Adam Miller 2022-03-03 00:15:20 +00:00
  • 7ea7e543a6
    Merge pull request #241 from internetarchive/yt-dlp-too Barbara Miller 2022-02-25 15:26:33 -08:00
  • 25bb65a635 brozzler/ydl.py updates yt-dlp-too Barbara Miller 2022-02-23 22:34:47 -08:00
  • 0305db5e69 yt_dlp, not youtube-dl Barbara Miller 2022-02-23 22:32:00 -08:00
  • 7fb94419d8
    Merge branch 'internetarchive:master' into master Barbara Miller 2022-02-23 20:13:28 -08:00
  • 244dee987a Merge branch 'ytdlp' of github.com:galgeek/brozzler into ytdlp Barbara Miller 2022-02-23 15:42:12 -08:00
  • 5e36d97057 tidy youtube-dl references and assembled video methods Barbara Miller 2022-02-14 15:33:23 -08:00
  • 010c029400 ydl.stitch_ups Barbara Miller 2022-02-11 20:03:54 -08:00
  • 70258e68a7 wip: add ffmpeg post-processing Barbara Miller 2022-02-11 16:08:13 -08:00
  • d61cec399e Merge branch 'master' into adds-hop-path-support Adam Miller 2022-02-09 18:10:37 +00:00
  • f93ffd67c3 verbose logging Barbara Miller 2022-01-25 17:13:34 -08:00
  • ad46ea756d format_sort Barbara Miller 2022-01-25 13:15:25 -08:00
  • 0a5b511c57 minimal yt-dlp updates Barbara Miller 2022-01-19 16:53:46 -08:00
  • 7cf9cd071b minimal yt-dlp updates Barbara Miller 2022-01-19 16:53:46 -08:00
  • c1d9419093 minimal yt-dlp updates yt-dlp Barbara Miller 2022-01-19 16:53:46 -08:00
  • d9ac067e41
    bump version, copyright statment Barbara Miller 2022-01-18 17:45:58 -08:00
  • de199e789e
    Merge pull request #237 from vbanos/disable-breakpad Barbara Miller 2022-01-18 17:43:45 -08:00
  • fdc84fb848 Add chrome options --disable-sync and --disable-breakpad Vangelis Banos 2022-01-18 10:09:39 +00:00
  • 040a942ef2 Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2022-01-03 16:37:40 -08:00
  • 427908e821
    Merge pull request #233 from cclauss/codespell Alex Dempsey 2021-10-12 12:34:37 -07:00
  • a5ed291e65 Fix typos Christian Clauss 2021-10-12 10:19:48 +02:00
  • 51b2474b3c Issue #231 - How does worker pick a site after crash? - Configurable claimed limit as it was hard coded to 60. The nodes in case of crash can come back in fairly quick time. Nitin Mishra 2021-10-11 16:17:23 +01:00
  • 0f72233f3b Adding support for hop path information to be stored and passed along to warcprox Adam Miller 2021-08-31 19:44:55 +00:00
  • c77c245f42
    WT-55 / Delay Facebook requests Karl-Rainer Blumenthal 2021-07-13 15:00:39 -04:00
  • 4f301f4e03
    Merge pull request #225 from internetarchive/wt-376-yt-user-page-fix Barbara Miller 2021-06-08 14:43:42 -07:00
  • c311fbb41f
    bump version, update copyright Barbara Miller 2021-05-25 17:14:21 -07:00
  • b59c4395ed
    Merge pull request #223 from vbanos/fix-AddressValueError Barbara Miller 2021-05-25 17:12:35 -07:00
  • 7aabc5f655 Skip invalid outlink Vangelis Banos 2021-05-23 11:31:47 +00:00
  • eabdeb0238 Added user page extractor type to ytdl monkeypatch wt-376-yt-user-page-fix Pravin Visakan 2021-05-04 16:50:38 -07:00
  • e28236c5f8 vagrant: more helpful error when vagrant-disksize plugin is missing Wolfgang Faust 2021-03-20 21:03:05 -07:00
  • 1164bc68ef ansible: fix deprecated use of apt+with_items Wolfgang Faust 2021-03-20 20:59:43 -07:00
  • e872a83d6b Fix problems with Ansible playbook in vagrant Wolfgang Faust 2021-03-20 20:46:43 -07:00
  • 8dce6a9f0a
    pip install warcprox Christian Clauss 2020-12-14 21:07:15 +01:00
  • 4aadff5fc8
    pip install -e . Christian Clauss 2020-12-14 21:02:06 +01:00
  • 921657e032
    flake8 . --builtins=__qualname__ Christian Clauss 2020-12-14 20:54:18 +01:00
  • be34dac3da
    GItHub Action to lint Python code Christian Clauss 2020-12-14 20:44:40 +01:00
  • 6290692ac4 Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2020-12-02 14:34:30 -08:00
  • 0f27c9995a
    bump version Barbara Miller 2020-10-29 17:12:14 -07:00
  • 5005c619f6
    Merge pull request #211 from internetarchive/galgeek-websocket-url-timeout jkafader 2020-10-29 17:08:48 -07:00
  • 11c5cfa865 add param for Chrome.start galgeek-websocket-url-timeout Barbara Miller 2020-10-21 15:39:46 -07:00
  • 2d380cc8f2 Merge branch 'master' of github.com:internetarchive/brozzler Barbara Miller 2020-10-14 22:06:11 -07:00
  • dc50fe1db2
    Merge pull request #212 from internetarchive/bump-version-to-1.5.23 Barbara Miller 2020-10-13 15:21:18 -07:00
  • 052c3552ca
    bump version after merge bump-version-to-1.5.23 Barbara Miller 2020-10-13 15:19:50 -07:00
  • f2ebdca597
    configurable websocket url timeout, default 60 Barbara Miller 2020-10-13 15:12:32 -07:00
  • bb7594a14d
    Merge pull request #209 from vbanos/outlinks-timeout Barbara Miller 2020-10-13 15:01:55 -07:00
  • 6ed234bb68 Correct paths in vagrant scripts Lauren Ko 2020-10-06 14:57:18 -05:00
  • 09fe10307d Remove "proxy" references Lauren Ko 2020-10-06 14:55:03 -05:00
  • 4372752bd6 Merge branch 'qa' of github.com:internetarchive/brozzler Barbara Miller 2020-10-05 14:37:07 -07:00
  • 8addaf31d5 Add option extract_outlinks_timeout Vangelis Banos 2020-10-04 15:39:30 +00:00
  • 18d3f5f930
    Merge pull request #208 from internetarchive/galgeek-patch-2 Barbara Miller 2020-09-21 18:06:03 -07:00
  • 297eaac6dd
    update travis.yml and test! galgeek-patch-2 Barbara Miller 2020-09-21 17:08:39 -07:00
  • c744bb2f92
    update copyright Barbara Miller 2020-09-01 19:05:21 -07:00
  • 9a3c61b9c7
    Requête Christian Clauss 2020-08-16 00:18:10 +02:00
  • 53e643b919 quotes? Barbara Miller 2020-08-15 09:04:38 -07:00