Commit Graph

  • 23601e2e0a Merge branch 'master' into qa Noah Levitt 2017-02-22 10:57:27 -08:00
  • 3c4ab834da handle errors from extract-outlinks.js, which happens on polyvore.com because it changes the definition of Set 😭 Noah Levitt 2017-02-22 10:57:11 -08:00
  • 675e2f53b8 Merge branch 'master' into qa Noah Levitt 2017-02-16 10:21:01 -08:00
  • 0d0da22613 brozzler-list-jobs --yaml Noah Levitt 2017-02-16 10:20:36 -08:00
  • a0298898be Merge branch 'master' into qa Noah Levitt 2017-02-15 23:21:06 -08:00
  • f02d4ed40e missed this in the last commit Noah Levitt 2017-02-15 23:20:47 -08:00
  • 0ee14a8e6f Merge branch 'master' into qa Noah Levitt 2017-02-15 16:47:01 -08:00
  • b409e49cfa deprecate current scope rule syntax and create new syntax with slightly different semantics (to be documented), and add parent_url_regex scope rule; unit test for scoping Noah Levitt 2017-02-15 16:46:45 -08:00
  • f6fdb91d57 Merge branch 'master' into qa Noah Levitt 2017-02-15 23:13:32 +00:00
  • c0057e591a add --yaml option to brozzler-list-* commands Noah Levitt 2017-02-15 23:13:09 +00:00
  • 1054e8e3cb take screenshot before running behavior (but after login) - thanks danielbicho Noah Levitt 2017-02-15 09:13:44 -08:00
  • 08752a5163 Merge branch 'master' into qa Noah Levitt 2017-02-10 15:19:35 -08:00
  • e58f4b7c44 logging tweaks Noah Levitt 2017-02-10 15:19:28 -08:00
  • f990adfa63 post-merge tidying Barbara Miller 2017-02-05 22:03:28 -08:00
  • 46ee409de8 Merge branch 'simpledo' into qa Barbara Miller 2017-02-05 22:01:07 -08:00
  • d4fdd089a3 finish jinja2 update Barbara Miller 2017-02-05 21:55:11 -08:00
  • 91205e6108 isVisible Barbara Miller 2017-01-13 11:19:24 -08:00
  • 01f5343b48 fix behaviors.yaml, switch simpledo behavior to jinja2 template Noah Levitt 2017-01-06 17:19:00 -08:00
  • 3851daa27d update for new behaviors set up Barbara Miller 2017-01-06 17:12:54 -08:00
  • 7296acdc19 simpledo first draft Barbara Miller 2016-12-10 17:23:01 -08:00
  • 1501d572da isVisible Barbara Miller 2017-01-13 11:19:24 -08:00
  • 98144a2180 fix behaviors.yaml, switch simpledo behavior to jinja2 template Noah Levitt 2017-01-06 17:19:00 -08:00
  • dcc2dae8c1 update for new behaviors set up Barbara Miller 2017-01-06 17:12:54 -08:00
  • c627d7ee84 simpledo first draft Barbara Miller 2016-12-10 17:23:01 -08:00
  • 0ff6257bef Merge branch 'qa' of github.com:internetarchive/brozzler into qa Barbara Miller 2017-02-05 21:49:40 -08:00
  • aa22594928 Merge branch 'master' into qa Noah Levitt 2017-02-03 17:24:53 -08:00
  • 09fa41f959 fix TypeError: not all arguments converted during string formatting Noah Levitt 2017-02-03 17:24:47 -08:00
  • 7402657ae1 Merge branch 'master' into qa Noah Levitt 2017-02-03 16:40:22 -08:00
  • 14e312e4c4 make sure site is not "claimed" when it's finished Noah Levitt 2017-02-03 16:40:15 -08:00
  • 3783e49cb4 Merge branch 'master' into qa Noah Levitt 2017-02-03 14:56:21 -08:00
  • a60878c5a7 support for resuming jobs, keeping track of each start and stop time, used to enforce time limits correctly Noah Levitt 2017-02-03 14:56:12 -08:00
  • 5a0301ac12 let rethinkdb generate job.id if not supplied in configuration Noah Levitt 2017-02-03 14:53:50 -08:00
  • 7f885758ae Merge branch 'master' into qa Noah Levitt 2017-02-02 11:52:28 -08:00
  • 129a1e8f47 use underscore convention Noah Levitt 2017-02-02 11:52:19 -08:00
  • a19700b975 Merge branch 'master' into qa Noah Levitt 2017-02-02 11:43:54 -08:00
  • 5f4c5190da improve TRACE level logging Noah Levitt 2017-02-02 11:41:40 -08:00
  • 87de7750e6 Merge branch 'master' into qa Noah Levitt 2017-02-01 22:39:29 +00:00
  • ed2d58d87d stopgap fix for problem where an attempt to save a screenshot of a url with a hash tag containing spaces or non-ascii characters would fail, causing the whole brozzle of the page to fail, and end up in a retry loop (better handling of hash tags is planned which will obviate this change) Noah Levitt 2017-02-01 22:39:12 +00:00
  • 1874434d60 Merge branch 'master' into qa Noah Levitt 2017-01-31 10:26:48 -08:00
  • 5c684779e5 pywb support for thumbnail: and screenshot: urls Noah Levitt 2017-01-31 10:26:38 -08:00
  • 8f5003b784 fix oops Noah Levitt 2017-01-30 23:47:39 -08:00
  • 6bf8cfe893 Merge branch 'qa' of github.com:internetarchive/brozzler into qa Barbara Miller 2017-01-30 20:52:58 -08:00
  • 8df96ceb59 Merge branch 'master' into qa Noah Levitt 2017-01-30 11:39:37 -08:00
  • 4b6831b464 new flag Page.blocked_by_robots Noah Levitt 2017-01-30 10:43:25 -08:00
  • a8b564f100 be more patient to avoid spurious warnings waiting for browser to start up Noah Levitt 2017-01-24 10:06:37 -08:00
  • 8c116295ea Merge branch 'master' into qa Noah Levitt 2017-01-24 09:56:14 -08:00
  • d22cc075e0 restore ping_timeout argument to WebSocketApp.run_forever to fix problem of leaking websocket receiver threads hanging forever on select() Noah Levitt 2017-01-24 09:55:56 -08:00
  • 5375b819dd missed a spot Noah Levitt 2017-01-20 23:59:31 -08:00
  • c3b637d244 improve brozzler-dashboard logging; fix default wayback baseurl in brozzler dashboard (https://github.com/internetarchive/brozzler/issues/31); tweak arg parsing related stuff Noah Levitt 2017-01-20 23:41:59 -08:00
  • 095456aa27 avoid js errors in case site or job is not configured to keep stats Noah Levitt 2017-01-20 23:36:23 -08:00
  • 65f818e901 add travis-ci slack notification to internetarchive/brozzler channel Noah Levitt 2017-01-16 12:44:12 -08:00
  • 049c2688bf Merge branch 'master' into qa Noah Levitt 2017-01-13 20:27:18 +00:00
  • 037723fe2b support for BROZZLER_RETHINKDB_SERVERS and BROZZLER_RETHINKDB_DB environment variables, honored by all the brozzler-* commands Noah Levitt 2017-01-13 20:27:09 +00:00
  • 58bac8fc83 Merge branch 'master' into qa Noah Levitt 2017-01-13 12:08:09 -08:00
  • 77c4dc1116 adapt to exception message from newer versions of chromium (e.g. 57.0.2981.0) Noah Levitt 2017-01-13 12:08:00 -08:00
  • 87eeaf7888 Merge branch 'master' into qa Noah Levitt 2017-01-13 11:46:52 -08:00
  • 011d814ee2 tests for dismissal of javascript dialogs (alert, prompt, confirm) Noah Levitt 2017-01-13 11:46:42 -08:00
  • c9982e8727 Merge branch 'simpledo' into qa Barbara Miller 2017-01-13 11:19:55 -08:00
  • de55b54d14 isVisible simpledo Barbara Miller 2017-01-13 11:19:24 -08:00
  • d2ed6b97a2 dismiss alerts from the page being browsed (avoids hanging) Noah Levitt 2017-01-13 10:27:37 -08:00
  • 77b34bd5b2 Merge branch 'master' into qa Noah Levitt 2017-01-12 23:23:57 -08:00
  • 766441e65c simpleclicks - only click if element is visible, fixes spinning on moma.org sites Noah Levitt 2017-01-12 23:23:46 -08:00
  • 38d9eee68d implement brozzler-list-pages Noah Levitt 2017-01-12 08:22:45 +00:00
  • 184612332e new cli utils brozzler-list-jobs and brozzler-list-sites Noah Levitt 2017-01-12 07:50:58 +00:00
  • 3a3eccf00e Merge branch 'master' into qa Noah Levitt 2017-01-12 01:26:19 +00:00
  • 64a0ea879a implement sha1 lookup and url prefix lookup for brozzler-list-captures Noah Levitt 2017-01-12 01:26:09 +00:00
  • a99fc2fbe6 Merge branch 'master' into qa Noah Levitt 2017-01-09 15:18:27 -08:00
  • 32097a8f8b catch exceptions parsing funky urls when scoping and extracting outlinks Noah Levitt 2017-01-09 15:18:19 -08:00
  • 755ca045b6 Merge branch 'master' into qa Noah Levitt 2017-01-09 11:40:46 -08:00
  • 2486768830 fix bug where login form would not be detected in some cases when there was a non-login form earlier on the page Noah Levitt 2017-01-09 11:40:30 -08:00
  • 9f2efcbb4c Merge branch 'master' into qa Noah Levitt 2017-01-06 17:57:21 -08:00
  • d0022fe7bf reset browser shutdown flag when starting up Noah Levitt 2017-01-06 17:57:11 -08:00
  • bd5d4a8c97 fix behaviors.yaml, switch simpledo behavior to jinja2 template Noah Levitt 2017-01-06 17:19:00 -08:00
  • 54e6001c49 fix behaviors.yaml, switch simpledo behavior to jinja2 template Noah Levitt 2017-01-06 17:19:00 -08:00
  • 501aec77fc update for new behaviors set up Barbara Miller 2017-01-06 17:12:54 -08:00
  • 281bd538bf Merge branch 'simpledo' into qa Barbara Miller 2017-01-06 16:34:56 -08:00
  • d861888864 simpledo first draft Barbara Miller 2016-12-10 17:23:01 -08:00
  • 66dabbadb3 Merge branch 'simpledo' into qa Barbara Miller 2017-01-06 16:31:18 -08:00
  • 377fab785b simpledo first draft Barbara Miller 2016-12-10 17:23:01 -08:00
  • 4e7f9f8690 Merge branch 'master' into qa Noah Levitt 2017-01-06 13:03:25 -08:00
  • 76b658747e fix oversight including username/password in site config when starting a new job Noah Levitt 2017-01-06 13:03:09 -08:00
  • c2704b18be restore BrozzlerWorker built-in support for managing its own thread Noah Levitt 2017-01-04 14:57:34 -08:00
  • 70b67942a5 restore handling of 420 Reached limit, with a rudimentary test Noah Levitt 2016-12-22 13:44:09 -08:00
  • e5fb6cb4b9 add import missing from test Noah Levitt 2016-12-21 19:19:34 -08:00
  • eabb0fb114 restore support for on_response and on_request, with an automated test for on_response Noah Levitt 2016-12-21 18:35:55 -08:00
  • 3922258c50 convert clickGetPDFs and noguchi to jinja2 Noah Levitt 2016-12-21 18:23:53 -08:00
  • 422a5ad726 Merge branch 'master' into qa Noah Levitt 2016-12-21 18:11:56 -08:00
  • c90c73372e need $DISPLAY set for test_brozzling.py Noah Levitt 2016-12-21 15:15:03 -08:00
  • f7427219cf restore handling of "aw snap" or "he's dead jim" Noah Levitt 2016-12-21 14:21:20 -08:00
  • a5d48a9fdb add seed username/password parameters to job config schema Noah Levitt 2016-12-20 18:06:20 -08:00
  • 86d6060a2d loosen the find_available_port test slightly, since it seems to be not 100% predictable for reasons i haven't investigated Noah Levitt 2016-12-20 17:52:21 -08:00
  • edf0a3a50d convert mouseovers and simpleclicks to jinja2 Noah Levitt 2016-12-20 17:34:29 -08:00
  • e2dbf68ccd remove obsolete facebook login code Noah Levitt 2016-12-20 16:38:11 -08:00
  • a0b61408b9 convert behaviors to jinja2, move them to new subdir js-templates, along with javascript previously stored as a string in browser.py Noah Levitt 2016-12-20 16:33:25 -08:00
  • fd8b25ae79 Merge 1dcbaeda39bdb6488e3927ad2f547a7a3afb4090 into 06fd0a0d793b589672be7e0c9b40c0a3789dc90c Barbara Miller 2016-12-20 19:24:31 +00:00
  • 06fd0a0d79 add hack for submitting a login form containing an element with name or id "submit", which masks the form submit() method Noah Levitt 2016-12-20 11:24:26 -08:00
  • b24b229cb2 how did i miss this file? Noah Levitt 2016-12-20 11:13:48 -08:00
  • 7a40822e64 forgot to git add new test data Noah Levitt 2016-12-19 18:10:07 -08:00
  • 2f8f20bbb4 detect <input type="email"> as potential username field for login Noah Levitt 2016-12-19 18:08:10 -08:00
  • 86ac48d6c3 generalized support for login doing automatic detection of login form on a page Noah Levitt 2016-12-19 17:30:09 -08:00