842 Commits

Author SHA1 Message Date
Noah Levitt
049c2688bf Merge branch 'master' into qa
* master:
  support for BROZZLER_RETHINKDB_SERVERS and BROZZLER_RETHINKDB_DB environment variables, honored by all the brozzler-* commands
2017-01-13 20:27:18 +00:00
Noah Levitt
037723fe2b support for BROZZLER_RETHINKDB_SERVERS and BROZZLER_RETHINKDB_DB environment variables, honored by all the brozzler-* commands 2017-01-13 20:27:09 +00:00
Noah Levitt
58bac8fc83 Merge branch 'master' into qa
* master:
  adapt to exception message from newer versions of chromium (e.g. 57.0.2981.0)
2017-01-13 12:08:09 -08:00
Noah Levitt
77c4dc1116 adapt to exception message from newer versions of chromium (e.g. 57.0.2981.0) 2017-01-13 12:08:00 -08:00
Noah Levitt
87eeaf7888 Merge branch 'master' into qa
* master:
  tests for dismissal of javascript dialogs (alert, prompt, confirm)
  dismiss alerts from the page being browsed (avoids hanging)
2017-01-13 11:46:52 -08:00
Noah Levitt
011d814ee2 tests for dismissal of javascript dialogs (alert, prompt, confirm) 2017-01-13 11:46:42 -08:00
Noah Levitt
d2ed6b97a2 dismiss alerts from the page being browsed (avoids hanging) 2017-01-13 10:27:37 -08:00
Noah Levitt
77b34bd5b2 Merge branch 'master' into qa
* master:
  simpleclicks - only click if element is visible, fixes spinning on moma.org sites
  implement brozzler-list-pages
  new cli utils brozzler-list-jobs and brozzler-list-sites
2017-01-12 23:23:57 -08:00
Noah Levitt
766441e65c simpleclicks - only click if element is visible, fixes spinning on moma.org sites 2017-01-12 23:23:46 -08:00
Noah Levitt
38d9eee68d implement brozzler-list-pages 2017-01-12 08:22:45 +00:00
Noah Levitt
184612332e new cli utils brozzler-list-jobs and brozzler-list-sites 2017-01-12 07:50:58 +00:00
Noah Levitt
3a3eccf00e Merge branch 'master' into qa
* master:
  implement sha1 lookup and url prefix lookup for brozzler-list-captures
2017-01-12 01:26:19 +00:00
Noah Levitt
64a0ea879a implement sha1 lookup and url prefix lookup for brozzler-list-captures 2017-01-12 01:26:09 +00:00
Noah Levitt
a99fc2fbe6 Merge branch 'master' into qa
* master:
  catch exceptions parsing funky urls when scoping and extracting outlinks
2017-01-09 15:18:27 -08:00
Noah Levitt
32097a8f8b catch exceptions parsing funky urls when scoping and extracting outlinks 2017-01-09 15:18:19 -08:00
Noah Levitt
755ca045b6 Merge branch 'master' into qa
* master:
  fix bug where login form would not be detected in some cases when there was a non-login form earlier on the page
2017-01-09 11:40:46 -08:00
Noah Levitt
2486768830 fix bug where login form would not be detected in some cases when there was a non-login form earlier on the page 2017-01-09 11:40:30 -08:00
Noah Levitt
9f2efcbb4c Merge branch 'master' into qa
* master:
  reset browser shutdown flag when starting up
2017-01-06 17:57:21 -08:00
Noah Levitt
d0022fe7bf reset browser shutdown flag when starting up 2017-01-06 17:57:11 -08:00
Noah Levitt
54e6001c49 fix behaviors.yaml, switch simpledo behavior to jinja2 template 2017-01-06 17:19:00 -08:00
Barbara Miller
281bd538bf Merge branch 'simpledo' into qa 2017-01-06 16:34:56 -08:00
Barbara Miller
d861888864 simpledo first draft 2017-01-06 16:33:04 -08:00
Barbara Miller
66dabbadb3 Merge branch 'simpledo' into qa 2017-01-06 16:31:18 -08:00
Barbara Miller
377fab785b simpledo first draft 2017-01-06 16:28:59 -08:00
Noah Levitt
4e7f9f8690 Merge branch 'master' into qa
* master:
  fix oversight including username/password in site config when starting a new job
  restore BrozzlerWorker built-in support for managing its own thread
  restore handling of 420 Reached limit, with a rudimentary test
  add import missing from test
  restore support for on_response and on_request, with an automated test for on_response
2017-01-06 13:03:25 -08:00
Noah Levitt
76b658747e fix oversight including username/password in site config when starting a new job 2017-01-06 13:03:09 -08:00
Noah Levitt
c2704b18be restore BrozzlerWorker built-in support for managing its own thread 2017-01-04 14:57:34 -08:00
Noah Levitt
70b67942a5 restore handling of 420 Reached limit, with a rudimentary test 2016-12-22 13:44:09 -08:00
Noah Levitt
e5fb6cb4b9 add import missing from test 2016-12-21 19:19:34 -08:00
Noah Levitt
eabb0fb114 restore support for on_response and on_request, with an automated test for on_response 2016-12-21 18:35:55 -08:00
Noah Levitt
3922258c50 convert clickGetPDFs and noguchi to jinja2 2016-12-21 18:23:53 -08:00
Noah Levitt
422a5ad726 Merge branch 'master' into qa
* master:
  need $DISPLAY set for test_brozzling.py
  restore handling of "aw snap" or "he's dead jim"
  add seed username/password parameters to job config schema
  loosen the find_available_port test slightly, since it seems to be not 100% predictable for reasons i haven't investigated
  convert mouseovers and simpleclicks to jinja2
  remove obsolete facebook login code
  convert behaviors to jinja2, move them to new subdir js-templates, along with javascript previously stored as a string in browser.py
  add hack for submitting a login form containing an element with name or id "submit", which masks the form submit() method
  how did i miss this file?
  forgot to git add new test data
  detect <input type="email"> as potential username field for login
  generalized support for login doing automatic detection of login form on a page
  yet more refactoring of browser.py, clearer separation of purpose, Browser class manages browsing, sends most of the messages to chrome, WebsockReceiverThread handles messages that come back from chrome
  bump version number in setup.py
  major refactoring of browsing code to make it easier to add functionality
  back to dev version number
  i dub thee 1.1b8
2016-12-21 18:11:56 -08:00
Noah Levitt
c90c73372e need $DISPLAY set for test_brozzling.py 2016-12-21 15:15:03 -08:00
Noah Levitt
f7427219cf restore handling of "aw snap" or "he's dead jim" 2016-12-21 14:21:20 -08:00
Noah Levitt
a5d48a9fdb add seed username/password parameters to job config schema 2016-12-20 18:06:20 -08:00
Noah Levitt
86d6060a2d loosen the find_available_port test slightly, since it seems to be not 100% predictable for reasons i haven't investigated 2016-12-20 17:52:21 -08:00
Noah Levitt
edf0a3a50d convert mouseovers and simpleclicks to jinja2 2016-12-20 17:34:29 -08:00
Noah Levitt
e2dbf68ccd remove obsolete facebook login code 2016-12-20 16:38:11 -08:00
Noah Levitt
a0b61408b9 convert behaviors to jinja2, move them to new subdir js-templates, along with javascript previously stored as a string in browser.py 2016-12-20 16:33:25 -08:00
Noah Levitt
06fd0a0d79 add hack for submitting a login form containing an element with name or id "submit", which masks the form submit() method 2016-12-20 11:24:26 -08:00
Noah Levitt
b24b229cb2 how did i miss this file? 2016-12-20 11:13:48 -08:00
Noah Levitt
7a40822e64 forgot to git add new test data 2016-12-19 18:10:07 -08:00
Noah Levitt
2f8f20bbb4 detect <input type="email"> as potential username field for login 2016-12-19 18:08:10 -08:00
Noah Levitt
86ac48d6c3 generalized support for login doing automatic detection of login form on a page 2016-12-19 17:30:09 -08:00
Noah Levitt
bc6e0d243f yet more refactoring of browser.py, clearer separation of purpose, Browser class manages browsing, sends most of the messages to chrome, WebsockReceiverThread handles messages that come back from chrome 2016-12-16 13:52:12 -08:00
Noah Levitt
534d2e63d6 bump version number in setup.py 2016-12-15 16:43:27 -08:00
Noah Levitt
c71854127d major refactoring of browsing code to make it easier to add functionality 2016-12-15 16:42:45 -08:00
Noah Levitt
ef8bc83928 Merge branch 'refactor-browsing' into qa
* refactor-browsing:
  don't log every little message from chrome
2016-12-15 13:21:38 -08:00
Noah Levitt
cb6a00f4f0 don't log every little message from chrome 2016-12-15 13:21:30 -08:00
Noah Levitt
f6333df6ef back to dev version number 2016-12-15 12:34:26 -08:00