brozzler
========
"browser" ^ "crawler" = "brozzler"

Brozzler is a distributed web crawler that uses a real browser (Chrome or
Chromium) to fetch pages and embedded URLs and to extract links.

It is forked from https://github.com/internetarchive/umbra.

License
-------

Copyright 2015 Internet Archive

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this software except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.