Pywb

Latest version: v2.8.3

Safety actively analyzes 630217 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 11 of 16

0.10.0

Not secure
~~~~~~~~~~~~~~~~~~~~~~

* Per-collection cacheing settings: ``rewrite_opts.http_cache`` can be set to:

- ``pass`` - keep cacheing headers as-is (applies to ``Cache-Control``, ``Expires``, ``Etag`` and ``Last-Modified``)
- ``0`` - add ``Cache-Control: no-cache; no-store``
- ``N`` - add ``Cache-Control: max-age=N`` and corresponding ``Expires`` header
- None (default) -- Rewrite cache headers, effectively removing them (current behavior)

* New improved Wombat, including:

- better handling of new iframes set to ``about:blank``, add all overrides
- createElement() override (can be disabled)
- innerHTML prototype override (can be disabled)

* Rules: Improved rewriting for Google+, Twitter, YT comments

* Video: Improved support for LiveStream playlist, detect newly added <object> and <embed> videos (with mutation observers)

* Indexing: Add contents of ``WARC-Json-Metadata`` to ``metadata`` field in cdx-json

* Buffering: Only buffer when content-length is missing and only up-to first 16K

* ZipNum: Fix bug with contents of last block being inaccessible, improved test coverage for zipnum.

0.9.8

Not secure
~~~~~~~~~~~~~~~~~~~~~

* auto config: allow custom settings set in shared ``config.yaml`` to be used with automatic collections.

* wombat fixes: fixes situation where setAttribute was not being rewritten.

* wombat fixes: obey ``_no_rewrite==true`` more consistently in rewrite_elem

* wombat fixes: remove incorrect timezone offset in Date override.

* wombat: new 'node added' mutation observer which will rewrite any newly added elements, may simplify other
rewriting cases. Not enabled by default yet requires setting ``client.use_node_observers`` to use.

* regex rewrite: tweak ``top`` and scheme relative regexes to better avoid false positives

* html rewrite: handle ``parse_comments`` by rewriting as html, instead of as javascript.

* html rewrite: if html content has no <head> tags and no body tags, insert head_insert at end of document.

* html rewrite: don't insert banner in ajax requests, wombat always adds ``X-Requested-With: XMLHttpRequest``.

* scheme relative urls: rewrite to current scheme, if known, otherwise keep scheme relative, instead of defaulting to http.

0.9.7

Not secure
~~~~~~~~~~~~~~~~~~~~~

* wombat enchancements: support for mutation observers instead of ``setAttribute`` override with ``client.use_attr_observers`` setting.
Can also disable worker override with ``skip_disable_worker``

* wombat fixes: Better check for self-redirect when proxying ``replace()`` and ``assign()``, use ``querySelectorAll()`` for dom selection

* wombat fixes: Don't remove trailing slash in ``extract_orig()``, treat slash and no-slash urls as distinct on the client (as expected).

* cdx-indexer: Validation of HTTP protocol and request verbs now optional. Any protocol and verb will be accepted, unless ``-v`` flag is used,
allowing for indexing of content with custom verbs, unexpected protocol, etc...

0.9.6

Not secure
~~~~~~~~~~~~~~~~~~~~~

* framed replay: fix bug where outer frame url was not updated (in inverse mode) after navigating inner frame.

* framed replay: lookup frame by id, ``replay_iframe``, instead of by using ``window.frames[0]`` to allow for more customization.

* fix typo in wombat ``no_rewrite_prefixes``

0.9.5

Not secure
~~~~~~~~~~~~~~~~~~~~~

* s3 loading: support ``s3://`` scheme in block loader, allowing for loading index and archive files from s3. ``boto`` library must be installed separately
via ``pip install boto``. Attempt default boto auth path, and if that fails, attempt anonymous s3 connection.

* Wombat/Client-Side Rewrite Customizations: New ``rewrite_opts.client`` settings from ``config.yaml`` are passed directly to wombat as json.

Allows for customizing wombat as needed. Currently supported options are: ``no_rewrite_prefixes`` for ignoring rewrite
on certain domains, and ``skip_dom``, ``skip_setAttribute`` and ``skip_postmessage`` options for disabling
those overrides. Example usage in config:

::

rewrite_opts:
...
client:
no_rewrite_prefixes: ['http://dont-rewrite-this.example.com/']

skip_setAttribute: true
skip_dom: true
skip_postmessage: true


* Revamp template setup: All templates now use shared env, which is created on first use or can be explicitly set (if embedding)
via ``J2TemplateView.init_shared_env()`` call. Support for specifiying a base env, as well as custom template lookup paths also provided

* Template lookup paths can also be set via config options ``templates_dirs``. The default list is: ``templates``, ``.``, ``/`` in that order.

* Embedding improvements: move custom env (``REL_REQUEST_URI`` setup) into routers, should be able to call router created by ``create_wb_router()``
directly with WSGI enviorn and receive a callable response.

* Embedding improvements: If set, the contents of ``environ['pywb.template_params']`` dictionary are added directly to Jinja context, allowing for custom template
params to be passed to pywb jinja templates.

* Root collection support: Can specify a route with `''` which will be the root collection. Fix routing paths to ensure root collection is checked last.

* Customization: support custom route_class for cdx server and pass wbrequest to ``not_found_html`` error handlers.

* Manager: Validate collection names to start with word char and contain alphanum or dash only.

* CLI refactor: easier to create custom cli apps and pass params, inherit shared params. ``live-rewrite-server`` uses new system cli system,
defaults to framed inverse mode. Also runs on ``/live/`` path by default. See ``live-rewrite-server -h`` for a list of current options.

* Add ``cookie_scope: removeall`` cookie rewriter, which will, remove all cookies from replay headers.

* Security: disable file:// altogether for live rewrite path.

* Fuzzy match: better support for custom replace string >1 character: leave string, and strip remainder before fuzzy query.

* Urlrewriter and wburl fixes for various corner cases.

* Rangecache: use url as key if digest not present.

* Framed replay: attempt to mitigate chrome OS X scrolling issue by disabling ``-webkit-transform: none`` in framed mode.
Improves scrolling on many pages but not always consistent (a chrome bug).

0.9.3

Not secure
~~~~~~~~~~~~~~~~~~~~~

* framed replay mode: support ``framed_replay: inverse`` where the top frame is the canonical archival url and the inner frame has ``mp_`` modifier.

* wb.js: improved redirect check: only redirect to top frame in framed mode and compare decoded urls.

* charset detection: read first 1024 bytes to determine charset and add to ``Content-Type`` header if no charset is specified there.

* indexing: support indexing of WARC records with ``urn:`` values as target uris, such as those created by `wpull <https://github.com/chfoo/wpull>`_

* remove certauth module: now using standalone `certauth <http://github.com/ikreymer/certauth>`_ package.

* BlockLoader: use ``requests`` instead of ``urllib2``.

* cdx: %-encode any non-ascii chars found in cdx fields.

* cdx: showNumPages query always return valid result (not 404) for 0 pages. If <1 block, load cdx to determine if 1 page or none.

Page 11 of 16

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.