Forkparser

Latest version: v6.1.0

Safety actively analyzes 630254 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 3 of 10

3.3

problem tracking xml:base and xml:lang if element declares it, child
doesn't, first grandchild redeclares it, and second grandchild doesn't;
refactored date parsing; defined public registerDateHandler so callers
can add support for additional date formats at runtime; added support
for OnBlog, Nate, MSSQL, Greek, and Hungarian dates (ytrewq1); added
zopeCompatibilityHack() which turns FeedParserDict into a regular
dictionary, required for Zope compatibility, and also makes command-
line debugging easier because pprint module formats real dictionaries
better than dictionary-like objects; added NonXMLContentType exception,
which is stored in bozo_exception when a feed is served with a non-XML
media type such as 'text/plain'; respect Content-Language as default
language if not xml:lang is present; cloud dict is now FeedParserDict;
generator dict is now FeedParserDict; better tracking of xml:lang,
including support for xml:lang='' to unset the current language;
recognize RSS 1.0 feeds even when RSS 1.0 namespace is not the default
namespace; don't overwrite final status on redirects (scenarios:
redirecting to a URL that returns 304, redirecting to a URL that
redirects to another URL with a different type of redirect); add
support for HTTP 303 redirects

3.2

convert feed to UTF-8 before passing to XML parser; completely revamped
logic for determining character encoding and attempting XML parsing
(much faster); increased default timeout to 20 seconds; test for presence
of Location header on redirects; added tests for many alternate character
encodings; support various EBCDIC encodings; support UTF-16BE and
UTF16-LE with or without a BOM; support UTF-8 with a BOM; support
UTF-32BE and UTF-32LE with or without a BOM; fixed crashing bug if no
XML parsers are available; added support for 'Content-encoding: deflate';
send blank 'Accept-encoding: ' header if neither gzip nor zlib modules
are available

3.1

to Unicode equivalents in illformed feeds (aaronsw); added and
passed tests for converting character entities to Unicode equivalents
in illformed feeds (aaronsw); test for valid parsers when setting
XML_AVAILABLE; make version and encoding available when server returns
a 304; add handlers parameter to pass arbitrary urllib2 handlers (like
digest auth or proxy support); add code to parse username/password
out of url and send as basic authentication; expose downloading-related
exceptions in bozo_exception (aaronsw); added __contains__ method to
FeedParserDict (aaronsw); added publisher_detail (aaronsw)

3.0.1

recover from malformed content-type header parameter with no equals sign
('text/xml; charset:iso-8859-1')

3.0

iso-8859-1 and windows-1252 anyway, and most incorrectly marked feeds are
windows-1252); fixed regression that could cause the same encoding to be
tried twice (even if it failed the first time)

3.0b23

high-bit characters in attributes in embedded HTML in description (thanks
Thijs van de Vossen); moved guid, date, and date_parsed to mapped keys in
FeedParserDict; tweaked FeedParserDict.has_key to return True if asking
about a mapped key

3.0fc1 - 4/23/2004 - MAP - made results.entries[0].links[0] and
results.entries[0].enclosures[0] into FeedParserDict; fixed typo that could
cause the same encoding to be tried twice (even if it failed the first time);
fixed DOCTYPE stripping when DOCTYPE contained entity declarations;
better textinput and image tracking in illformed RSS 1.0 feeds

3.0fc2 - 5/10/2004 - MAP - added and passed Sam's amp tests; added and passed
my blink tag tests

3.0fc3 - 6/18/2004 - MAP - fixed bug in _changeEncodingDeclaration that
failed to parse utf-16 encoded feeds; made source into a FeedParserDict;
duplicate admin:generatorAgent/rdf:resource in generator_detail.url;
added support for image; refactored parse() fallback logic to try other
encodings if SAX parsing fails (previously it would only try other encodings
if re-encoding failed); remove unichr madness in normalize_attrs now that
we're properly tracking encoding in and out of BaseHTMLProcessor; set
feed.language from root-level xml:lang; set entry.id from rdf:about;
send Accept header

Page 3 of 10

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.