Commit graph

36 commits

Author SHA1 Message Date
j
41edea1862 escape strings 2024-09-11 22:52:01 +01:00
j
ae10c5c9b9 more raw regexp strings 2024-08-30 13:30:47 +02:00
j
99554cb461 fix add_link 2024-03-20 12:55:14 +01:00
j
7461719e23 fix links at end of paragraph 2024-03-20 12:50:11 +01:00
j
99e221095b get rid of u string literal 2023-07-27 18:37:28 +02:00
j
d03a6b120d fix sanitize_fragment('\ufeff') 2023-07-27 18:37:28 +02:00
j
adad3be419 drop six and python2 support 2023-07-27 13:07:13 +02:00
j
851204c619 allow target="_blank" for href 2020-10-15 11:39:19 +02:00
j
77f8876fca cleanup pylint errors and py2/3 issues 2016-06-08 15:32:46 +02:00
j
2681536b08 use PY2 2016-01-14 17:09:10 +05:30
j
c49f663d54 py3 does not have string.letters 2015-12-25 20:43:15 +05:30
j
1db297169b basestring->six.string_types 2015-12-25 20:38:55 +05:30
cbcef39ec0 ox.html: fix sanitizing whitespace-only strings
lxml raises:

    ParserError: Document is empty

if you ask it to parse a string with no non-whitespace characters. The
existing truthiness test squashed the commonest case (empty string) but
not the general case.
2015-11-24 18:17:48 +00:00
533a1a627e ox.html.sanitize_fragment: documentation, tests 2015-11-24 18:05:48 +00:00
5448aec902 ox.html.sanitize_html: fix existing tests
The backslashes need to be escaped to come out as literal backslashes in
the Python source code run by doctest.
2015-11-24 17:58:39 +00:00
7c9887410c Allow definition lists in sanitized HTML 2015-09-14 22:47:21 +02:00
j
6dfa80b646 fix ox.image in python3 2014-10-01 10:48:06 +02:00
j
d4d09b56b6 use six to support python 2 and 3 2014-09-30 21:04:46 +02:00
j
34691832eb revert change, fragment_fromstring only parses single element 2014-02-04 10:44:51 +00:00
j
8bda86c17d use fragment_fromstring instead of document_fromstring 2014-02-04 10:40:01 +00:00
j
37cd92dfba fix html cleanup of empty string 2013-12-01 12:35:38 +00:00
j
d664d99f89 rewrite sanitize_html to support global attributes 2013-11-10 22:00:24 +00:00
j
d38da54a17 strip <p> 2013-10-31 13:49:55 +01:00
j
5dcd8b3552 allow iframes in sanitize_html 2013-10-24 16:40:04 +00:00
j
22eecc22e4 allow more html5 tags 2013-08-27 08:51:18 +00:00
j
8ba74a1e4b fix some failing tests 2012-09-09 19:28:11 +02:00
j
bb35daa95c replace all CammelCase with under_score in ox 2012-08-14 16:12:43 +02:00
j
ddecba1d1f cleanup 2012-05-27 13:50:10 +02:00
j
f6b5d6bde8 parse_html->sanitize_html, add links, allow mailto 2012-05-27 13:38:58 +02:00
j
5a00be4b37 apos not in name2codepoint, also decode that 2012-04-24 19:00:48 +02:00
j
6bd249a5ae use local ffmpeg2theora if available 2012-02-24 22:37:55 +05:30
j
60839de99c escape_html/parse_html 2012-02-21 21:14:50 +05:30
j
f1fee46126 episodes without director where series does not have a creator, fix euro entity 2011-10-18 14:25:13 +02:00
j
0d354d2574 rename, use namespaces 2010-07-08 00:34:04 +02:00
j
cff20af4e7 back to oxlib, package_dir does not work with pip/python setup.py develop 2009-10-12 17:21:05 +02:00
j
d14f13faaf install as ox 2009-10-11 14:53:50 +02:00
Renamed from oxlib/html.py (Browse further)