Commit graph

792 commits

Author SHA1 Message Date
j
4e7898ae57 avoid dict as default value 2016-06-08 15:30:25 +02:00
j
170af83272 dont use temp variables to compute ox.image.ZONE_INDEX 2016-06-08 12:35:33 +02:00
j
51da4fd809 fix python3 ox.text 2016-06-08 12:27:55 +02:00
j
ac2e829016 fix python3 ox.srt 2016-06-08 11:36:55 +02:00
j
1e3d2d24bb raise Exception(...) 2016-06-04 00:39:01 +02:00
j
7695a9c015 fix some tests and urls 2016-05-21 15:19:25 +02:00
5355dbf821 Add WebVTT output support
This subset of the format is almost identical to SRT, but I think it's
cleaner to have a separate module (at the cost of a little bit of
copy-pasta).
2016-03-11 12:14:50 +00:00
b75a0f9bb8 srt: neater docstrings, some cleanup 2016-03-11 12:10:32 +00:00
j
959931114b add rmvb as video extension 2016-03-01 14:44:47 +05:30
j
ec1e5459f6 remove ox.django 2016-02-20 17:51:46 +05:30
j
8055e1dd54 update to django 1.9 2016-02-20 17:51:46 +05:30
j
1053673af1 add ox.egg-info to .gitignore 2016-02-20 17:51:46 +05:30
j
be79771f56 add .gitignore 2016-02-19 22:34:23 +05:30
j
43783b00a1 rewrite ox.django.fields 2016-02-19 19:00:53 +05:30
j
2681536b08 use PY2 2016-01-14 17:09:10 +05:30
j
c49f663d54 py3 does not have string.letters 2015-12-25 20:43:15 +05:30
j
1db297169b basestring->six.string_types 2015-12-25 20:38:55 +05:30
j
fa29557a6f ignore audio language if stream length does not match 2015-12-18 15:46:01 +01:00
j
85c1e789ba cleanup cache, fix delete 2015-12-11 20:00:05 +01:00
j
d938091b26 add option delete option to cache 2015-12-11 19:27:54 +01:00
cbcef39ec0 ox.html: fix sanitizing whitespace-only strings
lxml raises:

    ParserError: Document is empty

if you ask it to parse a string with no non-whitespace characters. The
existing truthiness test squashed the commonest case (empty string) but
not the general case.
2015-11-24 18:17:48 +00:00
533a1a627e ox.html.sanitize_fragment: documentation, tests 2015-11-24 18:05:48 +00:00
5448aec902 ox.html.sanitize_html: fix existing tests
The backslashes need to be escaped to come out as literal backslashes in
the Python source code run by doctest.
2015-11-24 17:58:39 +00:00
j
9b887120f7 update url 2015-11-15 15:31:16 +01:00
j
2e38e46cb4 rev is a string 2015-11-15 15:11:20 +01:00
j
c5f5262a96 get version from git 2015-11-15 14:52:59 +01:00
j
98d83192ce jsonc: handle parse errors from 'json' gracefully (fixes #2858)
- JSONDecodeError is only available in simplejson, use ValueError
- imporove error context output
2015-11-12 13:01:29 +01:00
j
3ed213d6d7 update crawler 2015-11-03 23:16:34 +01:00
j
4a8717ee76 update user agent 2015-11-03 23:15:57 +01:00
j
4b3af0cbaf update imdb.movieconnections 2015-10-12 13:56:25 +02:00
7c9887410c Allow definition lists in sanitized HTML 2015-09-14 22:47:21 +02:00
j
5230d59d44 UA strings: Edge+El Capitan 2015-08-04 19:23:47 +02:00
j
77f34143f5 criterion: decode some html 2015-08-02 15:58:59 +02:00
86bffd67b3 API: raise if caller supplies both dict and kwargs
I (incorrectly) wrote something like the following:

    api.find({'query': {...}}, keys=['id'], range=[0, n])

and the query was silently ignored, giving totally different
results to what I wanted. fixes #2822
2015-08-02 15:57:48 +02:00
j
586dbaa932 fix akas 2015-06-01 14:51:09 +02:00
j
4a3fecab19 force cache update 2015-05-23 21:44:37 +02:00
j
5bf53ba463 titles without countries 2015-05-04 10:53:17 +02:00
j
b147c61f5c ubu cleanup 2015-04-26 15:29:32 +02:00
j
5c883e19e6 better ubu parser 2015-04-24 19:02:25 +02:00
j
47bdf3c897 include size for unknown formats 2015-04-24 16:09:31 +02:00
j
72f34f2a60 fix net.oshash for small files 2015-03-19 18:57:50 +05:30
j
ed465c527f better title 2015-03-15 02:38:38 +05:30
j
36c1754725 use video link if its mp4(ubu) 2015-03-15 02:25:31 +05:30
j
9dd0c2416e better description 2015-03-15 02:23:52 +05:30
j
16a955f310 better title 2015-03-15 02:21:56 +05:30
j
c4c0c40825 only ignore title 2015-03-15 02:17:33 +05:30
j
cdea161d2f ignore emtpy parts 2015-03-15 02:12:40 +05:30
j
60ad26d201 update ubu/archive 2015-03-15 01:07:34 +05:30
j
7f7b0c3ee8 filter/map return generators in py3, wrap in list 2015-03-07 23:46:59 +05:30
j
dc6f25aac1 dont fail if files dont have all format keys 2015-01-22 15:31:36 +05:30