Commit graph

677 commits

Author SHA1 Message Date
j
1c871f4d31 add method to add Access-Control-Allow-Origin to HttpFileResponses 2014-02-05 06:37:37 +00:00
j
34691832eb revert change, fragment_fromstring only parses single element 2014-02-04 10:44:51 +00:00
j
8bda86c17d use fragment_fromstring instead of document_fromstring 2014-02-04 10:40:01 +00:00
j
7577b319ce dont take random number if film has no year 2014-01-17 23:09:45 +05:30
j
d1a5613f3f more summary fixes 2014-01-16 13:56:07 +05:30
j
5a61dea925 fix imdb plotsummary parser 2014-01-16 13:49:30 +05:30
j
5179a4fcf9 add yt 4k format 2014-01-15 22:03:39 +05:30
j
2456ec2d5a ox.web.youtube: use in/out/value like ox.srt, decode html value 2014-01-15 20:12:14 +05:30
j
575549ae33 only add api methods to API instance 2014-01-03 00:54:49 +05:30
j
2abe99c89f fix wikipedia movie parser 2013-12-22 13:38:43 +05:30
j
5c1ab13749 no need to load json string into ram 2013-11-15 16:16:21 +01:00
j
37cd92dfba fix html cleanup of empty string 2013-12-01 12:35:38 +00:00
rolux
6f68729b6f API: don't fail on missing 'doc' property 2013-11-23 15:17:59 +01:00
j
d664d99f89 rewrite sanitize_html to support global attributes 2013-11-10 22:00:24 +00:00
j
d8bb547e25 workaround for python2.6 2013-11-06 10:33:45 +01:00
j
828223ad82 dont break ox.API subclasses 2013-11-03 16:39:57 +01:00
j
714729fee7 return new class for each ox.API call 2013-11-02 17:40:01 +01:00
j
d38da54a17 strip <p> 2013-10-31 13:49:55 +01:00
j
5dcd8b3552 allow iframes in sanitize_html 2013-10-24 16:40:04 +00:00
j
38853b1f4b detect IE11, part of ticket #1917 2013-10-24 00:24:13 +02:00
j
e3ee66fe08 trivia 2013-10-21 17:33:00 +02:00
j
0effb090a3 move EXTENSIONS to ox.file and add image type 2013-10-14 20:07:05 +02:00
j
5c6ff50027 use iter to read file with a multiple of hash block_size(sha1sum) 2013-10-14 12:35:07 +02:00
j
7d712445bf utf-8 filenames 2013-10-11 20:38:35 +02:00
j
413848638b remove debug 2013-10-11 20:13:10 +02:00
j
68b0e525ca fixes for django 1.5.x 2013-10-11 20:12:37 +02:00
j
36c7e95788 support nulls_last in sqlite 2013-10-11 20:12:23 +02:00
j
74a9b812b0 update user agent, fixes #1894 2013-09-27 18:14:34 +02:00
j
98ab0e29db support returning more than 10 results 2013-09-08 15:56:57 +02:00
rolux
cb45a25a7c geo.get_country: allow name as arg, not just code 2013-08-28 12:06:56 +02:00
j
22eecc22e4 allow more html5 tags 2013-08-27 08:51:18 +00:00
j
a8e76893d3 only use most common title per type, fixes #1826 2013-08-24 17:30:37 +02:00
rolux
f429ed8b07 add geo.split_geoname 2013-08-18 11:56:48 +02:00
j
3cc5659310 add option to get tweets from one user 2013-08-01 15:14:06 +02:00
rolux
3bf45b9d33 update UA parser 2013-07-30 19:06:01 +02:00
rolux
68a324d8fa update UA parser 2013-07-30 18:33:33 +02:00
j
611db3ed7b fix typos 2013-07-30 15:22:23 +02:00
rolux
893a70791c update ua parser 2013-07-29 19:03:46 +02:00
rolux
996344c689 update ua parser 2013-07-29 18:22:22 +02:00
rolux
b7f98ffecd cosmetic changes 2013-07-25 09:29:24 +02:00
j
ba6ee2e62e make sound unique 2013-07-23 14:54:32 +02:00
j
f3d26879fd one more 2013-07-16 13:42:58 +02:00
j
aa8641f22f more titles to ignore, cloes #1532 2013-07-16 13:41:49 +02:00
j
7acbc72305 return utf-8 encoded json 2013-07-16 11:10:47 +00:00
j
02afccc253 normalize alternative title country names 2013-07-16 11:41:16 +02:00
j
07e1a36ba9 filter working titles, one more World-wide/Internaltional 2013-07-16 11:35:55 +02:00
j
5b9cb279ba world-wide title 2013-07-16 11:02:43 +02:00
j
4c41db9460 add script to update ox.geo.COUNTRIES, normalize_country_name takes and returns a unicode string 2013-07-13 16:14:25 +02:00
j
adfe642547 use geo.normalize_country_name for normalize imdb names 2013-07-13 15:48:26 +02:00
j
ad7e21e7a8 fix lxml unicode handling 2013-07-04 20:32:54 +02:00
j
b1d248c4df add timeout as option to twitter.find, also return html 2013-07-04 12:22:56 +02:00
rolux
0d9bba8865 in wrapText, when testing smaller line widths, don't hyphenate words that were previously not hyphenated 2013-07-03 13:08:41 +02:00
j
330cc5ff3b fix missing space i.e. for:
ox.image.wrapText("VERTEIDIGUNG DER ZEIT",  608, 2, "./data/MontserratBold.ttf", 48)
2013-07-02 20:48:37 +02:00
j
6996f9c422 fix typos 2013-07-02 13:00:27 +00:00
j
1d429b6d33 fix date time serialization 2013-07-02 14:44:18 +02:00
j
78986c671e alternative titles are flipped now 2013-06-29 18:50:10 +02:00
j
deaa2bb988 dont return x as release date 2013-06-29 18:21:58 +02:00
j
1e65f3e478 fix dates < 1900 2013-06-28 14:57:31 +00:00
j
f7e9605828 fix release date parser 2013-06-28 16:53:25 +02:00
j
d7bd98d63a add www. 2013-06-17 22:43:44 +02:00
j
27df553ffb move 2013-06-14 12:19:36 +02:00
j
15ba89bac2 use ddg 2013-06-14 12:17:18 +02:00
j
e2bffccd36 add youtube playlist parser 2013-06-12 18:28:19 +02:00
rolux
f3d91b78d6 update criterion.py 2013-06-09 16:48:58 +02:00
rolux
339b7026f5 ox.file: add ensure_ascii parameter to write_json; add write_image method (write_path + image.save) 2013-06-09 16:45:26 +02:00
j
3951c67623 only look at function closures 2013-06-07 09:44:00 +00:00
j
223ac3c534 pass settings to api template 2013-06-07 11:25:28 +02:00
j
008654ad5d fix api documentation for double decorators 2013-06-07 11:13:01 +02:00
rolux
d79a3c0b95 fix #1574 (wrong id for non-imdb series episode 0 item) 2013-06-06 20:37:46 +02:00
rolux
cca251bc32 ox.image: add getTextSize method 2013-06-06 17:29:52 +02:00
j
f086c64e51 parse arsenal 2013-06-06 11:20:43 +02:00
j
f535b82e7b faster and more reliable encoding detection of html content 2013-06-01 13:29:24 +02:00
j
3165e3a8b1 fix unicode detection 2013-06-01 13:21:13 +02:00
j
7d7c7c9407 fix ddg 2013-06-01 12:25:20 +02:00
j
96f7975747 episodes without season/episode are season 1, add season besides having it in the title, fixes #1548 2013-05-31 22:34:06 +02:00
j
8038b0d13f fix google parser, and by that imdb id lookup, fixes #1545 2013-05-31 22:05:25 +02:00
j
cb3701d3e2 more unwanted akas, fixes #1532 2013-05-31 21:45:25 +02:00
rolux
2daae6a4c3 less error-prone version 2013-05-31 16:01:55 +02:00
rolux
14137c30c8 can't access set via index 2013-05-31 15:57:42 +02:00
rolux
762fab3519 typo 2013-05-31 15:56:49 +02:00
rolux
c33edd3ff3 fix '.en' stripping (movies can have parts, do don't check for '1 srt is en', but 'all srts are en') 2013-05-31 15:52:09 +02:00
j
8bd76ed27f normalize_paths not needed as extra function 2013-05-31 15:23:38 +02:00
rolux
913c8f4c1b in parse_item_files, strip unneeded '.en' (if, per version and per subtitle extension, there is only one language=='en' file) 2013-05-31 15:18:33 +02:00
j
e1508f4068 add movie.normalize_paths and call in parse_item_files 2013-05-31 13:06:42 +00:00
j
986a788bc7 fix #1546 2013-05-31 11:54:57 +02:00
rolux
647f027e8a criterion.py: fix title and synopsis detection 2013-05-31 11:03:09 +02:00
j
4f0654db68 fix keywords, fixes #1541 2013-05-30 21:12:28 +02:00
rolux
540f0bc4bd fix imdb keywords parser 2013-05-30 21:10:06 +02:00
j
100a93296f dont keep originalTitle for episodes, fixes #1535 2013-05-30 13:55:54 +02:00
j
cb9a791a97 ignore more alternative titles, fixes #1532 2013-05-30 11:59:40 +02:00
j
a93dc6e37b fix filmingLocations 2013-05-15 00:23:00 +02:00
j
8563ea8239 add column, line to javascript tokenizer tokens 2013-05-10 13:00:32 +00:00
j
be9424036f add 3gp to video types 2013-05-09 21:17:27 +02:00
j
642e50b721 remove debug 2013-05-07 18:41:44 +02:00
j
0ad7a088bf fix wikipedia parser 2013-05-06 10:25:56 +02:00
j
7e4c2bdaff youtube 2013-03-23 22:28:47 +05:30
j
89f868fc39 composer lists of lists 2013-03-14 14:17:10 +05:30
j
f7186b936c fix wikipedia parser 2013-03-02 05:15:57 +05:30
j
24410c458a mini-series too 2013-03-01 16:01:35 +05:30
j
02b20b0042 expose isSeries flag 2013-03-01 15:15:18 +05:30
j
79ecde337c parse composer 2013-02-26 13:20:06 +05:30
j
8b3230df05 parse color and sound better 2013-02-25 19:23:44 +05:30
j
2d65dcd16a parse color and sound 2013-02-25 19:18:14 +05:30
j
e896dac0de camel case 2013-02-25 13:57:48 +05:30
j
8609e6e9f4 make language unique, fixes #1280 2013-02-20 09:46:34 +05:30
j
1b13c7cb00 parse production companies 2013-02-18 19:20:30 +05:30
j
6e3390d65a fix default values from south migration 2013-02-16 00:35:34 +00:00
j
5e989bcb19 fix movie.parse_path, set language 2013-02-17 20:09:48 +05:30
rolux
340c7fb924 add normalize_country_name method 2013-02-09 10:07:38 +05:30
rolux
f7d72335ef update format_path / parse_path 2013-02-08 23:28:21 +05:30
j
284cb97f1c fix #1216 ignore thousand separators 2013-02-01 16:13:40 +05:30
j
4ae8783d27 ignore cache if not able to load json file 2013-01-31 20:29:21 +05:30
j
e2936705c4 fix import 2013-01-31 20:07:11 +05:30
j
bb78574bc0 add ox.web.twitter 2013-01-31 19:48:07 +05:30
j
a7307ec08a alos load releaseinfo for international titles, fixes #1128 2013-01-31 14:55:23 +05:30
j
f0c2b888d7 fix poster lookup 2013-01-29 08:05:31 +05:30
j
2bfd63ff0f add ox.fix_bad_unicode 2012-12-30 16:16:23 +01:00
j
8021672cfd add ox.web.ubu 2012-12-29 13:04:24 +01:00
rolux
5b5cb0f2c4 typo 2012-12-27 15:05:58 +01:00
rolux
e4e6a2ff3a image module: update getImageHash method 2012-12-27 14:47:15 +01:00
rolux
0ddb32858b image module: add getImageHash method 2012-12-27 14:45:20 +01:00
rolux
5c377a5d3a saner copyfile method (that works for files > max python str len) 2012-12-26 21:26:43 +01:00
j
7884e8671a dont use Japan (imdb display title) (English title) 2012-11-19 12:04:06 +01:00
j
9dfcb77be8 international english before often original uk 2012-11-11 17:23:34 +01:00
j
6529e5f1c1 some more special cases 2012-11-11 17:15:40 +01:00
j
43a54740bd restored version 2012-11-11 03:08:54 +01:00
j
8229a651f7 USA/UK 2012-11-11 02:31:51 +01:00
rolux
b8b7e666bc ua parser: detect chrome on ios 2012-11-09 23:07:31 +01:00
j
11a00db3bb update series detection, fixes #1153 2012-11-09 14:28:34 +01:00
j
c7a2ef21c7 titles 2012-11-08 20:34:26 +01:00
j
91ebecccc3 alternative spelling 2012-11-07 20:07:24 +01:00
j
6eb18c3077 use title from series entry as series title 2012-11-07 00:05:18 +01:00
j
15e78b8cb1 , 2012-11-04 16:12:28 +01:00
j
8c6f211f25 add more video extensions 2012-11-04 16:08:00 +01:00
j
dd316160be fix youtube download urls 2012-11-01 13:14:59 +01:00
rolux
0001859ba8 improve parse_useragent 2012-10-27 21:59:40 +02:00
rolux
d73833f467 in parse_useragent, add chromium 2012-10-27 21:35:46 +02:00
rolux
48240cee7f update parse_useragent 2012-10-27 18:51:39 +02:00
j
71b99c8ce9 better unicode support for download names 2012-10-24 17:18:23 +02:00
j
3b7b87a89f match charset in lower case 2012-10-19 14:22:39 +02:00
j
639b74eac3 reduce choice of alternative titles for english films, fixes #1084 2012-10-10 13:27:19 +02:00
j
13a55763d3 take first title only, fixes #1088 2012-10-10 13:08:37 +02:00
j
eaa8b9995f use title/originalTitle not title/internationalTitle 2012-10-09 14:55:29 +02:00
j
073aaa1b32 make directors unique but keep order, fixes #1076 2012-10-08 13:44:02 +02:00
rolux
104e9f1c5c format_path: don't use more than 10 director names 2012-10-08 11:27:28 +02:00
j
7fe62b5ce3 regex first, fixes #1058 2012-10-01 23:29:57 +02:00
j
102365eb8e ignore - descriptions, fixes #1047 2012-09-30 12:17:45 +02:00
j
1935b76b46 also strip single quotes from titles. fixes #1050 2012-09-30 12:14:33 +02:00
j
e0dd4d53b1 fix only one connection, parse connection description too 2012-09-29 18:13:58 +02:00
j
da09714910 parse new movieconnections page, fixes #1045 2012-09-29 17:55:37 +02:00
rolux
52932bccec fix opera version detection 2012-09-29 14:48:19 +02:00
j
c690f811b6 minor change 2012-09-25 13:54:54 +02:00
j
7c60ddd584 allow DictField to be None 2012-09-25 13:26:50 +02:00
j
8f82cefa78 cleanup imdb characters 2012-09-25 12:57:57 +02:00
j
c8cea371e3 load series pages with same timeout 2012-09-23 15:26:20 +02:00
rolux
280d5ddb39 (minor change) 2012-09-23 15:21:38 +02:00
rolux
8b0b960cce normalize: add 'dos' to lastname prefixes 2012-09-23 15:20:36 +02:00
rolux
62358ecd55 normalize: add 'des' to lastname prefixes 2012-09-23 15:19:54 +02:00
j
7ae6556ff4 cleanup episode titles 2012-09-23 15:12:07 +02:00
j
46f9b36851 update usera agent 2012-09-23 12:01:12 +02:00
j
1272e70891 remove empty alternative titles 2012-09-22 23:28:59 +02:00
j
a79b2a6134 fix stop words 2012-09-22 22:56:10 +02:00
j
6df54b5ca2 add title stop words again 2012-09-22 22:52:18 +02:00
j
b5b480e3f2 use imdb display title before other titles, dont add internationalTitle if its the same as title 2012-09-22 22:44:08 +02:00
j
d29f9e3dc8 fix movies with only one alternative title 2012-09-22 22:38:02 +02:00
j
ca9ffcf867 add Australia (foo) to International titles 2012-09-22 22:34:52 +02:00
j
e6aae3abe5 case sensitive title type match 2012-09-22 22:11:54 +02:00
j
da72fbdaed imdb: retun title/internationalTitle, #947; cleanup alternative titles, #963 2012-09-22 21:40:01 +02:00
j
c69a2ee825 detect html5 encoding 2012-09-22 20:57:40 +02:00
j
2d1ad2785d ignore anything after timecodes in srt, fixes #996 2012-09-21 11:48:02 +02:00
j
1522f870b7 strip whitespace 2012-09-14 11:31:29 +02:00
j
7a8609f5cf cleanup title in one place 2012-09-14 11:27:36 +02:00
j
36c6e6908e cleanup imdb titles, fixes #957 2012-09-14 11:17:34 +02:00
j
9f4020e238 update imdb parser, fixes #942 2012-09-12 23:33:29 +02:00
rolux
327dd21d5c add 'ben' and 'le' to surname suffixes (which are actually prefixes, btw) 2012-09-12 21:45:34 +02:00
rolux
654bd0da0e movie.py: update format_underscores 2012-09-11 19:58:30 +02:00
rolux
c467f7fa52 typo 2012-09-11 19:22:45 +02:00
rolux
2879c66d46 movie.py: deal with special chars that cause problems in afp/smb (?<>) 2012-09-11 19:21:45 +02:00
rolux
16d16b0c45 movie.py: use data['isEpisode'] 2012-09-11 18:30:42 +02:00
rolux
99f8db0385 movie.py: fix bugs with episodes, discard prefix directory 2012-09-11 17:55:35 +02:00
j
bde28ce097 remove seriesDirector, just use director 2012-09-11 15:29:10 +02:00
j
1e53378a78 cleanup ox.movie.format_path 2012-09-11 14:36:51 +02:00
j
8ba74a1e4b fix some failing tests 2012-09-09 19:28:11 +02:00
j
fcae567823 fix import 2012-09-09 18:48:40 +02:00
j
764db8556b up version 2012-09-07 03:57:05 +02:00
j
9bc239de51 import fallback 2012-09-06 13:45:30 +02:00
j
3a0c35372d convert some left over readUrls to read_url 2012-09-06 13:25:57 +02:00
rolux
7ebfdaff30 revert changes 2012-09-01 05:27:59 +02:00
rolux
e04cc5b819 merging changes 2012-09-01 05:16:28 +02:00
rolux
fb26797b95 geo.py: add crosses_dateline and get_area 2012-09-01 05:01:08 +02:00
j
f5ad5d73e4 some writers are still lists of lists 2012-08-30 11:46:25 +02:00
j
2d04e03d3d ox.web.imdb.get_episodes: get all episodes of a series 2012-08-26 15:27:16 +02:00
j
ec04c29bcd accept and support compressed response from api, POST with gzip encoded body is not supported in django/wsgi 2012-08-22 00:54:04 +02:00
j
ebf3b691b2 support new series format in parse_movie_path 2012-08-22 00:53:08 +02:00
rolux
b728fdc9b7 movie.py: more bugfixes 2012-08-22 17:20:58 +02:00
rolux
1f5c060bbc movie.py: more bugfixes 2012-08-22 17:19:31 +02:00
rolux
c12914adb5 movie.py: more bugfixes 2012-08-22 17:17:22 +02:00
rolux
3ded47ac5b movie.py: more bugfixes 2012-08-22 17:10:54 +02:00
rolux
7a51ec5fe9 movie.py: remove debug information, more bugfixes 2012-08-22 14:52:35 +02:00
rolux
b9df4de27e movie.py: temporarily add some debug information 2012-08-22 14:39:51 +02:00