Commit graph

56 commits

Author SHA1 Message Date
j
5d02474ce8 decode html 2016-01-08 16:15:10 +05:30
j
71d8825783 normalize names 2016-01-08 16:15:10 +05:30
j
d866b4de91 parse epubs without manifest 2016-01-06 18:40:23 +05:30
j
4ed4926bd8 epub: metadata.conver is id not name 2016-01-05 15:30:15 +05:30
j
ca3888869b epub: use metadata.cover if set 2016-01-05 15:20:47 +05:30
j
78c9c5443f epub parser: take larges image from manifest, strip html tags from description 2016-01-05 14:42:02 +05:30
j
051b634008 ignore errors for non utf-8 html files 2016-01-03 21:00:30 +05:30
j
619a2fbd37 split pdf author 2015-12-25 20:23:22 +05:30
j
f8c09226de normalize language 2015-12-25 19:40:49 +05:30
j
c5afc46af1 cleanup pdf 2015-12-25 13:33:32 +05:30
j
ebc0b95022 better pdf parsing 2015-12-24 20:30:14 +05:30
j
ccd3b166d0 fix empty author 2015-12-24 19:07:36 +05:30
j
fe7769a7ba dont fail if reading metadata.opf fails 2015-12-08 11:54:04 +00:00
j
81cd9c2337 fix epub metadata parser 2015-12-01 17:20:32 +01:00
j
d497e89b2b use logging.getLogger(__name__) 2015-11-29 15:56:38 +01:00
j
c3548a1530 cover can be in svg 2015-11-17 19:23:07 +01:00
j
fba2fa78ce ignore none as epub metadata value 2015-11-16 16:52:36 +01:00
j
a24061518a better epub parsing
- dont fail if epubs are invalid zip
- handle quoted filenames
- dont fail if file is missing
2015-11-16 16:02:45 +01:00
j
62e50c29c6 import description from opf 2015-10-30 11:31:52 +01:00
j
6d19dd5e81 inital cbr support 2015-03-14 13:05:15 +05:30
j
6d3d0bbc43 txt.js/txt.py path has changed 2015-03-08 01:46:55 +05:30
j
7a76e21e99 only strip strings 2015-02-22 16:37:42 +05:30
j
121a2c9ac3 ignore osx resource forks 2014-11-15 01:05:33 +00:00
j
d722ae004b handle utf-16 pdf info 2014-11-15 00:57:49 +00:00
j
89d9ab4f11 fix default icon 2014-10-31 19:49:36 +01:00
j
c6c8e0dc8a try to decrypt pdf with empty password if its encrypted 2014-10-31 16:13:02 +01:00
j
a306370f0d more utf-8 issues 2014-10-31 15:41:46 +01:00
j
3f3299e820 fix epub parsing 2014-10-31 09:58:52 +01:00
j
9db6adc222 run txt cover script with python3 2014-10-01 10:50:46 +02:00
j
c961aa5c64 fix text extraction on osx 2014-09-30 22:30:09 +02:00
j
461fe3b9cf more str/bytes 2014-09-08 21:17:35 +02:00
j
8c6164e0c4 use PyPDF2 2014-09-08 20:46:09 +02:00
j
de68f4c4c4 more py3 porting 2014-09-03 01:09:42 +02:00
j
8e27b9f76e port to python3 2014-09-03 00:38:34 +02:00
j
b87eee40d0 from PIL import Image 2014-09-01 12:38:14 +02:00
j
2cd77e07a2 close_fds=True by default 2014-08-22 18:49:11 +02:00
j
0704e315ba scan in tasks queue 2014-05-28 17:36:26 +02:00
j
8ab68826b5 remove null strings from file metadata 2014-05-28 13:36:44 +02:00
j
1e082c3d1b query fixes, avoid empty authors 2014-05-27 20:10:55 +02:00
j
95085bde8c use metadata.opf or metadata if available 2014-05-27 16:08:14 +02:00
j
7e7478be30 fix pdf info 2014-05-27 11:09:06 +02:00
j
21d6324eb6 performance 2014-05-27 01:45:29 +02:00
j
bc9e6cc92d cache file hash 2014-05-26 11:27:20 +02:00
j
67d1814192 cleanup meta parser 2014-05-26 10:23:10 +02:00
j
b3caaf335a use poppler pdftocairo for preview 2014-05-25 14:44:07 +02:00
j
feddea0ccd lots of stuff 2014-05-21 02:02:21 +02:00
j
326a8f75c6 postupdate, pdf osx fixes 2014-05-20 02:08:38 +02:00
j
b6daa19d73 covers/links 2014-05-19 20:12:02 +02:00
j
1436b14003 fixes 2014-05-19 14:07:08 +02:00
j
9aef3616ba extract textsize, take timestamp for changelog entries update peers on peering events 2014-05-19 11:38:41 +02:00