|
8276423eb8
|
Prioritise incomplete object during deduplication.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-04 13:51:13 +00:00 |
|
|
7ac0971a17
|
Handle invalid result from PDF parser.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-03 15:39:53 +00:00 |
|
|
e0fe937e1a
|
Fix KDH decryption.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-03 12:12:42 +00:00 |
|
|
4a02b8bfc7
|
Fix inconsistent whitespace detection in PDF parser.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-03 00:13:56 +00:00 |
|
|
7d9d658461
|
Handle duplicated image in HN.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-02 15:38:45 +00:00 |
|
|
000405693e
|
Update CHANGE.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-01 21:26:44 +00:00 |
|
|
d6fa934b5f
|
Handle incomplete PDF object in parser.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-01 20:51:09 +00:00 |
|
|
1a1fee1034
|
Handle duplicated object in CAJ.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-01 19:31:33 +00:00 |
|
|
cde014cffb
|
Improve PDF parser.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-01 18:58:43 +00:00 |
|
|
9019a18449
|
Split md5 function.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-01 11:11:56 +00:00 |
|
|
a18de8f2ef
|
Rename JBIG decoder.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-01 10:09:08 +00:00 |
|
|
70e1e7ea97
|
Fix JBIG decoder data type.
Signed-off-by: yzrh <yzrh@noema.org>
|
2023-01-01 00:42:20 +00:00 |
|
|
bffb8ce8a4
|
Fix JBIG decoder.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-31 21:17:28 +00:00 |
|
|
3ac51d66b9
|
Fix JBIG table length.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-31 18:52:06 +00:00 |
|
|
0bbf8e65dd
|
Update CHANGE.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-31 11:28:03 +00:00 |
|
|
220a81c2ad
|
Fix HN image compositing.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-31 10:48:29 +00:00 |
|
|
1d899d934d
|
Fix PDF object check.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-30 20:16:53 +00:00 |
|
|
226f16ddf4
|
Handle HN page with figure only.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-30 15:04:32 +00:00 |
|
|
9646ee61c3
|
Update CHANGE.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-30 02:04:43 +00:00 |
|
|
5466a441df
|
Fix type casting when processing data.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-30 02:00:12 +00:00 |
|
|
1ce3f89574
|
Handle combination of text and image in page content.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-29 21:10:03 +00:00 |
|
|
5a1afb0056
|
Link against libc for iconv, find openjpeg header with pkgconf.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-29 17:30:36 +00:00 |
|
|
060bc00a0d
|
Update CHANGE.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-29 06:30:59 +00:00 |
|
|
97931e1470
|
Fix PDF object check.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-29 05:23:04 +00:00 |
|
|
cd0af5ba3c
|
Fix buffer overflow when object size is less than 8 bytes.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-29 04:05:34 +00:00 |
|
|
988a751c15
|
Handle missing root object which is parent of others.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-29 03:00:11 +00:00 |
|
|
8083b30530
|
Add JPEG 2000 support.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-29 00:40:14 +00:00 |
|
|
abce2fd2e4
|
Add preliminary support for HN figure placement.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-28 19:29:46 +00:00 |
|
|
224a09a015
|
Update CHANGE.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-26 00:13:18 +00:00 |
|
|
c2ad6549fb
|
Handle headless HN and page with no image.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-25 23:18:17 +00:00 |
|
|
d2826fa075
|
Simplify JBIG decoder.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-25 05:15:56 +00:00 |
|
|
288b65a1fd
|
Handle different JPEG colour component.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-25 01:26:05 +00:00 |
|
|
9c1f1d0b75
|
Fix HN conversion and add JBIG2 support.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-24 23:29:56 +00:00 |
|
|
ac3b1dda63
|
Fix memory leak and data type.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-22 19:48:48 +00:00 |
|
|
63728e1340
|
Add error message for JBIG.
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-20 00:23:46 +00:00 |
|
|
3550095959
|
Update Makefile
Signed-off-by: yzrh <yzrh@noema.org>
|
2022-12-19 23:54:06 +00:00 |
|
|
86b6487fff
|
Remove -march=native in Makefile and remove char **envp for POSIX compliance.
|
2021-09-02 17:04:05 +00:00 |
|
|
409acceffa
|
Fix memory leak.
|
2021-01-29 21:50:20 +00:00 |
|
|
7270c1771f
|
Update CHANGE.
|
2021-01-13 17:30:11 +00:00 |
|
|
7a5dd05425
|
Add wrapper for PDF operation.
|
2021-01-13 17:07:45 +00:00 |
|
|
057a7acc51
|
Fix transformation matrix.
|
2021-01-12 03:13:07 +00:00 |
|
|
f685e91d35
|
Fix HN dictionary generation.
|
2021-01-12 02:26:39 +00:00 |
|
|
2aab394684
|
Fix root object dictionary generation.
|
2021-01-11 22:57:59 +00:00 |
|
|
1994f122cc
|
Decode JBIG and JPEG during HN conversion.
|
2021-01-03 03:01:28 +00:00 |
|
|
b20c6ad3ed
|
Handle binary data in dictionary.
|
2020-12-31 20:38:02 +00:00 |
|
|
3bd7ea7520
|
Improve portability.
|
2020-12-31 18:45:02 +00:00 |
|
|
1f62c53da6
|
Produce PDF directly from KDH.
|
2020-12-30 21:12:52 +00:00 |
|
|
98691d4203
|
Add HN text extraction.
|
2020-12-30 03:09:00 +00:00 |
|
|
8d6fbb43c9
|
Update README.
|
2020-12-29 02:33:03 +00:00 |
|
|
5c5ddc926b
|
Update HN data structure.
|
2020-12-29 02:10:17 +00:00 |
|