mirror of
https://github.com/jzillmann/pdf-to-markdown.git
synced 2025-01-16 10:38:41 +01:00
27 lines
969 B
Markdown
27 lines
969 B
Markdown
# Known Issues
|
|
|
|
## Missing or wrong characters
|
|
|
|
The text which comes of pdfjs looks very erronous sometimes. E.g [Life-Of-God-In-Soul-Of-Man](examples/Life-Of-God-In-Soul-Of-Man.pdf).
|
|
The interesting thing is that rendering with pdfjs (online) looks good. So maybe this is just a setup problem !?
|
|
|
|
## Uncovered TOC variants
|
|
|
|
- out of order items [Safe-Communication](examples/Safe-Communication.pdf)
|
|
- items in wrong lines + numbers are not numbers [Life-Of-God-In-Soul-Of-Man](examples/Life-Of-God-In-Soul-Of-Man.pdf)
|
|
- CC-NC_Leitfaden.pdf: un-verified toc entries (and/und/&... etc...)
|
|
- Closed-Syllables.pdf: unverified toc entries
|
|
- Safe-Communication.pdf: One toc element is one page off (8=>9)
|
|
|
|
## Not yet reviewed test PDFS
|
|
|
|
- Achieving-The-Paris-Climate-Agreement.pdf
|
|
- wrong page page mapping ?
|
|
- no page numbers removed
|
|
- no toc
|
|
- Made-with-cc.pdf
|
|
- no toc
|
|
- Watered-Soul-Blog-Book.pdf
|
|
- TOC: character minumum cuts out year
|
|
- TOC: stops to early
|