mirror of
https://github.com/jzillmann/pdf-to-markdown.git
synced 2024-12-28 01:08:57 +01:00
846 B
846 B
Known Issues
Missing or wrong characters
The text which comes of pdfjs looks very erronous sometimes. E.g Life-Of-God-In-Soul-Of-Man. The interesting thing is that rendering with pdfjs (online) looks good (but copying the text shows the same distortion). So maybe this is just a setup problem !?
Uncovered TOC variants
- out of order items Safe-Communication
- items in wrong lines + numbers are not numbers Life-Of-God-In-Soul-Of-Man
- no page numbers The-Art-of-Public-Speaking.
- multiline headlines: WoodUp
- Detecting list of figures (and creating headlines) Achieving-The-Paris-Climate-Agreement