Commit Graph

5 Commits

Author SHA1 Message Date
Johannes Zillmann
7abafc61e7 Improve word boundary detection
- sometimes a word is provided with multiple items. E.g: "T his is a sen tence"
- use x-axis distance to not put whitespaces in the middle of a word
- also tweak the line detection a bit (for Alice)
2024-05-20 00:22:24 -06:00
Johannes Zillmann
616909481a Don't print globals twice 2021-07-18 14:13:38 -06:00
Johannes Zillmann
46234417ad Fine tune line detection
* Before lines where assembled that really separate lines
2021-07-18 13:07:06 -06:00
Johannes Zillmann
94a7405671 Lookup and verify toc links 2021-04-25 14:41:50 +02:00
Johannes Zillmann
baa5b4fadc Add 6 more test PDFs 2021-04-18 11:34:11 +02:00