Commit Graph

20 Commits

Author SHA1 Message Date
Johannes Zillmann
125d4f3079 Simplify PDF pipeline a bit 2024-03-25 16:36:58 -06:00
Johannes Zillmann
5bf4988da2 Move from TSLint to ESLint
- Fix some errors but still not green (good enough for now)
2024-03-20 09:31:09 -06:00
Johannes Zillmann
c696806a0e Update typescript 2024-03-19 18:08:18 -06:00
Johannes Zillmann
02c2fd04fe DetectToc removes TOC items and marks headlines 2021-07-19 10:15:59 -06:00
Johannes Zillmann
d223e8a790 Move types to front 2021-07-18 14:25:55 -06:00
Johannes Zillmann
616909481a Don't print globals twice 2021-07-18 14:13:38 -06:00
Johannes Zillmann
94a7405671 Lookup and verify toc links 2021-04-25 14:41:50 +02:00
Johannes Zillmann
19a76d6163 Publish TOC as global (rudimentary) 2021-04-25 08:15:10 +02:00
Johannes Zillmann
5b611cd506 Rename TocDetection to DetectToc 2021-04-18 15:31:45 +02:00
Johannes Zillmann
a1ea24cc3a Improved TOC detection
- Restrict pages before numbered line
2021-04-18 10:05:34 +02:00
Johannes Zillmann
6283ab7a96 Track evaluation score (optionally)
Makes it easier to see how a value got classified
2021-04-01 18:16:42 +02:00
Johannes Zillmann
89d4bbd2f9 Cover globals in tests 2021-03-28 10:58:24 +02:00
Johannes Zillmann
d7d3502a25 Fix processing PDFs with no page numbers 2021-03-28 10:21:26 +02:00
Johannes Zillmann
4c77274d16 Fix tests 2021-03-23 08:46:14 +01:00
Johannes Zillmann
c98145a63c Test for remote PDFs 2021-03-22 09:03:26 +01:00
Johannes Zillmann
f5a180113d No unused locals 2021-03-21 08:39:42 +01:00
Johannes Zillmann
9bd5043f2e Very basic removal of repetitive elements 2021-03-14 12:15:37 +01:00
Johannes Zillmann
d5523fb1d4 Split result files
* Due to the 100 MB limit of GitHub
2021-03-13 22:46:10 +01:00
Johannes Zillmann
713a82b41d Stabilize font display in tests
* If multiple PDFs are tested one after another, their font IDs change (e.g. `g_d0_f1` becomes `g_d1_f1`)
2021-03-13 19:38:47 +01:00
Johannes Zillmann
417cc2ab94 Add Test infrastructure for example PDFs 2021-03-13 08:46:22 +01:00