Commit Graph

7 Commits (e2c13c25039fad68716fe98162599ad3a5bf2b3f)

Author SHA1 Message Date
Timo Bryant 5a3f4031d2 dispaly ocr 2024-01-04 12:59:47 +01:00
Timo Bryant 7ed5a39bac starting with project/corpus 2024-01-03 12:50:29 +01:00
Timo Bryant c758d0b79d first application 2023-12-30 12:03:41 +01:00
Timo Bryant cc727c681a adding core api 2023-12-27 16:11:12 +01:00
Timo Bryant 1ef987f611 rework build logic 2023-12-22 00:18:53 +01:00
Timo Bryant 4cafac4583 refactoring into parallelUnordered method 2023-12-18 22:55:29 +01:00
Timo Bryant 67d65cee93 Add text processing and tfidf libraries
This commit introduces two new libraries: textprocessing and tfidf. The textprocessing library provides classes to read words from a text file, generate histogram from the words, and store the histogram to a CSV file. The tfidf library adds support for term frequency–inverse document frequency (tf-idf) computation using the functionalities provided by the textprocessing library.
2023-12-15 17:17:27 +01:00