Commit Graph

41 Commits (86f3f66d5070e6fe18c812c9f2d652395bff1cac)

Author SHA1 Message Date
Timo Bryant 86f3f66d50 can stress test the analyse 2024-01-10 15:45:10 +01:00
Timo Bryant 0e1ae654a7 cache the token 2024-01-10 15:17:30 +01:00
Timo Bryant f645477ded can submit an image 2024-01-10 15:06:39 +01:00
Timo Bryant 92b53529f6 auth groundwork for xs support 2024-01-10 14:30:06 +01:00
Timo Bryant 3036ba243c starting with xs support 2024-01-10 10:47:17 +01:00
Timo Bryant 0a97b37244 add docthor-core 2024-01-05 21:35:02 +01:00
Timo Bryant 5a3f4031d2 dispaly ocr 2024-01-04 12:59:47 +01:00
Timo Bryant d23b4f472c fix assets paths 2024-01-04 12:20:46 +01:00
Timo Bryant 97b5444159 fix gitignore 2024-01-04 12:06:38 +01:00
Timo Bryant 2cab145008 starting with fileprocessor2 2024-01-04 11:56:20 +01:00
Timo Bryant 9ea725fc36 ms ocr file processor works 2024-01-04 11:12:26 +01:00
Timo Bryant accdfbca67 starting with document 2024-01-03 13:17:41 +01:00
Timo Bryant 7ed5a39bac starting with project/corpus 2024-01-03 12:50:29 +01:00
Timo Bryant 4ae5c3bf58 implement fileystem asset manager 2024-01-02 22:50:52 +01:00
Timo Bryant 949f87800a starting with asset manager 2024-01-02 11:00:00 +01:00
Timo Bryant a2483c85d7 starting with canvas 2023-12-30 22:53:11 +01:00
Timo Bryant c758d0b79d first application 2023-12-30 12:03:41 +01:00
Timo Bryant 9f3813a83a starting with ms ocr client 2023-12-29 22:20:33 +01:00
Timo Bryant 30dc3b658d cleanup 2023-12-29 20:45:30 +01:00
Timo Bryant 6fb0ce2a4f move stuff to core-io/tui 2023-12-29 20:42:06 +01:00
Timo Bryant d62aadb95f move ProgressBarFactory to core api 2023-12-27 16:28:51 +01:00
Timo Bryant f777669dfa 7 Add TUI module 2023-12-27 16:16:34 +01:00
Timo Bryant cc727c681a adding core api 2023-12-27 16:11:12 +01:00
Timo Bryant 2deaa204c5 starting with core api project 2023-12-23 00:47:54 +01:00
Timo Bryant 78af3f0d50 add resource and FileSystemResource 2023-12-22 00:57:49 +01:00
Timo Bryant d973262dbd starting with io 2023-12-22 00:39:10 +01:00
Timo Bryant 1ef987f611 rework build logic 2023-12-22 00:18:53 +01:00
Timo Bryant c40ab54012 use progress.step 2023-12-21 23:40:51 +01:00
Timo Bryant 3e5534f184 maybe idf is correct now :D 2023-12-21 23:40:25 +01:00
Timo Bryant 81a30dd2f6 utilize koin 2023-12-21 18:16:12 +01:00
Timo Bryant 606837a76f code cleanup 2023-12-21 17:31:09 +01:00
Timo Bryant 46f1c49ab1 fix build 2023-12-21 17:12:42 +01:00
Timo Bryant 4cafac4583 refactoring into parallelUnordered method 2023-12-18 22:55:29 +01:00
Timo Bryant 13110fa8e5 paralleling finally works 2023-12-18 21:59:15 +01:00
Timo Bryant 71e066fcde trying to make a parallel flow 2023-12-18 19:20:30 +01:00
Timo Bryant 1acc87047f trying to make it parallel 2023-12-17 18:16:31 +01:00
Timo Bryant d995b26459 rewriting IDF stuff 2023-12-17 17:46:51 +01:00
Timo Bryant ca51b50306 Refactor code and add functionality for term frequency calculation
The major changes in this commit involve code refactoring and adding new functionality to calculate Term frequency (TF). The TF is now computed as a separate step from the TF-IDF calculation, which improves the modularity and maintainability of the code. Additionally, an unnecessary test file (MessageUtilsTest.kt) has been removed, and various dependencies have been updated or removed as needed. A few changes were also made to improve the readability and usability of the code.
2023-12-15 21:14:36 +01:00
Timo Bryant 351ab9b9e3 compute term frequence for wikipedia 2023-12-15 20:28:41 +01:00
Timo Bryant 67d65cee93 Add text processing and tfidf libraries
This commit introduces two new libraries: textprocessing and tfidf. The textprocessing library provides classes to read words from a text file, generate histogram from the words, and store the histogram to a CSV file. The tfidf library adds support for term frequency–inverse document frequency (tf-idf) computation using the functionalities provided by the textprocessing library.
2023-12-15 17:17:27 +01:00
Timo Bryant 1259dc8764 Add build.gradle.kts file in tfidf library
Added a new build.gradle.kts file in the tfidf library. This file includes the "docthor.kotlin-library-conventions" plugin. This is the initial setup for the build configuration of the tfidf library.
2023-12-15 15:40:24 +01:00