Timo Bryant
92b53529f6
auth groundwork for xs support
2024-01-10 14:30:06 +01:00
Timo Bryant
3036ba243c
starting with xs support
2024-01-10 10:47:17 +01:00
Timo Bryant
17931ea1ca
add targets
2024-01-08 08:59:55 +01:00
Timo Bryant
cb8867af2e
korge: add rotation
2024-01-05 21:56:05 +01:00
Timo Bryant
8d4ba39793
korge: can click on stuff
2024-01-05 21:52:09 +01:00
Timo Bryant
0a97b37244
add docthor-core
2024-01-05 21:35:02 +01:00
Timo Bryant
24fca8c62c
korge: add basic interaction
2024-01-05 18:01:48 +01:00
Timo Bryant
8dbe0c59fa
add korge project
2024-01-04 14:30:47 +01:00
Timo Bryant
5a3f4031d2
dispaly ocr
2024-01-04 12:59:47 +01:00
Timo Bryant
d23b4f472c
fix assets paths
2024-01-04 12:20:46 +01:00
Timo Bryant
97b5444159
fix gitignore
2024-01-04 12:06:38 +01:00
Timo Bryant
2cab145008
starting with fileprocessor2
2024-01-04 11:56:20 +01:00
Timo Bryant
9ea725fc36
ms ocr file processor works
2024-01-04 11:12:26 +01:00
Timo Bryant
accdfbca67
starting with document
2024-01-03 13:17:41 +01:00
Timo Bryant
7ed5a39bac
starting with project/corpus
2024-01-03 12:50:29 +01:00
Timo Bryant
4ae5c3bf58
implement fileystem asset manager
2024-01-02 22:50:52 +01:00
Timo Bryant
949f87800a
starting with asset manager
2024-01-02 11:00:00 +01:00
Timo Bryant
ac412385bb
add additonal snippets
2024-01-01 14:56:37 +01:00
Timo Bryant
8ef054baa4
can draw a box in document
2024-01-01 13:50:50 +01:00
Timo Bryant
a2483c85d7
starting with canvas
2023-12-30 22:53:11 +01:00
Timo Bryant
b4ab91e1db
can display and zoom an image
2023-12-30 12:43:58 +01:00
Timo Bryant
c758d0b79d
first application
2023-12-30 12:03:41 +01:00
Timo Bryant
9f3813a83a
starting with ms ocr client
2023-12-29 22:20:33 +01:00
Timo Bryant
30dc3b658d
cleanup
2023-12-29 20:45:30 +01:00
Timo Bryant
6fb0ce2a4f
move stuff to core-io/tui
2023-12-29 20:42:06 +01:00
Timo Bryant
d62aadb95f
move ProgressBarFactory to core api
2023-12-27 16:28:51 +01:00
Timo Bryant
f777669dfa
7 Add TUI module
2023-12-27 16:16:34 +01:00
Timo Bryant
cc727c681a
adding core api
2023-12-27 16:11:12 +01:00
Timo Bryant
2deaa204c5
starting with core api project
2023-12-23 00:47:54 +01:00
Timo Bryant
106bd8a5cc
add docu and target
2023-12-22 10:31:16 +01:00
Timo Bryant
78af3f0d50
add resource and FileSystemResource
2023-12-22 00:57:49 +01:00
Timo Bryant
d973262dbd
starting with io
2023-12-22 00:39:10 +01:00
Timo Bryant
1ef987f611
rework build logic
2023-12-22 00:18:53 +01:00
Timo Bryant
c40ab54012
use progress.step
2023-12-21 23:40:51 +01:00
Timo Bryant
3e5534f184
maybe idf is correct now :D
2023-12-21 23:40:25 +01:00
Timo Bryant
81a30dd2f6
utilize koin
2023-12-21 18:16:12 +01:00
Timo Bryant
606837a76f
code cleanup
2023-12-21 17:31:09 +01:00
Timo Bryant
46f1c49ab1
fix build
2023-12-21 17:12:42 +01:00
Timo Bryant
4cafac4583
refactoring into parallelUnordered method
2023-12-18 22:55:29 +01:00
Timo Bryant
13110fa8e5
paralleling finally works
2023-12-18 21:59:15 +01:00
Timo Bryant
71e066fcde
trying to make a parallel flow
2023-12-18 19:20:30 +01:00
Timo Bryant
1acc87047f
trying to make it parallel
2023-12-17 18:16:31 +01:00
Timo Bryant
d995b26459
rewriting IDF stuff
2023-12-17 17:46:51 +01:00
Timo Bryant
ca51b50306
Refactor code and add functionality for term frequency calculation
...
The major changes in this commit involve code refactoring and adding new functionality to calculate Term frequency (TF). The TF is now computed as a separate step from the TF-IDF calculation, which improves the modularity and maintainability of the code. Additionally, an unnecessary test file (MessageUtilsTest.kt) has been removed, and various dependencies have been updated or removed as needed. A few changes were also made to improve the readability and usability of the code.
2023-12-15 21:14:36 +01:00
Timo Bryant
351ab9b9e3
compute term frequence for wikipedia
2023-12-15 20:28:41 +01:00
Timo Bryant
67d65cee93
Add text processing and tfidf libraries
...
This commit introduces two new libraries: textprocessing and tfidf. The textprocessing library provides classes to read words from a text file, generate histogram from the words, and store the histogram to a CSV file. The tfidf library adds support for term frequency–inverse document frequency (tf-idf) computation using the functionalities provided by the textprocessing library.
2023-12-15 17:17:27 +01:00
Timo Bryant
1259dc8764
Add build.gradle.kts file in tfidf library
...
Added a new build.gradle.kts file in the tfidf library. This file includes the "docthor.kotlin-library-conventions" plugin. This is the initial setup for the build configuration of the tfidf library.
2023-12-15 15:40:24 +01:00
TBr
35f1056929
Initial commit
2023-12-15 14:25:01 +00:00