Commit Graph

23 Commits

Author SHA1 Message Date
Josako
2ca006d82c Added excluded element classes to HTML parsing to allow for more complex document parsing
Added chunking to conversion of HTML to markdown in case of large files
2024-08-22 16:41:13 +02:00
Josako
ab38dd7540 - Improvements working with the cloud, minio, graylog and some first bugfixing 2024-08-13 09:04:19 +02:00
Josako
9caa423bcf - Getting containers ready for the cloud 2024-08-07 08:37:13 +02:00
Josako
ab8359365d - Getting containers ready for the cloud 2024-08-06 14:56:12 +02:00
Josako
64cf8df3a9 - Improvements to enable deployment in the cloud, mainly changing file access to Minio
- Improvements on RAG logging, and some debugging in that area
2024-08-01 17:35:54 +02:00
Josako
88ca04136d Portkey Integration 2024-07-18 07:08:42 +02:00
Josako
908a2eaf7e - Improve annotation algorithm for Youtube (and others)
- Patch Pytube
- improve OS deletion of files and writing of files
- Start working on Claude
- Improve template management
2024-07-16 14:21:49 +02:00
Josako
ea0127b4b8 Improve algorithms for HTML and PDF processing 2024-07-08 15:20:45 +02:00
Josako
8e1dac0233 Youtube added - further checking required 2024-07-04 08:11:31 +02:00
Josako
7a1b51dd0c Docker deployment Realised 2024-06-26 12:02:14 +02:00
Josako
348bdf2c15 Commit before setting up a new deployment environment 2024-06-21 13:21:54 +02:00
Josako
fd510c8fcd Correcting the retrieval of relevant documents 2024-06-12 16:15:48 +02:00
Josako
be311c440b Improving chat functionality significantly throughout the application. 2024-06-12 11:07:18 +02:00
Josako
27b6de8734 Removing DocumentLanguage, as both System Context and User Context are to be defined on DocumentVersion level.
Finetuning of embedding workers.
2024-06-06 15:26:49 +02:00
Josako
fcc0caeb09 Optimizing admin interface for user domain, completing security views 2024-06-03 09:37:59 +02:00
Josako
ea23e8d327 Setting up the chat client functionality using SocketIO - Start 2024-05-15 14:37:21 +02:00
Josako
011bdce38d Prepare for html document validation (added wanted tags to tenant) 2024-05-12 21:58:42 +02:00
Josako
699de951e8 Add functionality to add a URL to the system. 2024-05-10 22:44:53 +02:00
Josako
d925477e68 Setup of documents view 2024-05-05 20:21:44 +02:00
Josako
0b7f565547 gevent concurrency / asynchronous processes
usage of gunicorn (documented)
2024-04-30 15:50:12 +02:00
Josako
ffa60b4616 Update Document domain models to use pgvector (extension of PostgreSQL) 2024-04-30 15:09:32 +02:00
Josako
f4decbc0cd More dynamic menu handling. Did a lot of stuff using Flask-Nav & Flask_Menu, but removed all of it becauses it became overly complex or the extensions were no longer in active development. 2024-04-28 22:15:47 +02:00
Josako
fd25c39395 Initial commit 2024-04-22 21:23:00 +02:00