Commit Graph

480 Commits

Author SHA1 Message Date
Mark Veidemanis 06e80a9759 Time stuff and switch to gensim for tokenisation 2022-10-01 14:46:45 +01:00
Mark Veidemanis 5c91f1af87 Remove commented debug code 2022-09-30 07:22:22 +01:00
Mark Veidemanis 02ff44a6f5 Use only one Redis key for the queue to make chunk size more precise for thread allocation 2022-09-30 07:22:22 +01:00
Mark Veidemanis a5d29606e9 Remove ujson 2022-09-30 15:30:34 +01:00
Mark Veidemanis 6b549dee6a Reformat 2022-09-30 15:23:00 +01:00
Mark Veidemanis 2dd2360b4f Add config file to Turnilo 2022-09-27 08:30:28 +01:00
Mark Veidemanis a2f88e29e6 Implement uvloop 2022-09-23 07:20:30 +01:00
Mark Veidemanis f0df3e80fd Print Ingest settings on start 2022-09-23 08:32:29 +01:00
Mark Veidemanis 09fc63d0ad Make debug output cleaner 2022-09-22 17:39:29 +01:00
Mark Veidemanis e9ae499ce8 Fix indexer options 2022-09-22 17:39:18 +01:00
Mark Veidemanis b6f8dabccd Fix Java variable in indexer parameters 2022-09-22 08:41:59 +01:00
Mark Veidemanis 395dfb1e7b Decrease memory requirements further and switch Kafka image 2022-09-21 21:11:13 +01:00
Mark Veidemanis ee79762c73 Set Kafka max heap size 2022-09-21 20:26:05 +01:00
Mark Veidemanis e58b9960b2 Set max memory for Metabase 2022-09-21 14:39:11 +01:00
Mark Veidemanis 4a60dec964 Remove debugging code and fix regex substitution 2022-09-21 12:48:54 +01:00
Mark Veidemanis 9ee55a720b Change dev container names 2022-09-21 12:09:18 +01:00
Mark Veidemanis 799286ca76 Change prod container names 2022-09-21 12:08:29 +01:00
Mark Veidemanis 0e62a5b4b8 Remove prod compose comment 2022-09-21 12:04:54 +01:00
Mark Veidemanis 5ebae02bf2 Remove commented code for debugging 2022-09-21 10:02:05 +01:00
Mark Veidemanis ced3a251b2 Normalise fields in processing and remove invalid characters 2022-09-21 10:01:12 +01:00
Mark Veidemanis 740f93208b Make production volumes point to external storage 2022-09-21 10:00:48 +01:00
Mark Veidemanis 2763e52e6b Don't muddle up the topics when sending Kafka batches 2022-09-20 23:03:02 +01:00
Mark Veidemanis 869af451e5 Document new PROCESS_THREADS setting in example file 2022-09-20 22:43:04 +01:00
Mark Veidemanis 31c58dd85b Make CPU threads configurable 2022-09-20 22:29:13 +01:00
Mark Veidemanis 40a0c2d22e Make performance settings configurable 2022-09-20 22:22:13 +01:00
Mark Veidemanis 9f4d4784af Set memory size to 2.5GB 2022-09-08 07:20:30 +01:00
Mark Veidemanis 72c22ed91e Update DirectMemorySize to be 1.5GB 2022-09-19 21:51:07 +01:00
Mark Veidemanis ce62a84cec Make MaxDirectMemory 0.5*cores 2022-09-19 19:15:57 +01:00
Mark Veidemanis 41b5ca6afd Make max memory size 512m 2022-09-19 19:10:33 +01:00
Mark Veidemanis 7db3504251 Further decrease Druid memory requirements 2022-09-19 17:07:15 +01:00
Mark Veidemanis 1284700e61 Bump production Kafka healthcheck timeout 2022-09-19 11:18:52 +01:00
Mark Veidemanis a9803fc79c Decrease production Druid max memory size 2022-09-19 10:51:34 +01:00
Mark Veidemanis d4861811e5 Increase Kafka retries 2022-09-19 10:48:29 +01:00
Mark Veidemanis 3c2e8e8e67 Change Metabase port 2022-09-18 13:15:10 +01:00
Mark Veidemanis f60c08918e Add docker environment file 2022-09-18 13:05:08 +01:00
Mark Veidemanis 0d6b3763f9 Update production compose 2022-09-18 13:04:08 +01:00
Mark Veidemanis d4b8e11525 Reformat comment 2022-09-18 13:02:06 +01:00
Mark Veidemanis 38d00f2c21 Implement restricted sources 2022-09-18 13:01:19 +01:00
Mark Veidemanis cb11ce9b12 Fix merge conflict 2022-09-16 17:45:24 +01:00
Mark Veidemanis a89b5a8b6f Implement sentiment/NLP annotation and optimise processing 2022-09-16 17:09:49 +01:00
Mark Veidemanis f432e9b29e Properly process Redis buffered messages and ingest into Kafka 2022-09-14 18:32:32 +01:00
Mark Veidemanis c5f01c3084 Ingest into Kafka and queue messages better 2022-09-13 22:17:46 +01:00
Mark Veidemanis 47c5f89914 Implement Apache Druid/Kafka and Metabase 2022-09-13 22:17:32 +01:00
Mark Veidemanis 68fd5fa230 Switch to latest image for dev docker-compose 2022-09-13 09:20:43 +01:00
Mark Veidemanis fd90c233c2 Begin implementing Apache Druid 2022-09-08 07:20:30 +01:00
Mark Veidemanis 0eb4a04b89 Use stable after all 2022-09-08 07:20:30 +01:00
Mark Veidemanis e196172e04 Switch production image back to dev 2022-09-08 07:20:30 +01:00
Mark Veidemanis 41a8cea873 Lower memory requirements to prevent crashes 2022-09-08 07:20:30 +01:00
Mark Veidemanis 9cf4e945d1 Set dev image back to the default 2022-09-12 08:43:18 +01:00
Mark Veidemanis 04b5dec843 Treat text fields as string and try beta Kibana image 2022-09-12 08:27:13 +01:00