Commit Graph

50 Commits

Author SHA1 Message Date
Mark Veidemanis 808ed18b74
Switch quickstart setting to nano 2022-10-04 20:37:02 +01:00
Mark Veidemanis 34e589aa9c
Set Superset env file relative to docker directory 2022-10-04 20:30:14 +01:00
Mark Veidemanis cc6340acab
Add persistent Redis data store and copy over Druid config to production 2022-10-04 20:26:58 +01:00
Mark Veidemanis 7b73229d5a
Add Apache Superset and fix Druid resource usage 2022-10-04 20:17:04 +01:00
Mark Veidemanis 35ba2cc947
Add postgres config to Metabase 2022-10-02 14:29:40 +01:00
Mark Veidemanis 817bfd8835
Time stuff and switch to gensim for tokenisation 2022-10-01 14:46:45 +01:00
Mark Veidemanis 63081f68b7
Use only one Redis key for the queue to make chunk size more precise for thread allocation 2022-09-30 07:22:22 +01:00
Mark Veidemanis 5992498493
Remove ujson 2022-09-30 15:30:34 +01:00
Mark Veidemanis a8dbabd85e
Implement uvloop 2022-09-23 07:20:30 +01:00
Mark Veidemanis 0e9a016e2a
Fix indexer options 2022-09-22 17:39:18 +01:00
Mark Veidemanis 763501d1ee
Fix Java variable in indexer parameters 2022-09-22 08:41:59 +01:00
Mark Veidemanis 40a215e6ec
Decrease memory requirements further and switch Kafka image 2022-09-21 21:11:13 +01:00
Mark Veidemanis 7abf9a00cb
Set Kafka max heap size 2022-09-21 20:26:05 +01:00
Mark Veidemanis bd3f1ecd53
Set max memory for Metabase 2022-09-21 14:39:11 +01:00
Mark Veidemanis 00890860c0
Change prod container names 2022-09-21 12:08:29 +01:00
Mark Veidemanis b0efaeef90
Remove prod compose comment 2022-09-21 12:04:54 +01:00
Mark Veidemanis 48e4c07959
Make production volumes point to external storage 2022-09-21 10:00:48 +01:00
Mark Veidemanis 24929a5fbb
Set memory size to 2.5GB 2022-09-08 07:20:30 +01:00
Mark Veidemanis f336d96268
Update DirectMemorySize to be 1.5GB 2022-09-19 21:51:07 +01:00
Mark Veidemanis 315e477916
Make MaxDirectMemory 0.5*cores 2022-09-19 19:15:57 +01:00
Mark Veidemanis 006677819d
Make max memory size 512m 2022-09-19 19:10:33 +01:00
Mark Veidemanis 93a0be98ce
Further decrease Druid memory requirements 2022-09-19 17:07:15 +01:00
Mark Veidemanis 14322f5090
Bump production Kafka healthcheck timeout 2022-09-19 11:18:52 +01:00
Mark Veidemanis d94da5ac5c
Decrease production Druid max memory size 2022-09-19 10:51:34 +01:00
Mark Veidemanis a1382ee46d
Increase Kafka retries 2022-09-19 10:48:29 +01:00
Mark Veidemanis 5e6b962ea8
Change Metabase port 2022-09-18 13:15:10 +01:00
Mark Veidemanis e8dd847b36
Add docker environment file 2022-09-18 13:05:08 +01:00
Mark Veidemanis d68bcfaebd
Update production compose 2022-09-18 13:04:08 +01:00
Mark Veidemanis 143f2a0bf0
Implement sentiment/NLP annotation and optimise processing 2022-09-16 17:09:49 +01:00
Mark Veidemanis 3c2adfc16e
Implement Apache Druid/Kafka and Metabase 2022-09-13 22:17:32 +01:00
Mark Veidemanis baea6aebeb
Use stable after all 2022-09-08 07:20:30 +01:00
Mark Veidemanis eaecc5cdbe
Switch production image back to dev 2022-09-08 07:20:30 +01:00
Mark Veidemanis 764e36ef14
Lower memory requirements to prevent crashes 2022-09-08 07:20:30 +01:00
Mark Veidemanis 21182629b4
Treat text fields as string and try beta Kibana image 2022-09-12 08:27:13 +01:00
Mark Veidemanis dfd71b6c64
Add Mysql port to ports instead of expose 2022-09-10 13:20:06 +01:00
Mark Veidemanis 1b0817b047
Expose the Mysql port 2022-09-10 13:16:19 +01:00
Mark Veidemanis 0ba4929294
Use dev image of manticore 2022-09-10 12:03:45 +01:00
Mark Veidemanis caded433b7
Remove indexer block to attempt to prevent Manticore DB crash 2022-09-08 07:20:30 +01:00
Mark Veidemanis 89328a827a
Raise open files limit for Redis 2022-09-07 07:20:30 +01:00
Mark Veidemanis cdd12cd082
Implement threshold writing to Redis and manticore ingesting from Redis 2022-09-07 07:20:30 +01:00
Mark Veidemanis 2aedcf77a0
Add aioredis 2022-09-08 09:44:27 +01:00
Mark Veidemanis a6b5348224
Config relative to Git dir 2022-09-05 07:20:30 +01:00
Mark Veidemanis d0fe2baafe
Store persistent database elsewhere 2022-09-05 07:20:30 +01:00
Mark Veidemanis e092327932
Improve DB performance with caching 2022-09-05 07:20:30 +01:00
Mark Veidemanis ecb8079b5b
Change Python to 3.10 2022-09-05 07:20:30 +01:00
Mark Veidemanis 6811ce4af5
Update production env file path 2022-09-05 07:20:30 +01:00
Mark Veidemanis 318a8ddbd5
Split thread list into chunks to save memory 2022-09-05 07:20:30 +01:00
Mark Veidemanis db46fea550
Run processing in thread 2022-09-04 21:29:00 +01:00
Mark Veidemanis 663a26778d
Begin implementing aiohttp 2022-09-04 13:47:32 +01:00
Mark Veidemanis 36de004ee5
Implement running Discord and 4chan gathering simultaneously 2022-09-02 22:30:45 +01:00