Commit Graph

493 Commits

Author SHA1 Message Date
Mark Veidemanis 38d00f2c21 Implement restricted sources 2022-09-18 13:01:19 +01:00
Mark Veidemanis cb11ce9b12 Fix merge conflict 2022-09-16 17:45:24 +01:00
Mark Veidemanis a89b5a8b6f Implement sentiment/NLP annotation and optimise processing 2022-09-16 17:09:49 +01:00
Mark Veidemanis f432e9b29e Properly process Redis buffered messages and ingest into Kafka 2022-09-14 18:32:32 +01:00
Mark Veidemanis c5f01c3084 Ingest into Kafka and queue messages better 2022-09-13 22:17:46 +01:00
Mark Veidemanis 47c5f89914 Implement Apache Druid/Kafka and Metabase 2022-09-13 22:17:32 +01:00
Mark Veidemanis 68fd5fa230 Switch to latest image for dev docker-compose 2022-09-13 09:20:43 +01:00
Mark Veidemanis fd90c233c2 Begin implementing Apache Druid 2022-09-08 07:20:30 +01:00
Mark Veidemanis 0eb4a04b89 Use stable after all 2022-09-08 07:20:30 +01:00
Mark Veidemanis e196172e04 Switch production image back to dev 2022-09-08 07:20:30 +01:00
Mark Veidemanis 41a8cea873 Lower memory requirements to prevent crashes 2022-09-08 07:20:30 +01:00
Mark Veidemanis 9cf4e945d1 Set dev image back to the default 2022-09-12 08:43:18 +01:00
Mark Veidemanis 04b5dec843 Treat text fields as string and try beta Kibana image 2022-09-12 08:27:13 +01:00
Mark Veidemanis 40492b1595 Add Mysql port to ports instead of expose 2022-09-10 13:20:06 +01:00
Mark Veidemanis 90fed411e6 Expose the Mysql port 2022-09-10 13:16:19 +01:00
Mark Veidemanis 0dde7d6f30 Use dev image of manticore 2022-09-10 12:03:45 +01:00
Mark Veidemanis fed3cdbf5a Remove indexer block to attempt to prevent Manticore DB crash 2022-09-08 07:20:30 +01:00
Mark Veidemanis c2bdb3fd15 Reformat 2022-09-07 07:20:30 +01:00
Mark Veidemanis 5d042c1259 Raise open files limit for Redis 2022-09-07 07:20:30 +01:00
Mark Veidemanis 92475ee9a9 Add 4chan update message type to main types 2022-09-07 07:20:30 +01:00
Mark Veidemanis 5c3b338017 Implement threshold writing to Redis and manticore ingesting from Redis 2022-09-07 07:20:30 +01:00
Mark Veidemanis 54ea5fa8e9 Add config directories to gitignore 2022-09-08 09:45:18 +01:00
Mark Veidemanis e79de2b377 Add aioredis 2022-09-08 09:44:27 +01:00
Mark Veidemanis 79b1bee9e4 Implement ingesting to Redis from Threshold 2022-09-07 07:20:30 +01:00
Mark Veidemanis e3b2e1f36d Config relative to Git dir 2022-09-05 07:20:30 +01:00
Mark Veidemanis eb71dd76f8 Store persistent database elsewhere 2022-09-05 07:20:30 +01:00
Mark Veidemanis cd5eb61455 Improve DB performance with caching 2022-09-05 07:20:30 +01:00
Mark Veidemanis a198f2a487 Reformat legacy project 2022-09-05 07:20:30 +01:00
Mark Veidemanis 19ee7071f5 Merge branch 'threshold' 2022-09-06 12:50:25 +01:00
Mark Veidemanis 122fdca5db Moved files to subdirectory 2022-09-06 12:50:09 +01:00
Mark Veidemanis 7bb2264d91 Increase thread delay time 2022-09-05 07:20:30 +01:00
Mark Veidemanis 1858e06c4b Alter schemas and 4chan performance settings 2022-09-05 07:20:30 +01:00
Mark Veidemanis ddcfa614ad Remove some debugging code 2022-09-05 07:20:30 +01:00
Mark Veidemanis a1b193c6da Change Python to 3.10 2022-09-05 07:20:30 +01:00
Mark Veidemanis 7606d77c2b Update production env file path 2022-09-05 07:20:30 +01:00
Mark Veidemanis aaf2d58d86 Remove development dotenv loading 2022-09-05 07:20:30 +01:00
Mark Veidemanis d7e49777ef Add debug statement 2022-09-05 07:20:30 +01:00
Mark Veidemanis d1c6bd1fb5 Reformat and set the net and channel for 4chan 2022-09-05 07:20:30 +01:00
Mark Veidemanis b8d2ecc009 Make crawler more efficient and implement configurable parameters 2022-09-05 07:20:30 +01:00
Mark Veidemanis f8fc5e1a1b Split thread list into chunks to save memory 2022-09-05 07:20:30 +01:00
Mark Veidemanis 6e00f70184 Reformat code 2022-09-04 21:40:04 +01:00
Mark Veidemanis 0f717b987d Reinstate Redis cache 2022-09-04 21:38:53 +01:00
Mark Veidemanis 60c43b4eb5 Run processing in thread 2022-09-04 21:29:00 +01:00
Mark Veidemanis db23b31f30 Implement aiohttp 2022-09-04 19:44:25 +01:00
Mark Veidemanis f7860bf08b Begin implementing aiohttp 2022-09-04 13:47:32 +01:00
Mark Veidemanis 734a2b7879 Implement running Discord and 4chan gathering simultaneously 2022-09-02 22:30:45 +01:00
Mark Veidemanis 2731713ede Fix error when no email can be found 2022-08-27 11:19:28 +01:00
Mark Veidemanis 2a2f24f570 Fix getting first relay when they are not sequential 2022-08-26 22:17:12 +01:00
Mark Veidemanis c7941bfcda Log authentication messages 2022-08-16 23:01:42 +01:00
Mark Veidemanis 49b0b9db46 Implement deduplicating channels 2022-08-16 22:01:35 +01:00