Commit Graph

445 Commits

SHA1 Message Date
3ed382ec13 Implement restricted sources 2022-09-18 13:01:19 +01:00
dab5e81715 Fix merge conflict 2022-09-16 17:45:24 +01:00
143f2a0bf0 Implement sentiment/NLP annotation and optimise processing 2022-09-16 17:09:49 +01:00
4ea77ac543 Properly process Redis buffered messages and ingest into Kafka 2022-09-14 18:32:32 +01:00
fec0d379a6 Ingest into Kafka and queue messages better 2022-09-13 22:17:46 +01:00
3c2adfc16e Implement Apache Druid/Kafka and Metabase 2022-09-13 22:17:32 +01:00
4c6fe87b88 Switch to latest image for dev docker-compose 2022-09-13 09:20:43 +01:00
79a430be04 Begin implementing Apache Druid 2022-09-08 07:20:30 +01:00
baea6aebeb Use stable after all 2022-09-08 07:20:30 +01:00
eaecc5cdbe Switch production image back to dev 2022-09-08 07:20:30 +01:00
764e36ef14 Lower memory requirements to prevent crashes 2022-09-08 07:20:30 +01:00
50a873dbba Set dev image back to the default 2022-09-12 08:43:18 +01:00
21182629b4 Treat text fields as string and try beta Kibana image 2022-09-12 08:27:13 +01:00
dfd71b6c64 Add Mysql port to ports instead of expose 2022-09-10 13:20:06 +01:00
1b0817b047 Expose the Mysql port 2022-09-10 13:16:19 +01:00
0ba4929294 Use dev image of manticore 2022-09-10 12:03:45 +01:00
caded433b7 Remove indexer block to attempt to prevent Manticore DB crash 2022-09-08 07:20:30 +01:00
bf802d7fdf Reformat 2022-09-07 07:20:30 +01:00
89328a827a Raise open files limit for Redis 2022-09-07 07:20:30 +01:00
32249a1d99 Add 4chan update message type to main types 2022-09-07 07:20:30 +01:00
cdd12cd082 Implement threshold writing to Redis and manticore ingesting from Redis 2022-09-07 07:20:30 +01:00
137299fe9e Add config directories to gitignore 2022-09-08 09:45:18 +01:00
2aedcf77a0 Add aioredis 2022-09-08 09:44:27 +01:00
49784dfbe5 Implement ingesting to Redis from Threshold 2022-09-07 07:20:30 +01:00
a6b5348224 Config relative to Git dir 2022-09-05 07:20:30 +01:00
d0fe2baafe Store persistent database elsewhere 2022-09-05 07:20:30 +01:00
e092327932 Improve DB performance with caching 2022-09-05 07:20:30 +01:00
8b9ad05089 Reformat legacy project 2022-09-05 07:20:30 +01:00
6b082adeb2 Merge branch 'threshold' 2022-09-06 12:50:25 +01:00
bd9f9378cf Moved files to subdirectory 2022-09-06 12:50:09 +01:00
62fe03a6cb Increase thread delay time 2022-09-05 07:20:30 +01:00
297bbbe035 Alter schemas and 4chan performance settings 2022-09-05 07:20:30 +01:00
ed7c439b56 Remove some debugging code 2022-09-05 07:20:30 +01:00
ecb8079b5b Change Python to 3.10 2022-09-05 07:20:30 +01:00
6811ce4af5 Update production env file path 2022-09-05 07:20:30 +01:00
e34d281774 Remove development dotenv loading 2022-09-05 07:20:30 +01:00
91e18c60e6 Add debug statement 2022-09-05 07:20:30 +01:00
9c9d49dcd2 Reformat and set the net and channel for 4chan 2022-09-05 07:20:30 +01:00
dcd648e1d2 Make crawler more efficient and implement configurable parameters 2022-09-05 07:20:30 +01:00
318a8ddbd5 Split thread list into chunks to save memory 2022-09-05 07:20:30 +01:00
20e22ae7ca Reformat code 2022-09-04 21:40:04 +01:00
8feccbbf00 Reinstate Redis cache 2022-09-04 21:38:53 +01:00
db46fea550 Run processing in thread 2022-09-04 21:29:00 +01:00
22cef33342 Implement aiohttp 2022-09-04 19:44:25 +01:00
663a26778d Begin implementing aiohttp 2022-09-04 13:47:32 +01:00
36de004ee5 Implement running Discord and 4chan gathering simultaneously 2022-09-02 22:30:45 +01:00
2c3d83fe9a Fix error when no email can be found 2022-08-27 11:19:28 +01:00
d7adffb47f Fix getting first relay when they are not sequential 2022-08-26 22:17:12 +01:00
4f4820818a Log authentication messages 2022-08-16 23:01:42 +01:00
5cc38da00e Implement deduplicating channels 2022-08-16 22:01:35 +01:00