Multi-source OSINT data collection and parallel processing tool. Indexes 4chan, Discord and IRC, reorganizes the data into a common format, annotates language, sentiment and tokens in multiple threads, and outputs the results to Elasticsearch.
Go to file
Mark Veidemanis fec0d379a6
Ingest into Kafka and queue messages better
2022-09-13 22:17:46 +01:00
docker Implement Apache Druid/Kafka and Metabase 2022-09-13 22:17:32 +01:00
legacy Treat text fields as string and try beta Kibana image 2022-09-12 08:27:13 +01:00
processing Ingest into Kafka and queue messages better 2022-09-13 22:17:46 +01:00
schemas Implement threshold writing to Redis and manticore ingesting from Redis 2022-09-07 07:20:30 +01:00
sources Ingest into Kafka and queue messages better 2022-09-13 22:17:46 +01:00
.gitignore Add config directories to gitignore 2022-09-08 09:45:18 +01:00
.pre-commit-config.yaml Reinstate Redis cache 2022-09-04 21:38:53 +01:00
db.py Ingest into Kafka and queue messages better 2022-09-13 22:17:46 +01:00
docker-compose.yml Implement Apache Druid/Kafka and Metabase 2022-09-13 22:17:32 +01:00
environment Implement Apache Druid/Kafka and Metabase 2022-09-13 22:17:32 +01:00
monolith.py Ingest into Kafka and queue messages better 2022-09-13 22:17:46 +01:00
requirements.txt Implement Apache Druid/Kafka and Metabase 2022-09-13 22:17:32 +01:00
util.py Run processing in thread 2022-09-04 21:29:00 +01:00