Multi-source OSINT data collection and parallel processing tool. Indexes 4chan, Discord, and IRC; normalizes the data into a common format; annotates language, sentiment, and tokens across multiple threads; and writes the results to Elasticsearch.
Mark Veidemanis b6f8dabccd Fix Java variable in indexer parameters 2022-09-22 08:41:59 +01:00
Name                     Last commit                                                              Date
docker                   Fix Java variable in indexer parameters                                  2022-09-22 08:41:59 +01:00
legacy                   Remove debugging code and fix regex substitution                         2022-09-21 12:48:54 +01:00
processing               Remove debugging code and fix regex substitution                         2022-09-21 12:48:54 +01:00
schemas                  Implement threshold writing to Redis and manticore ingesting from Redis  2022-09-07 07:20:30 +01:00
sources                  Remove commented code for debugging                                      2022-09-21 10:02:05 +01:00
.gitignore               Add config directories to gitignore                                      2022-09-08 09:45:18 +01:00
.pre-commit-config.yaml  Reinstate Redis cache                                                    2022-09-04 21:38:53 +01:00
db.py                    Remove commented code for debugging                                      2022-09-21 10:02:05 +01:00
docker-compose.yml       Decrease memory requirements further and switch Kafka image              2022-09-21 21:11:13 +01:00
env.example              Document new PROCESS_THREADS setting in example file                     2022-09-20 22:43:04 +01:00
environment              Fix Java variable in indexer parameters                                  2022-09-22 08:41:59 +01:00
event_log.txt            Implement sentiment/NLP annotation and optimise processing               2022-09-16 17:09:49 +01:00
monolith.py              Remove commented code for debugging                                      2022-09-21 10:02:05 +01:00
requirements.txt         Implement sentiment/NLP annotation and optimise processing               2022-09-16 17:09:49 +01:00
util.py                  Implement sentiment/NLP annotation and optimise processing               2022-09-16 17:09:49 +01:00
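
The pipeline summarized in the project description (normalize each source into a common format, then annotate in multiple threads) can be pictured with the minimal sketch below. It is not the repository's code: FIELD_MAP, normalize, annotate, process, and the toy word lists are all hypothetical names invented for illustration, the real field layouts presumably live under schemas/, and the sentiment score is a crude stand-in for whatever NLP library the project actually uses. Language detection and the Elasticsearch write are left out. Reading PROCESS_THREADS from the environment as an integer is an assumption based only on the setting name mentioned in the commit log.

```python
import os
from concurrent.futures import ThreadPoolExecutor

# Hypothetical per-source field mappings (native field -> common field).
# Assumption: these names are illustrative, not the project's real schemas.
FIELD_MAP = {
    "4ch": {"com": "msg", "no": "id", "time": "ts"},
    "dis": {"content": "msg", "message_id": "id", "timestamp": "ts"},
    "irc": {"text": "msg", "num": "id", "time": "ts"},
}

# Toy word lists standing in for a real sentiment model.
POSITIVE = {"good", "great"}
NEGATIVE = {"bad", "awful"}


def normalize(source, raw):
    """Rename source-specific fields to the common names and tag the source."""
    mapping = FIELD_MAP[source]
    event = {common: raw[native] for native, common in mapping.items() if native in raw}
    event["src"] = source
    return event


def annotate(event):
    """Attach whitespace tokens and a naive word-count sentiment score."""
    tokens = event.get("msg", "").lower().split()
    score = sum((t in POSITIVE) - (t in NEGATIVE) for t in tokens)
    return {**event, "tokens": tokens, "sentiment": score}


def process(batch, threads=None):
    """Normalize and annotate (source, raw) pairs in a thread pool.

    Input order is preserved because Executor.map yields results in order.
    """
    threads = threads or int(os.getenv("PROCESS_THREADS", "4"))
    with ThreadPoolExecutor(max_workers=threads) as pool:
        return list(pool.map(annotate, (normalize(s, r) for s, r in batch)))


if __name__ == "__main__":
    sample = [
        ("4ch", {"com": "good thread", "no": 1, "time": 10}),
        ("irc", {"text": "bad bad day", "num": 2, "time": 11}),
    ]
    for event in process(sample, threads=2):
        print(event)
```

A thread pool suits this sketch because it keeps the example short. For CPU-bound annotation in CPython, a process pool may scale better, so treat the choice of executor here as illustrative rather than a statement about how the project parallelizes work.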