Multi-source OSINT data collection and parallel processing tool. Indexes 4chan, Discord and IRC, reorganizes the data into a common format, annotates language, sentiment and tokens in multiple threads, and outputs the results to Elasticsearch.
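
The pipeline described above (normalise records from several sources into one format, then annotate them in multiple threads) could be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the `FIELD_MAPS` table, the common field names (`msg`, `id`, `ts`, `src`) and the functions `normalise`, `annotate` and `process` are all hypothetical. Annotation is reduced to whitespace tokenisation. Language and sentiment detection and the write to Elasticsearch are left out.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical mapping from each source's raw keys to a common schema.
# The key names are assumptions for illustration only.
FIELD_MAPS = {
    "4chan": {"com": "msg", "no": "id", "time": "ts"},
    "discord": {"content": "msg", "id": "id", "timestamp": "ts"},
    "irc": {"message": "msg", "id": "id", "time": "ts"},
}


def normalise(source, raw):
    """Rename source-specific keys to the common schema and tag the source."""
    mapping = FIELD_MAPS[source]
    doc = {common: raw[key] for key, common in mapping.items() if key in raw}
    doc["src"] = source
    return doc


def annotate(doc):
    """Placeholder annotation: lowercase whitespace tokens only."""
    annotated = dict(doc)
    annotated["tokens"] = annotated.get("msg", "").lower().split()
    return annotated


def process(batch, workers=4):
    """Normalise and annotate (source, raw) pairs in a thread pool.

    pool.map returns results in input order, so output order matches the batch.
    """
    def handle(item):
        source, raw = item
        return annotate(normalise(source, raw))

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(handle, batch))


batch = [
    ("4chan", {"no": 1, "com": "Hello world", "time": 1700000000}),
    ("irc", {"id": 2, "message": "Second message", "time": 1700000001}),
]
docs = process(batch)
```

Each entry in `docs` is a flat dictionary with the same keys regardless of source, which is the shape a document store such as Elasticsearch would expect for indexing.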
 
Latest commit: 508b00e471 "Pre-create meta index" by Mark Veidemanis, 1 year ago
Name                     Updated      Last commit
docker                   1 year ago   Add pathogen network
legacy                   1 year ago   Fix mapping and make Threshold talk to SSDB
processing               1 year ago   Fully implement Elasticsearch indexing
schemas                  2 years ago  Implement threshold writing to Redis and manticore ingesting from Redis
sources                  2 years ago  Improve memory usage and fix 4chan crawler
.gitignore               2 years ago  Update gitignore
.pre-commit-config.yaml  2 years ago  Add ripsecrets to pre-commit hook
Makefile                 2 years ago  Clean up docker environment
db.py                    1 year ago   Pre-create meta index
env.example              1 year ago   Update env example file
monolith.py              2 years ago  Use only one Redis key for the queue to make chunk size more precise for thread allocation
requirements.txt         1 year ago   Clean up legacy and debugging code
util.py                  2 years ago  Implement sentiment/NLP annotation and optimise processing