Multi-source OSINT data collection and parallel processing tool. Indexes 4chan, Discord and IRC, reorganizes the data into a common format, annotates language, sentiment and tokens in multiple threads, and outputs the results to Elasticsearch.
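The pipeline described above (collect, convert to a common format, annotate in threads, output to Elasticsearch) could look roughly like the following minimal sketch. It is not the repository's code: the record fields, the `TEXT_FIELD` mapping, the function names and the `messages` index name are all assumptions made for illustration. Only token annotation is shown, and language and sentiment annotation would slot into the same step. The output is formatted as Elasticsearch bulk API NDJSON but is not sent anywhere.

```python
import json
from concurrent.futures import ThreadPoolExecutor

# Hypothetical raw records; the field names are assumptions, not the repository's schemas.
RAW = [
    {"src": "4chan", "com": "Hello world", "no": 1},
    {"src": "discord", "content": "Good morning all", "id": "2"},
    {"src": "irc", "msg": "bad day today", "nick": "alice"},
]

# Assumed name of the field that carries the message text for each source.
TEXT_FIELD = {"4chan": "com", "discord": "content", "irc": "msg"}


def normalise(raw):
    """Map a source-specific record onto one common shape."""
    return {"src": raw["src"], "msg": raw[TEXT_FIELD[raw["src"]]]}


def annotate(doc):
    """Add a token list; language and sentiment annotation would be added here too."""
    out = dict(doc)
    out["tokens"] = doc["msg"].lower().split()
    return out


def run(raw_records, workers=4):
    """Normalise every record, then annotate the results in a thread pool."""
    common = [normalise(r) for r in raw_records]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in input order.
        return list(pool.map(annotate, common))


def to_bulk(docs, index="messages"):
    """Serialise documents as bulk NDJSON: one action line, then one source line."""
    lines = []
    for doc in docs:
        lines.append(json.dumps({"index": {"_index": index}}))
        lines.append(json.dumps(doc))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(to_bulk(run(RAW)), end="")
```

Keeping normalisation separate from annotation means a new source only needs an entry in the field mapping, while the threaded annotation step stays unchanged.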
Mark Veidemanis a89b5a8b6f Implement sentiment/NLP annotation and optimise processing 2 years ago
docker Implement sentiment/NLP annotation and optimise processing 2 years ago
legacy Treat text fields as string and try beta Kibana image 2 years ago
processing Implement sentiment/NLP annotation and optimise processing 2 years ago
schemas Implement threshold writing to Redis and manticore ingesting from Redis 2 years ago
sources Implement sentiment/NLP annotation and optimise processing 2 years ago
.gitignore Add config directories to gitignore 2 years ago
.pre-commit-config.yaml Reinstate Redis cache 2 years ago
db.py Implement sentiment/NLP annotation and optimise processing 2 years ago
docker-compose.yml Implement sentiment/NLP annotation and optimise processing 2 years ago
environment Implement Apache Druid/Kafka and Metabase 2 years ago
event_log.txt Implement sentiment/NLP annotation and optimise processing 2 years ago
monolith.py Implement sentiment/NLP annotation and optimise processing 2 years ago
requirements.txt Implement sentiment/NLP annotation and optimise processing 2 years ago
util.py Implement sentiment/NLP annotation and optimise processing 2 years ago