bearodactyl@twiiku how did you scrape for all the data, could you put it on github as well for those who are curious? Would be interesting to play around with it (search the amount of times 'uwu' was uttered and find Percy's bind takes the gold)
Had the chance to get an entire dump of logs.tf for illuminati matters, I think it contains the entire history up to Jul 4th, that amounts to like 10GB, the contents are basically what you get from calling the JSON API on the site.
bearodactylWould be interesting to see a graph for each person or stats on per year or something cause I definitely cleaned up my act.
I'm sure there's a bunch of cool things to extract from that dump, I'm confident game data would be more interesting than half-baked, context-free offense analysis.
Ombracki dont really see the point of doing something like that, if not to emphasize even more the witch hunt
I did not intend for this to be public or to be used in any capacity, it just leaked from when I talked about it on a few Discord servers. I just had access to data and decided to have "morbid" fun.
smzican you check for 'retard' aswell feel like thats most commonly used to express anger
100M messages (although not all are user-generated, some come from server configs)
282k "r****d"
117k "f****t"
77k "n****r" (I think it was hard R only)
4k "t****y"
albaI'm suprised I had dropped no no word 3 times.
Bear in mind that it's like a very very rough analysis, it's possible that there are false positives.