Many people are tired of obtrusive words with no value, like these:
- f**king
- Id|ot
- <|>
- whaaaat????!!!!???
I plan to detect suspicious records and then to verify them manually. In other words, to find rules which detect that something is most likely obtrusive. Is there any reasonable solution? I am thinking about these REGEX rules:
\w\W+\w
\D{3,}
Can you recommend some other rules or links to read?
Edit
Because of "Hello, my friend"
the first rule could be
\w[^\w\s]+\w