Let's say you have access to an email account with the history of received emails from the last years (~10k emails) classified into 2 groups
- genuine email
- spam
How would you approach the task of creating a neural network solution that could be used for spam detection - basically classifying any email either as spam or not spam?
Let's assume that the email fetching is already in place and we need to focus on classification part only.
The main points which I would hope to get answered would be:
- Which parameters to choose as the input for the NN, and why?
- What structure of the NN would most likely work best for such task?
Also any resource recommendations, or existing implementations (preferably in C#) are more than welcome
Thank you
EDIT
- I am set on using neural networks as the main aspect on the project is to test how the NN approach would work for spam detection
- Also it is a "toy problem" simply to explore subject on neural networks and spam
- I should also mention that this is simply an exercise that my nephew came out with, and I was just asked for some advice. He is not a programmer by profession but with a pretty good programming skills. He simply wants to use that as a way to keep up with his programming skills and to explore the NN. His mind is very much set on "spam detection" in this context as well.