As part of my final-year project for an undergraduate Computer Science degree, I want to write a search engine that ranks websites by how well they conform to web standards. I've read several articles and posts about how difficult writing a search engine can be, but I'm feeling ambitious and hope to have a working one within 5-6 months. It obviously would not be commercially viable or meant for real users, but it should be able to crawl my own domain of several hundred pages and rank them by subject and by how well they conform to W3C standards.
I'd like to try this in a language like C#, PHP or Python, but before I get ahead of myself I'd like to know what knowledge I would need to undertake such a project, and whether it is doable in half a year.
For me this is purely a learning exercise to test what I am capable of. I know there are many open-source search libraries, such as Lucene.NET, that I could use in a real-world situation, but I'd like to try writing one myself. Do you think a final-year undergrad is capable of writing a functional search engine for a small website?
EDIT: This would be an online search engine, usable through a web page front end. I'd only want to crawl the web pages on a dummy website I've put up, consisting of no more than fifty pages for now. The idea is to use several metrics to determine which website is best from a design perspective, most notably by using a code validator.
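To make the idea more concrete, here is a rough Python sketch of the pipeline I have in mind: crawl, index the words on each page, then rank matches by relevance divided by a penalty for markup errors. Everything in it is an assumption for illustration only. The `PAGES` dictionary stands in for my dummy site, the names `PageParser`, `crawl` and `search` are made up, the `penalty` weight is arbitrary, and the error count is a crude tag-nesting check that is not a real W3C validation.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical in-memory site standing in for the dummy website.
PAGES = {
    "http://example.test/": (
        '<html><body><p>search engine basics</p>'
        '<a href="/clean.html">clean</a> <a href="/messy.html">messy</a>'
        '</body></html>'
    ),
    "http://example.test/clean.html": "<html><body><p>search ranking</p></body></html>",
    "http://example.test/messy.html": "<html><body><p>search <b>ranking</body></html>",
}

# Elements that never take a closing tag.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class PageParser(HTMLParser):
    """Collects links, words, and a crude count of tag-nesting errors."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links, self.words, self.errors = [], [], 0
        self._stack = []  # currently open tags

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(urljoin(self.base_url, href))
        if tag not in VOID_TAGS:
            self._stack.append(tag)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if tag not in self._stack:
            self.errors += 1  # closing tag with no matching opener
            return
        while self._stack.pop() != tag:
            self.errors += 1  # an inner tag was left unclosed

    def handle_data(self, data):
        self.words.extend(re.findall(r"[a-z0-9]+", data.lower()))

    def close(self):
        super().close()
        self.errors += len(self._stack)  # tags still open at end of page
        self._stack.clear()


def crawl(start_url, fetch):
    """Breadth-first crawl; fetch(url) returns HTML text or None."""
    seen, queue, index = set(), [start_url], {}
    while queue:
        url = queue.pop(0)
        if url in seen:
            continue
        seen.add(url)
        html = fetch(url)
        if html is None:
            continue
        parser = PageParser(url)
        parser.feed(html)
        parser.close()
        index[url] = (parser.words, parser.errors)
        queue.extend(parser.links)
    return index


def search(index, query, penalty=0.5):
    """Rank pages by term count, scaled down by the markup error count."""
    terms = query.lower().split()
    scored = []
    for url, (words, errors) in index.items():
        relevance = sum(words.count(term) for term in terms)
        if relevance:
            scored.append((relevance / (1 + penalty * errors), url))
    return [url for _, url in sorted(scored, reverse=True)]


if __name__ == "__main__":
    site_index = crawl("http://example.test/", PAGES.get)
    print(search(site_index, "ranking"))
```

In a real version `fetch` would download pages over HTTP and stay within my own domain, and the error count would come from an actual validator rather than this nesting check. The sketch is only meant to show how a validity metric could feed into the ranking.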