Hello all-knowing co-stackers.
Our masters thesis project is creating a database schema analyzer. As a foundation to this, we are working on quantifying bad database design.
Our supervisor has tasked us with analyzing a real world schema, of our choosing, such that we can identify some/several design issues. These issues are to be used as a starting point in the schema analyzer.
Finding a good schema is a bit difficult because we do not want a schema which is well designed in all aspects, but a schema that is more "rare to medium".
We have already scheduled the following schemas for analysis: wikimedia, moodle and drupal. Not sure in which category each fit. It is not necessary that the schema is open source.
The database engine used is not important, though we would like to focus on SQL server, Posgresql and Oracle.
Thanks to all in advance. :)
EDIT: Great responses already! I will surely work my way through them. For now literature will be deferred, as this task is supposed to give us real world examples which can be used in the thesis. I.e. "Design X is perceived by us as bad design, which our analyzer identifies and suggests improvements to", instead of coming up with contrived examples.
EDIT2: Thank you all for your responses. I have accepted the post which we, at the end, found most beneficial. I will update this post when we have some kind of a tool ready :)