views:

398

answers:

1

I'm beginning my foray into fulltext search for MSSQL2005. This is a new, in-dev database, so I can monkey around with it.

Let's say I have a "Form" table that has a one-to-many relationship to two other tables, "Answer" and "Comment." I want the user to be able to enter one search term and have it peg all three of those tables via fulltext search. What's the best way? It appears as though I can create an indexed view to speed things along.

If I do use an indexed view (is this the best route anyway?), how do I handle the one-to-many relationships? Let's assume all I really want to figure out is the ID of the "Form" table that would contain any search results found in the child Answer/Comment tables, so I can show the entire form that contains the located entries. An inner join on the view would result in multiple "master" form rows being returned - doesn't sound great for performance. How about concatenating all child answer/comment rows into one "column" for the view? So instead of....

Form 1, Answer A, Comment A 
Form 1, Answer B, Comment A 
Form 1, Answer A, Comment B 
Form 1, Answer B, Comment B

...it'd be...

Form 1, Answer A Answer B, Comment A Comment B

Remember all I care about is the ID of that master "form" row.

This seems logical to me, but I'm not sure the best practices around fulltext search yet, so just curious if I'm approaching it correctly.

UPDATE: Looks like indexed views are pretty strict; no left joins, no subqueries, so I might go down the path of a summary table populated by triggers if it isn't going to be too unwieldy.

+1  A: 

In SQL you should use this rule of thumb: - start with the element you want to retrieve - add filters to limit the set of elements to retrieve

that sounds like a no-brainer, but it's really all you have to do. Here, you want Form elements (rows), so you write:

SELECT ID
FROM Form

and then you add the filters to limit the total set to the set of forms you're interested in. You do this by using IN predicates. As you say Answer and Comment have a 1:n relationship with Form, this means that Answer and Comment have an FK to Form.ID. So we can add the where:

WHERE ID IN
(SELECT FormID FROM Answer WHERE CONTAINS(FullTextColumnInAnswer, @stringToSearch))
OR
ID IN
(SELECT FormID FROM Comment WHERE CONTAINS(FullTextColumnInComment, @stringToSearch))

this will lead to a set of Form.ID values which are distinct and which are the IDs of forms which have at least 1 answer OR comment which contains the string to search in the field which is full-text-search enabled.

On SqlServer 2000 this can be slower than on 2005, as you use multiple CONTAINS statements.

You can also use CONTAINSTABLE() in join scenario's instead, see the BOL for examples, though I'd go for the IN clause setup first to get things working.

Frans Bouma
I think perhaps I'm suffering from a case of premature optimization, so I think I'll run with the simple things like this first and see where that gets me.
Chris