views:

24

answers:

1

Hi

Does anyone have information on how to create a custom word breaker for SQL Server 2005. I'd prefer to write it in c#. I need to be able to search on terms such as 'c#', 'f#' etc but the '#' character is a word breaker in the English (UK) word breaker component and can't be changed in any other way.

I have found the following article which provides an incomplete (the IWordSink interface is missing) sample and references an article that is no longer available. This article also doesn't provide any of the thread-checking code I'd expect to see. http://bytes.com/topic/sql-server/answers/864969-custom-wordbreaker-sql-server-full-text

Alternately, someone could point me to how to decompile the existing English word breaker 'langwrbk.dll' so I could make the little change I need the existing code?

Thanks

Kirk

+1  A: 

The technology for word breakers and stemmers is common across all the Microsoft Search products, including SQL Server Fulltext. The Search SDK is well and fully documented see Extending the Index and Windows Search Developer's Guide.

Reverse engineering the langwrbk.dll would be against the user license you agreed on, as is very clearly stipulated that you cannot reverse engineer, decompile or disassemble the Software. Not to mention that is code signed and you wouldn't be able to 'make a little change'...

Remus Rusanu
Yes, Remus you are right I shouldn't decompile/disassemble the dll for all those very good reasons. I was just frustrated when I wrote my original post.
Keith K