Fast Sequence Alignment on Unicode Strings | ansaurus

tags:

views:

33

answers:

1

Q:

Fast Sequence Alignment on Unicode Strings

I want to run something like the BLAST algorithm to query a large database of unicode strings. Most of the alignment software like BLAST expects nucleotide or protein strings as input. But my input could potentially contain any unicode character. Is anyone aware of a piece of software that will let me do this? The scoring matrix could just be the identity matrix (no partial matching.)

I have tried Needleman-Wunsch and Smith Waterman but for my purposes they are too slow. I need to query a large database, as in BLAST.

Thank you!

A:

BLAST can be used to align sequences of characters from any alphabet. You will probably need to implement it yourself, since most of the publicly available implementations are tailored to proteins, but the algorithm is not specific to proteins or nucleotide sequences.

Colin 2010-09-02 20:35:03

related questions

Where to use Zend Framework translation tool

Recommendations for Automated Translation Tools for .NET

i18n - best practices for internationalization - XLIFF, gettext, INI, ... ??

Translating Qt applications

Is there something like a translating database for strings?

Conversion of Fortran 77 code to C++

How do you handle translation of text with markup?

Parse multiple languages in php

What is a Ruby equivalent for Python's "zip" builtin?

Online Translation

Easy way to translate from DTO to Entity and Entity to DTO?

Coding in Other (Spoken) Languages

Software translation services

Seriously, should I write bad PHP code?

How do you implement a multiculture web application

Tool for translation of Oracle PL/SQL into Postgresql PL/pgSQL

Automated Python to Java translation

Algorithm to estimate number of English translation words from Japanese source

What options do you recommend for language translation on content driven Web sites?

Translate algorithmic C to Python

What's the easiest way to use C source code in a Java application?

"Could not load type" in web service converted to VB.NET

How to support multiple languages on a microcontroller?

Can I export translations of place names from freebase.com

Where can I learn more about PyPy's translation function?