matching

Notepad++ I'm looking for a regexp to select all occurances of 'href="' that do not match 'href="javascript'

This is about the code editor Notepad++. I'm looking for a regular expression that will solve the following problem: I have a set of html files. I need to find all links in them that are not links to javascript functions. If I search for the string 'href="' I get 342 results and if I search for 'href="javascript' I get 301 results. I'd...

Copy between matches, not the lines, but things between match-points

:/From book:/,/$/ cmd_copy chapters_from_match_@From_book@_until_end_of_line.txt I tried not to use words "write", "put" or "read" as VIM has special meaning for them. I try to copy (sorry not referring to VIM's copy-command) the thing between matches to a file. How do you do it, without copying the whole lines? Dummy example TEXT: ...

regex to pull out strings inside #if debug #endif block

Hi I have an application with a large number of #if debug blocks which kind of look like the the one below: #if DEBUG Console.ForegroundColor = ConsoleColor.DarkCyan; Console.WriteLine("oldXml: " + oldXml.OuterXml); Logging.Log("XmlDiff: " + diff_sb.ToString()); ...

What kind of algorithm is better for unordered sequence matching problem?

If I have two sequences (for example, string) // 01234567890123456789012 a = "AAACDDFFFEE1122VV1VAADD" // 0123456789012345678901 b = "DDFFAA11221DHHVV1VAAFE" I want to know the best substring matching (unordered) from b to a, for instance: optimal (6 matched parts, 19 characters of a matched) b a DDFF -> DDFF (4...

Resumee matching algorithm

I am building a job site -- yes, there isn't enough of those yet. One of the problems I came across in my research is how to match the relevant resumes to the interested recruiters. The most boring solution I thought of is to use textual analysis to parse the resumes for tags recruiters specify -- which has a drawback: the resume might b...

Maximum weight bipartite matching

I have a graph in form of a rectangular grid, i.e. N nodes and 2N edges, all adjacent nodes are connected. This means it is two-colourable, and hence it is possible to do bipartite matching on it. Each (undirected) edge has a weight assigned to it - either -2, -1, 0, 1 or 2. No other values are allowed How would I go about finding the ...

Calling search gurus: Numeric range search performance with Lucene?

I'm working on a system that performs matching on large sets of records based on strings and numeric ranges, and date ranges. The String matches are mostly exact matches as far as I can tell, as opposed to less exact full text search type results that I understand lucene is generally designed for. Numeric precision is important as the da...

mysql search exact word with $ in word

I've been trying to find EXACT words and my research isn't helping me. Solutions I've found: $query .= "WHERE text REGEXP '[[:<:]]($word)[[:>:]]'"; // OR $query .= "WHERE MATCH(text) AGAINST('$word') "; but neither are returning my matches. I'm searching for stock symbols in my db (eg $aapl). And using LIKE '%$word%' will return $ba...

R - selection of the kth element of a column

I have performed statistical matching in R. For each case "VAR2002", I have found one or more statistical twin(s) "VAR2004". In R, I have a data frame "TwinWeight" like this: VAR2002 VAR2004 Weight 1 2955 1.00000000 2 3961 1.00000000 3 2913 0.33333333 3 3430 0.33...

perl substitute multiple lines

Hi all, I have a file which looks like: <QUERY name="Secondary"> <SQLStmt>select * from my_tb; </SQLStmt> <QUERY name="primary"> <SQLStmt>select * from my_tb; </SQLStmt> <QUERY name="last"> <SQLStmt>select * from my_tb; </SQLStmt> I need to substitute the SQLStmn...