ansaurus

Question

Answer 1

A:

You regex would be something like this

/.*word1=(\w+)/

Lex 2010-09-29 16:19:59

This also doesn't work

NullUserException 2010-09-29 16:24:33

If you edit your answer, it would be nice to comment about it. I was confused for a while why this wouldn't work. Though the starting `.*` is still pointless.

teukkam 2010-09-29 16:31:14

Answer 2

A:

Use: /word1=(\w+)/

Ruel 2010-09-29 16:20:04

Yep, thanks about that. Edited. The non-greedy matching caused the regex to match a single character only. :P

Ruel 2010-09-29 16:26:23

Answer 3

+3 A:

Given the following regex...

/word1=(\w+)/

...$1 or whatever your first match is in your language will be wrestle.

Dave Pirotte 2010-09-29 16:20:41

in python, what should it look like? thanks

James Eggers 2010-09-29 16:25:18

I believe it's `result = re.match(pattern, string)`

Ruel 2010-09-29 16:28:28

@James see my answer

NullUserException 2010-09-29 16:31:26

@Ruel: You want `re.search()`, not `re.match()`. The latter always anchors the search to the start of the string.

Tim Pietzcker 2010-09-29 16:32:18

thanks @NullUserException, it works :)

James Eggers 2010-09-29 16:45:59

Answer 4

A:

Assuming it is always separated by spaces

word1=([^ ]+)

Then you can get the value by the first group match.

BrunoLM 2010-09-29 16:21:17

Answer 5

+5 A:

The question is not very clear, but I guess this is what you are looking for:

word1=(\w+)

Your match will be in the 1st group. Here's some sample Python code:

import re
yourstring = 'type=weaksubj len=1 word1=wrestle pos1=verb stemmed1=y priorpolarity=negative'

m = re.search(r'word1=(\w+)', yourstring)
print m.group(1)

As seen on codepad. A more generalized solution:

import re
def get_attr(str, attr):
    m = re.search(attr + r'=(\w+)', str)
    return None if not m else m.group(1)

str = 'type=weaksubj len=1 word1=wrestle pos1=verb stemmed1=y priorpolarity=negative'

print get_attr(str, 'word1')  # wrestle
print get_attr(str, 'type')   # weaksubj
print get_attr(str, 'foo')    # None

Also available on codepad

NullUserException 2010-09-29 16:21:58

thanks, that worked :)

James Eggers 2010-09-29 16:45:31

Great answer. +1

Ruel 2010-09-29 23:38:46

Answer 6

A:

Maybe re is unnecessary when str.split looks like it will suffice:

>>> s = "type=weaksubj len=1 word1=wrestle pos1=verb stemmed1=y priorpolarity=negative"
>>> dd = dict(ss.split('=',1) for ss in s.split())
>>> dd['word1']
'wrestle'

Paul McGuire 2010-09-29 21:47:53

ansaurus

tags:

views:

answers:

Regular Expressions...

related questions