ansaurus

Question

Python - to check if a char is in dictionary and if not to deal with it

Answer 1

+1 A:

I think you want something like this:

tokenMapping = {"&&" : "and"}

for token in source file: # <-- pseudocode
    translatedToken = tokenMapping[token] if token in tokenMapping else "transliteration unknown"

If there's a translation in the dictionary (e.g. "&&" -> "and"), it will use that. Else it will translate to "transliteration unknown".

Hope that helped.

EDIT: As LeafStorm suggested, a dictionary's get function can be used to simplify the above code. The code line in the loop would become

    translatedToken = tokenMapping.get(token, "transliteration unknown")

AndiDog 2010-02-13 14:15:07

will just check it and get back to you Sir..

mgj 2010-02-13 14:26:52

I need to run the entire code, I am currently dealing with certain errors will surely get back to you sir Thank you for you time:)

mgj 2010-02-13 15:01:22

Answer 2

A:

dictx = {}
for itm in my_source :
    dictx[itm] = dictx.get(itm, 0) + 1

I didn't completely understand the details of your question, but here's the simplest example i could think of that illustrates the pattern i think you are after.

The 'get' method i believe is what you want. It allows you to retrieve a key from a dictionary, but if the key is not there, you can set a default value--i.e., "i want dictx[itm] (the value assigned to the key 'itm') but if 'itm' is not in dictionary then create it and value of .'

This snippet will loop through your source document ('my_source') and count the frequency of the various items in it, adding those counts as values to the keys already in your dictionary, but when it reaches an item for which no key exists, no exception is thrown, a key is added and a value of '0' assigned.

doug 2010-02-13 14:37:29

Let me give you an e.g. Sir.. Say the source file contaings "Hi! What are you doing" Now I need to check for each char or a set of char and see for their equivalent transliteration in a dictionary, but certain characters like '!' are to be copied as it is from source to destination and they have no equivalent in transliteration but their original forms.. My question was how to check if its in the dictionary and print its equivalent if any that exists, and if not how to print the original char(like '!') as it is if no equivalent is in the dictionary. Thank you for your support Sir..:)

mgj 2010-02-13 14:43:14

Answer 3

+3 A:

My recommendation, given that rules is a mapping of the characters to their transliterated equivalents:

results = []
for char in source_text:
    results.append(rules.get(char, char))
return ''.join(results)    # turns the list back into a string

A dict's get method will return either the value for a key or a default value if the key does not exist - normally the default value is None, but in this case, we gave the same character as the default value (the second argument) so that if the key is not found it will just return itself.

A more compact way to write this using generator expressions would be:

''.join((rules.get(char, char) for char in source_text))

LeafStorm 2010-02-13 14:49:36

Thank You Sir:)

mgj 2010-02-13 15:02:00

Answer 4

A:

This seems pretty straightforward. If your dictionary is char to char, then you would do something like

outstr = ''
for ch in instr:
    if ch in mydict:
        outstr += mydict[ch]
    else:
        outstr += ch

Here, instr is your input string and mydict contains your mapping of chars to chars.

If you want to check parts of words, I would recommend using two dictionaries: one that contains the characters that are contained in any word, and one that contains the words. You could use it like this:

outstr = ''
word = ''
for ch in instr:
    if ch in chardict:
        word += ch
    else:
        if len(word):
            if word in worddict:
                outstr += worddict[word]
            else:
                outstr += word
            word = ''
        outstr += ch
if len(word):
    outstr += worddict[word]
else:
    outstr += word

chardict might contain all of the alphabet for instance. Of course, you might want to do some parts a little bit differently (like use something other than chardict to check if a char is to be considered part of a valid word - perhaps something with a binary search), but hopefully you get the idea.

Justin Peel 2010-02-13 15:56:50

Answer 5

+3 A:

If you use the translate method of Unicode objects, as I recommended in answer to another question of yours, everything's done automatically for you exactly as you desire: each Unicode character c whose codepoints (ord(c)) is not in the transliteration dictionary is simply passed unchanged from input to output, just as you want. Why reinvent the wheel?

Alex Martelli 2010-02-13 16:58:40

Point Sir..:) I will try out this method.

mgj 2010-02-20 14:40:54

ansaurus

tags:

views:

answers:

Python - to check if a char is in dictionary and if not to deal with it

related questions