ansaurus

Question

How to make this random text generator more efficient in Python?

Answer 1

A:

Some suggested improvements:

The while loop will run forever; you should probably remove it.
Use max (with a generator expression or, more simply, key=len) to find the longest word in a memory-efficient manner.
Build the list of sentences that contain longestWord and are longer than 40 characters with a list comprehension. This should also be moved out of the while loop, as it only needs to happen once.

sents = [" ".join(sent) for sent in listOfSents if longestWord in sent and len(" ".join(sent)) > 40]
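Because gutenberg.sents() yields each sentence as a list of tokens, len(sent) counts words, while len(" ".join(sent)) counts characters. A small sketch of the difference, using a made-up token list as a stand-in for one corpus sentence (the variable names here are illustrative only):

```python
# Hypothetical tokenised sentence, shaped like one item of gutenberg.sents().
sent = ["It", "is", "a", "truth", "universally", "acknowledged"]

word_count = len(sent)            # number of tokens in the list
char_count = len(" ".join(sent))  # number of characters once joined

# max with key=len returns the longest token (the first one on ties).
longest_word = max(sent, key=len)

print(word_count)
print(char_count)
print(longest_word)
```

Which of the two lengths you compare against 40 depends on whether the threshold is meant in words or in characters.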
If you want to print every sentence that is found in a random order, you could shuffle the list you just created. random.shuffle reorders the list in place and returns None, so call it before the loop:

random.shuffle(sents)
for sent in sents: print sent

This is how the code could look with these changes:

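import nltk
from nltk.corpus import gutenberg
from random import shuffle

listOfSents = gutenberg.sents()
triggerSentence = raw_input("Please enter the trigger sentence: ")

# Longest word in the trigger sentence (the first one wins on ties).
longestWord = max(triggerSentence.split(), key=len)

# Each sent is a list of tokens, so join it before measuring characters.
longSents = [" ".join(sent) for sent in listOfSents
                 if longestWord in sent
                 and len(" ".join(sent)) > 40]

# shuffle reorders the list in place and returns None.
shuffle(longSents)
for sent in longSents:
    print sent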
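For trying the same idea without downloading the NLTK corpus, here is a minimal sketch that wraps the steps in a function. The function name matching_sentences, its min_chars and rng parameters, and the tiny corpus are all assumptions made for illustration; print is called with a single argument so the sketch should run on both Python 2 and Python 3.

```python
import random


def matching_sentences(list_of_sents, trigger_sentence, min_chars=40, rng=random):
    """Return joined sentences containing the trigger's longest word, shuffled."""
    longest_word = max(trigger_sentence.split(), key=len)
    matches = []
    for sent in list_of_sents:
        if longest_word in sent:
            text = " ".join(sent)  # join once, then measure characters
            if len(text) > min_chars:
                matches.append(text)
    rng.shuffle(matches)  # shuffles in place and returns None
    return matches


# Hypothetical stand-in for gutenberg.sents(): a list of token lists.
corpus = [
    ["The", "whale", "rose", "slowly", "from", "the", "dark", "and", "silent", "water"],
    ["A", "whale", "!"],
    ["The", "ship", "sailed", "on", "through", "the", "long", "and", "empty", "night"],
]

for sentence in matching_sentences(corpus, "a big whale", rng=random.Random(0)):
    print(sentence)
```

Passing a seeded random.Random instance as rng makes the order repeatable, which can help when testing; leaving the default uses the module-level generator.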

Tim McNamara 2010-10-11 06:18:56

The builtin max() takes a key function: longestWord = max(triggerSentence.split(), key=len)

kevpie 2010-10-13 17:08:58

+1 thanks @kevpie, didn't know about the `key` argument

Tim McNamara 2010-10-13 19:02:40

Answer 2

+1 A:

If all you need is to generate random text (with the requirement, I guess, that it contain meaningful sentences), you can do it much more simply: just generate random numbers and use them as indexes to retrieve sentences from your text database (be it Project Gutenberg or whatever).
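A minimal sketch of that idea, assuming the sentences are already available as a list of strings. The function name random_sentences and the three-item database are hypothetical placeholders, not part of any library:

```python
import random


def random_sentences(sentences, count, rng=random):
    """Pick `count` sentences by random index (repeats are possible)."""
    picked = []
    for _ in range(count):
        index = rng.randrange(len(sentences))  # 0 <= index < len(sentences)
        picked.append(sentences[index])
    return picked


# Hypothetical stand-in for a sentence database.
database = [
    "Call me Ishmael.",
    "It was the best of times.",
    "All happy families are alike.",
]

for sentence in random_sentences(database, 2):
    print(sentence)
```

If repeated sentences are not wanted, random.sample(sentences, count) picks distinct items instead, provided count does not exceed the number of sentences.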

thor 2010-10-20 09:41:44
