Blocks of similar text for test data | ansaurus

tags:

unit-testing

Q:

Blocks of similar text for test data

For testing purposes I need to create sets of text files that have similar but not identical text. Each set needs to differ from the other sets while still sharing something in common with them.

For example, I may need to create 10 sets of 20 documents each for a total of 200 documents. Each document needs about 250 words in it.

If one set of documents is about dogs, for example, then the other sets should be about other animals, so that there is a weak link between the sets (in this case animals) and a strong link between the documents within a set (dogs in one set, cats in another).

The words in the documents do not need to be in any particular order, nor do they need to be in sentences or make sense.

Does anybody know how I can generate or obtain this type of data for my unit tests?
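One way to meet the requirements above without any external data is to draw each document's words at random from two pools: a vocabulary shared by every set (the weak link) and a vocabulary specific to one set (the strong link). The following is only a minimal sketch of that idea. The word lists, the function names `make_document` and `make_sets`, and the `topic_ratio` parameter are all assumptions made up for illustration.

```python
import random

# Hypothetical vocabularies, invented purely for illustration.
SHARED_WORDS = ["animal", "fur", "tail", "paw", "vet", "pet", "feed", "sleep"]
TOPIC_WORDS = {
    "dogs": ["dog", "bark", "leash", "fetch", "puppy", "kennel"],
    "cats": ["cat", "meow", "purr", "litter", "kitten", "whisker"],
}


def make_document(topic_words, shared_words, rng, length=250, topic_ratio=0.6):
    """Return one document: `length` words in random order, not sentences."""
    words = []
    for _ in range(length):
        # Pick from the set's own vocabulary with probability topic_ratio,
        # otherwise from the vocabulary common to all sets.
        pool = topic_words if rng.random() < topic_ratio else shared_words
        words.append(rng.choice(pool))
    return " ".join(words)


def make_sets(topics, shared_words, docs_per_set=20, length=250, seed=0):
    """Return {topic name: list of documents}; a fixed seed keeps tests repeatable."""
    rng = random.Random(seed)
    return {
        name: [
            make_document(vocab, shared_words, rng, length)
            for _ in range(docs_per_set)
        ]
        for name, vocab in topics.items()
    }


corpus = make_sets(TOPIC_WORDS, SHARED_WORDS)
```

Each string in `corpus` could then be written to its own text file. With ten entries in `TOPIC_WORDS` this would give the 10 sets of 20 documents from the example, and raising or lowering `topic_ratio` would strengthen or weaken the link within a set relative to the link between sets.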

+3 A:

How about grabbing some text from Project Gutenberg?

Doug Currie 2009-01-06 05:03:55

Great idea, Doug, thanks. I've just been looking at the site and am now trying to work out how to find a collection of books about the same subject.

Guy 2009-01-06 05:15:00
