views:

28

answers:

1

I need to split html document on two parts. First part, should contain N(30) words, and next one should contain everything else. And the main problem, is to prevent splitting tags (description and body of tags).

  1. <a **<=>** href="text" > text </a>

  2. <a href="text" > **<=>** text </a>

  3. <a href="text" > text </ **<=>** a>

Give me please suggestions (or if you have already written such function, please share your code), how to realize it! Thanks.

+1  A: 

Use DOM Parser, documentation you can find at http://php.net/manual/en/book.dom.php

You can parse your html in a tree with this class, and get your result by tree manipulation and saving your data.

Svisstack