How would it be possible to do the following:
- Scan through an html page (preferably through a whole domain (www.python.org) and extract all
h1 h2 ...hn Tags
and write all Headings to a file. In the correct order:
Start with h1 Than h2
until we reach the next h1