If you can view the HTML code of the website that you are trying to extract the information out of and they have a logical naming system for their news article entities, you should be able to use the fopen command eg.
<?php
$handle = fopen("http://www.example.com/", "r");
?>
And then with the information that it extracts from the article code if the article code was laid out like the following:
<div class="post" id="post-16283">
<div class="postheader">
<h1 id="article-title">Test Article Code</h1>
</div>
<div class="postcontent">
This is the article text
</div>
<div class="postfooter">
Copyright Information
</div>
</div>
You could then use the following php code to show all the titles of the articles:
if (preg_match_all("#<div class="postheader"(.*?)</div>#s", $handle, $matches, PREG_PATTERN_ORDER) > 0) {
foreach ($matches[0] as $match) {
echo $match;
}
}
This is just a basic indicator of how to extract information off the web page. It can be developed so you can extract the information article by article off the web page and then even format it your own way.
Hope that helps