nodeValue from DomDocument returning weird characters in PHP

views:

309

answers:

nodeValue from DomDocument returning weird characters in PHP

So I'm trying to parse HTML pages and looking for paragraphs (<p>) using get_elements_by_tag_name('p');

The problem is that when I use $element->nodeValue, it's returning weird characters. The document is loaded first into $html using curl then loading it into a DomDocument.

I'm sure it has to do with charsets.

Here's an example of a response: "aujourdÃ¢Â€Â™hui".

Thanks in advance.

+1 A:

This is an encoding issue. try explicitly setting the encoding to UTF-8.

this should help: http://devzone.zend.com/article/8855

prodigitalson 2010-01-08 02:09:53

Already tried that and it didn't work... The funny thing is that if I do $doc->saveHTML(), the returning html's encoding is totally correct.

Elie 2010-01-08 02:27:41

Whats the `<meta http-equiv="Content-type" ... />` specified in the HTML?

prodigitalson 2010-01-11 15:45:45

related questions

IDE suggestions: Eclipse IDE vs. Zend Studio ( confused )

MySQL/Apache Error in PHP MySQL query

Lightweight IDE for Linux

What PHP framework would you choose for a new application and why?

Why is my ternary expression not working?

How can I get at the matches when using preg_replace in PHP?

Mechanisms for tracking DB schema changes

Wordpress theme development offline tools

Using object property as default for method property

How can I get the authenticated user name under Apache using plain HTTP authentication and PHP?

Make XAMPP/Apache serve file outside of htdocs

How do you debug PHP scripts?

PHP Variables passed by value or by reference?

Best way to implement unit testing in PHP

Connect PHP to an AS/400

Best way to access Exchange using PHP?

PHP Session Security

How do I access a remote form in php?

What's the best way to generate a tag cloud from an array? (using h1 through h6 for sizing)

Apache/PHP: error_log per Virtual Host?

How do I track file downloads with apache/PHP

How would you access Object properties from within an object method?

Flat File Databases in PHP

Best way to allow plugins for a PHP application

Latest information on PHP upcoming releases

ansaurus

tags:

views:

answers:

nodeValue from DomDocument returning weird characters in PHP

related questions