ansaurus

Question

Answer 1

A:

It's actually very simple, a more flexible and straightforward approach is to explode() the url into an array called something like $segments, and then test on there. If you have a very small number of expected URLs, then this kind of approach is probably easier to maintain and to read.

I wouldn't recommend doing this in the htaccess file because of the performance overhead.

danp 2010-08-09 08:54:34

Answer 2

+1 A:

I'm not sure regular expressions are the way to go. I think it would probably be easier to use explode ('/' , $url) and check by looping over that array.

Here are the steps I would follow:

$url = parse_url($url, PHP_URL_PATH); 
$url = trim($url, '/'); 
$parts = explode ('/' , $url);

Then you can check if

($parts[0]=='content' && $parts[1]=='view' && $parts[3]=='34')

You can also easily get the information you want with $parts[2].

Green 2010-08-09 08:55:13

Thanks - how would I go about using a check loop? I know what loops are but is a check loop something specific, or do you just mean to loop through the exploded bits and check based on a numerical array?I am thinking parse_url, explode, then check loop?

Dan 2010-08-10 11:38:25

The check loop was a typo. I edited my original post with more details.

Green 2010-08-10 12:36:18

Thanks - I am trying a couple of different options based on execution time but that's very useful.

Dan 2010-08-11 10:39:48

Answer 3

A:

First, I would use the PHP function parse_url() to get the path, devoid of any protocol or hostname.

Once you have that the following code should get you the info you need.

<?php

$url = 'http://domain.com/content/view/*/34/'; // first example
$url = 'http://domain.com/content/view/*/34/1/*/'; // second example
$url_array = parse_url($url);

$path = $url_array['path'];

// Match the URL against regular expressions
if (preg_match('/content\/view\/([^\/]+)\/([0-9]+)\//i', $path, $matches)){        
        print_r($matches);
}

if (preg_match('/content\/view\/([^\/]+)\/([0-9]+)\/([0-9]+)\/([^\/]+)/i', $path, $matches)){        
        print_r($matches);
}

?>

([^\/]+) matches any sequence of characters except a forward slash

([0-9]+) matches any sequence of numbers

Though you can probably write a single regular expression to match most URL variants, consider using multiple regular expressions to check for different types of URLs. Depending on how much traffic you get, the speed hit won't be all that terrible.

Also, I recommend reading Mastering Regular Expressions by O'reilly. A good knowledge of regular expressions will come in handy quite often.

http://www.regular-expressions.info/php.html

John Kramlich 2010-08-10 07:26:57

Thanks - I seem to be having problems with an unknown modifier "v" when running a preg_match using this method?

Dan 2010-08-10 10:03:15

I forgot to escape the forward slashes. preg_match() considers them special characters that delimit the regular expression. Please see my updated post with code samples. It has been tested with PHP 5.3 and should be backwards compatible.

John Kramlich 2010-08-11 04:25:53

Perfect - I think I can finish from here! Thanks.

Dan 2010-08-11 10:41:35

ansaurus

tags:

views:

answers:

PHP Regex on URL - split into variables

related questions