ansaurus

Question

Regex to check if valid URL that ends in .jpg, .png, or .gif

Answer 1

+10 A:

(?:([^:/?#]+):)?(?://([^/?#]*))?([^?#]*\.(?:jpg|gif|png))(?:\?([^#]*))?(?:#(.*))?

That's a (slightly modified) version of the official URI parsing regexp from RFC 2396. It allows for #fragments and ?querystrings to appear after the filename, which may or may not be what you want. It also matches any valid domain, including localhost, which again might not be what you want, but it could be modified.

A more traditional regexp for this might look like the below.

^https?://(?:[a-z\-]+\.)+[a-z]{2,6}(?:/[^/#?]+)+\.(?:jpg|gif|png)$
          |-------- domain -------|--- path ---|-- extension ---|

EDIT See my other comment, which although isn't answering the question as completely as this one, I feel it's probably a more useful in this case. However, I'm leaving this here for ~~karma-whoring~~ completeness reasons.

Dan 2008-10-04 02:56:26

Replace "jpg" with "jpe?g" and also set the case-insensitive flag on the regex.

Ates Goral 2008-10-05 21:14:54

Answer 2

A:

^((http(s?)\:\/\/|~/|/)?([\w]+:\w+@)?([a-zA-Z]{1}([\w-]+.)+([\w]{2,5}))(:[\d]{1,5})?((/?\w+/)+|/?)(\w+.(jpg|png|gif))

Mark Cidade 2008-10-04 02:56:52

Answer 3

+2 A:

In general, you're better off validating URLs using built-in library or framework functions, rather than rolling your own regular expressions to do this - see What is the best regular expression to check if a string is a valid URL for details.

If you are keen on doing this, though, check out this question:

Getting parts of a URL (Regex)

Then, once you're satisfied with the URL (by whatever means you used to validate it), you could either use a simple "endswith" type string operator to check the extension, or a simple regex like

(?i)\.(jpg|png|gif)$

Blair Conrad 2008-10-04 02:57:34

Very good call! Here is what I did kids:1) Use JQuery's validation http://docs.jquery.com/Plugins/Validation2) Add in this regex piece to the end of URL, and renamed it "urlphoto"Works like a charm! good job

2008-10-04 03:05:44

Answer 4

+18 A:

Actually.

Why are you checking the URL? That's no guarantee what you're going to get is an image, and no guarantee that the things you're rejecting aren't images. Try performing a HEAD request on it, and see what content-type it actually is.

Dan 2008-10-04 03:10:55

Depending on how paranoid you are about the data behind the URL, you might want to go beyond just the HEAD and grab the full content of the (alleged) image, then use an image-processing library to verify that it is what it claims to be.

Dave Sherohman 2008-10-04 14:30:26

That even wouldn't guarantee the image still exists and actually being an image at the moment the URL is served to a browser.

Marcel Korpel 2010-08-17 13:19:16

Answer 5

+2 A:

Here's the basic idea in Perl. Salt to taste.

#!/usr/bin/perl

use LWP::UserAgent;

my $ua = LWP::UserAgent->new;

@ARGV = qw(http://www.example.com/logo.png);

my $response = $ua->head( $ARGV[0] );

my( $class, $type ) = split m|/|, lc $response->content_type;

print "It's an image!\n" if $class eq 'image';

If you need to inspect the URL, use a solid library for it rather than trying to handle all the odd situations yourself:

use URI;

my $uri = URI->new( $ARGV[0] );

my $last = ( $uri->path_segments )[-1];

my( $extension ) = $last =~ m/\.([^.]+)$/g;

print "My extension is $extension\n";

Good luck, :)

brian d foy 2008-10-04 06:52:39

Answer 6

+1 A:

If you really want to be sure, grabbing the first kilobyte or two of the given URL should be sufficient to determine everything you need to know about the image.

Here's an example of how you can get that information, using Python, and here's an example of it being put to use, as a Django form field which allows you to easily validate an image's existence, filesize, dimensions and format, given its URL.

insin 2008-10-04 08:52:32

Answer 7

+1 A:

(http(s?):)|([/|.|\w|\s])*\.(?:jpg|gif|png)

This will mach all images from this string:

background: rgb(255, 0, 0) url(../res/img/temp/634043/original/cc3d8715eed0c.jpg) repeat fixed left top; cursor: auto;
<div id="divbg" style="background-color:#ff0000"><img id="bg" src="../res/img/temp/634043/original/cc3d8715eed0c.jpg" width="100%" height="100%" /></div>
background-image: url(../res/img/temp/634043/original/cc3d8715eed0c.png);
background: rgb(255, 0, 0) url(http://google.com/res/../img/temp/634043/original/cc3    _d8715eed0c.jpg) repeat fixed left top; cursor: auto;
background: rgb(255, 0, 0) url(https://google.com/res/../img/temp/634043/original/cc3_d8715eed0c.jpg) repeat fixed left top; cursor: auto;

Test your regex here: http://regex.larsolavtorvik.com/

FDisk 2009-09-18 08:46:10

ansaurus

tags:

views:

answers:

Regex to check if valid URL that ends in .jpg, .png, or .gif

related questions