PHP regular expression to get a specific URL

Question

I would like to get the URLs from a webpage that start with "http://example.com/category/", taken from tags like the one below:

<td><a href="http://example.com/category/subcategory/product/257849" title="Sample Title">Test</a></td>

Note:

257849 = random number

Any suggestion would be very much appreciated.

Thanks!

Another question of yours: stackoverflow.com/questions/5636972/…
– Evan Mulawski, Commented May 6, 2011 at 12:02

mario · Accepted Answer · 2011-05-06 12:01:08Z

1

Just specify the fixed base URL as is in the regex (escaping the dots so they match literally), and use [\w/]+ to match any combination of letters, digits, underscores and the / slash afterwards:

if (preg_match('#http://example\.com/category/[\w/]+#', $text, $match)) {
    print $match[0];
}

And to extract all urls at once, use preg_match_all() instead.
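
As a rough sketch of the preg_match_all() variant, the snippet below runs the same pattern over a small sample string. The sample markup, the second and third links, and the variable names ($pattern, $urls, $count) are assumptions made for illustration; in practice $text would hold the fetched page.

```php
<?php
// Hypothetical sample input; in practice $text would hold the fetched page HTML.
$text = '<td><a href="http://example.com/category/subcategory/product/257849" title="Sample Title">Test</a></td>'
      . '<td><a href="http://example.com/other/page/1">Other</a></td>'
      . '<td><a href="http://example.com/category/subcategory/product/991">Test 2</a></td>';

// Dots are escaped so they match a literal "." rather than any character.
$pattern = '#http://example\.com/category/[\w/]+#';

// preg_match_all() returns the number of full matches (or false on error).
$count = preg_match_all($pattern, $text, $matches);

// With the default flags, $matches[0] holds every full-pattern match.
$urls = $count ? $matches[0] : [];

foreach ($urls as $url) {
    echo $url, PHP_EOL;
}
```

With this sample input, the link under /other/ is skipped and the two /category/ URLs are printed, one per line.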

answered May 6, 2011 at 12:01

mario


1 Comment

It's unlikely, but other characters such as % are valid in URLs and could appear here.

James C · Answer · 2011-05-06 12:02 (edited by Benjol, 2012-08-10 09:38:06Z)

0

preg_match_all('#http://example\.com/category/[^"]+#', $text, $a);

The matched URLs will be in $a[0].
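
To illustrate how the two character classes differ, here is a small sketch that runs both patterns against one link. The sample URL (with a percent-encoded space and a query string) and the variable names $narrow and $wide are made up for this example, and the patterns are written with escaped dots and a trailing slash after "category", which is an assumption about the intended match.

```php
<?php
// Hypothetical link containing a percent-encoded space and a query string.
$text = '<a href="http://example.com/category/sub%20cat/product/257849?ref=home" title="Sample Title">Test</a>';

// Narrow class: matching stops at the first character outside [\w/], here the "%".
preg_match_all('#http://example\.com/category/[\w/]+#', $text, $narrow);

// Negated class: matching runs up to, but not including, the closing double quote.
preg_match_all('#http://example\.com/category/[^"]+#', $text, $wide);

echo $narrow[0][0], PHP_EOL; // http://example.com/category/sub
echo $wide[0][0], PHP_EOL;   // http://example.com/category/sub%20cat/product/257849?ref=home
```

The negated class relies on the href value being wrapped in double quotes, as in the question's markup; a page that uses single-quoted attributes would probably need [^"']+ or an HTML parser instead.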


answered May 6, 2011 at 12:02

James C
