Matching a string to a regex from html input

Question

I'm having a little trouble figuring out what to do.

Basically using java I'm trying to:

Reading in the html from a website
I want to find the content after a certain string in this case being
```
 title="
```
Store that in a string.

The first and last steps are simple for me but I'm having no luck (and never had with regex).

I believe this is the beginning of what I need:

   String regex = "(?<=title=\")\\S+";
   Pattern name = Pattern.compile(regex);

After that I have no clue. Any help?

I suggest using some library for this (you'll get even XPath support): HttpUnit, JSoup, NekoHtml — Xeon
– Xeon, Commented Jul 27, 2012 at 17:23

Ωmega · Accepted Answer · 2012-07-27 17:57:46Z

1

import java.util.regex.Matcher;
import java.util.regex.Pattern;

String EXAMPLE_TEST = "......";
Pattern pattern = Pattern.compile("(?<=title=\")(\\S+)")
Matcher matcher = pattern.matcher(EXAMPLE_TEST);
while (matcher.find()) {
  System.out.println(matcher.group());
}

Note: You might consider to use regex pattern (?<=title=\")([^\"]*)

edited Jul 27, 2012 at 17:57

answered Jul 27, 2012 at 17:25

Ωmega

44k35 gold badges143 silver badges213 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

maerics Over a year ago

Do we want the whole title or just until the first whitespace?

Ωmega Over a year ago

@maerics - I believe OP didn't ask to change regex pattern but (s)he wants help with rest of code to get match to variable, or so...

Alex W · Accepted Answer · 2012-07-27 17:30:26Z

0

List<String> result_list = new ArrayList<String>();
Pattern p = Pattern.compile("title=\"(.*)\"");
Matcher m = p.matcher("title=\"test\"");
boolean result = m.find();

while(result)
{
    result_list.add(m.group(0));
    result = m.find();
}

answered Jul 27, 2012 at 17:30

Alex W

38.5k13 gold badges115 silver badges115 bronze badges

Collectives™ on Stack Overflow

Matching a string to a regex from html input

2 Answers 2

2 Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

2 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related