I'm trying to find a string inside a HTML page with known patterns. for example, in the following HTML code: As A Row
I need your help, if the following code below counts every …
How To Make A Transition Effect Up The Input On Change
I need to place this particular effect on a dropdown I need…
Some Questions About Tree Construction [html Spec]
I know that it's not customary to ask a bunch of questi…
Solution 1:
re.findall(r'<HR>\s*<font size="\+1">(.*?)</font><BR>' , html, re.DOTALL)
Copy findall
is returning a list with everything that is captured between the brackets in the regular expression. I used re.DOTALL so the dot also captures end of lines.
I used \s*
because I was not sure whether there would be any whitespace.
Solution 2:
This works, but may not be very robust:
import re
r = re.compile('<HR > \s?<font size ="\+1" > (.+?)</font > \s?<BR > ', re.IGNORECASE)
r.findall(html)
Copy You will be better off using a proper HTML parser. BeautifulSoup is excellent and easy to use. Look it up.
Solution 3:
re.findall(r'<HR>\n<font size="\+1">([^<]*)<\/font><BR>' , html, re.MULTILINE)
Copy
Post a Comment for "Search In Html Page Using Regex Patterns With Python"