Parse HTML page to get contents of <p> and <b> tags

There are lots of HTML pages which are structured as a sequence of such groups:

   <b> Keywords/Category:</b>
   "keyword_a, keyword_b"

The addresses of these pages are like,, etc.

How can I extract the keywords separately from each of these pages? I've tried BeautifulSoup without success. So far my program only prints the titles of the groups (the text between <b> and </b>).

from bs4 import BeautifulSoup
from urllib2 import urlopen
import re
html_doc = urlopen('').read()
soup = BeautifulSoup(html_doc)
for link in soup.find_all('a'):
    print ''+link.get('href')
for node in soup.findAll('b'):
    print ''.join(node.findAll(text=True))

I can't test this without knowing the actual format of the source, but it seems you want the text of the <p> tags:

for node in soup.findAll('p'):
    # print the full text of each <p> element
    print(node.text)
    # or: keywords = node.text.split(', ')
    # print(keywords)

You need to split your string, which in this case is the URL, on the / character.

Then you can pick the chunks you want.

For example, I use the split function to split the URL on the / sign.

That turns it into a list. I then choose the parts I need and convert them back to a string with the ''.join() method. You can read about the split method in this link
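The split-and-join approach described above can be sketched as follows. The address is a made-up placeholder (the real URLs are not shown in the question), and `urlsplit` is only one possible alternative if you would rather not count slashes by hand.

```python
from urllib.parse import urlsplit

# Hypothetical address, used only for illustration
url = 'http://example.com/catalog/animals/page12.html'

# Split on '/' to get a list of chunks
chunks = url.split('/')
# ['http:', '', 'example.com', 'catalog', 'animals', 'page12.html']

# Pick the chunks you need and join them back into a string
last_chunk = chunks[-1]             # 'page12.html'
rejoined = '/'.join(chunks[3:5])    # 'catalog/animals'

# Alternative: let urlsplit isolate the path before splitting it
path_parts = urlsplit(url).path.strip('/').split('/')
# ['catalog', 'animals', 'page12.html']

print(last_chunk, rejoined, path_parts)
```

Note that joining with '' (an empty string) would glue the chunks together without separators, so '/'.join() is used here to keep the path readable.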

Assuming that for each block

   <b> Keywords/Category:</b>
   "keyword_a, keyword_b"

you want to extract keyword_a and keyword_b for each Keywords/Category, an example input would be:

    <p>
        <b>Mammals</b>
        "elephant, rhino"
    </p>
    <p>
        <b>Birds</b>
        "hummingbird, ostrich"
    </p>

Once you have the HTML code, you can do:

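from bs4 import BeautifulSoup

html = '''<p>
    <b>Mammals</b>
    "elephant, rhino"
</p>
<p>
    <b>Birds</b>
    "hummingbird, ostrich"
</p>'''

soup = BeautifulSoup(html, 'html.parser')

p_elements = soup.find_all('p')
for p_element in p_elements:
    # look up the <b> tag inside this <p>, not the first one in the document
    b_element = p_element.find('b')
    category = b_element.text.strip()
    # the keywords are the text node right after the <b> tag
    keywords = b_element.next_sibling.strip()
    # drop the surrounding double quotes, then split on ', '
    keyword_a, keyword_b = keywords[1:-1].split(', ')
    print('Category:', category)
    print('Keyword A:', keyword_a)
    print('Keyword B:', keyword_b)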

Which prints:

Category: Mammals
Keyword A: elephant
Keyword B: rhino
Category: Birds
Keyword A: hummingbird
Keyword B: ostrich
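
If the real pages do not wrap each group in a <p>, or a group has more or fewer than two keywords, a variation like the one below may be closer to what you need. It is only a sketch: the `extract_keywords` helper and the sample markup are my own assumptions, since I have not seen the actual page source. It walks over every <b> tag and reads the text node that directly follows it.

```python
from bs4 import BeautifulSoup
from bs4.element import NavigableString


def extract_keywords(html):
    """Map each <b> heading to the list of keywords in the text right after it."""
    soup = BeautifulSoup(html, 'html.parser')
    result = {}
    for b in soup.find_all('b'):
        sibling = b.next_sibling
        # Skip headings that are not directly followed by plain text
        if not isinstance(sibling, NavigableString):
            continue
        # Remove surrounding whitespace and double quotes
        raw = sibling.strip().strip('"')
        # Split on commas, tolerating any number of keywords
        keywords = [k.strip() for k in raw.split(',') if k.strip()]
        result[b.get_text(strip=True)] = keywords
    return result


# Hypothetical sample in the shape described in the question
sample = '''<b> Mammals:</b>
"elephant, rhino, hippo"
<b> Birds:</b>
"ostrich"'''

print(extract_keywords(sample))
# {'Mammals:': ['elephant', 'rhino', 'hippo'], 'Birds:': ['ostrich']}
```

Returning a dict keeps each keyword list tied to its heading, so you can run the same function over every page and merge the results afterwards.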

