I want to delete the specific div from soup object.
I am using python 2.7 and bs4.
According to documentation we can use div.decompose().
But that would delete all the div. How can I delete a div with specific class?
I want to delete the specific div from soup object.
I am using python 2.7 and bs4.
According to documentation we can use div.decompose().
But that would delete all the div. How can I delete a div with specific class?
Sure, you can just select, find, or find_all the divs of interest in the usual way, and then call decompose() on those divs.
For instance, if you want to remove all divs with class sidebar, you could do that with
# replace with `soup.findAll` if you are using BeautifulSoup3
for div in soup.find_all("div", {'class':'sidebar'}):
div.decompose()
If you want to remove a div with a specific id, say main-content, you can do that with
soup.find('div', id="main-content").decompose()
This will help you:
from bs4 import BeautifulSoup
markup = '<a>This is not div <div class="1">This is div 1</div><div class="2">This is div 2</div></a>'
soup = BeautifulSoup(markup,"html.parser")
a_tag = soup
soup.find('div',class_='2').decompose()
print a_tag
Output:
<a>This is not div <div class="1">This is div 1</div></a>
Let me know if it helps
Hope it help:
from bs4 import BeautifulSoup
from bs4.element import Tag
markup = '<a>This is not div <div class="1">This is div 1</div><div class="2">This is div 2</div></a>'
soup = BeautifulSoup(markup,"html.parser")
for tag in soup.select('div.1'):
tag.decompose()
print(soup)
from BeautifulSoup import BeautifulSoup
>>> soup = BeautifulSoup('<body><div>1</div><div class="comment"><strong>2</strong></div></body>')
>>> for div in soup.findAll('div', 'comment'):
... div.extract()
...
<div class="comment"><strong>2</strong></div>
>>> soup
<body><div>1</div></body>