Python Forum
how to parse with BeautifulSoup
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
how to parse with BeautifulSoup
#1
This format is confusing me.

What is the best way to parse into individual components with BeautifulSoup.

html:
Output:
<td class="small"> <b> [Amend] </b> <b> [Cover] </b> Material Amendment to Form ATS-N (Rule 304(a)(2)(i)(A)) <br/> Acc-no: 0001609177-23-000017 (34 Act) Size: 3 KB </td>
desired resilts:
Output:
[Amend] [Cover] Material Amendment to Form ATS-N (Rule 304(a)(2)(i)(A)) Acc-no: 0001609177-23-000017 (34 Act) Size: 3 KB
I have tried:
td.get_text(strip=True).split('\n')
which results in a list of length 1:
Output:
['[Amend][Cover]Material Amendment to Form ATS-N (Rule 304(a)(2)(i)(A))Acc-no: 0001609177-23-000017\xa0(34 Act)\xa0 Size: 3 KB']
Also tried numerous other methods with improper results.
Reply


Messages In This Thread
how to parse with BeautifulSoup - by Larz60+ - Oct-18-2023, 11:22 AM
RE: how to parse with BeautifulSoup - by menator01 - Oct-18-2023, 12:12 PM
RE: how to parse with BeautifulSoup - by Larz60+ - Oct-18-2023, 02:03 PM
RE: how to parse with BeautifulSoup - by menator01 - Oct-18-2023, 02:25 PM

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020