Channel: BeautifulSoup - easy way to obtain HTML-free contents - Stack Overflow

BeautifulSoup - easy way to obtain HTML-free contents

December 28, 2009, 8:02 am

I'm using this code to find all interesting links in a page:

soup.findAll('a', href=re.compile(r'^notizia\.php\?idn=\d+'))

It does its job pretty well. Unfortunately, each matching <a> tag contains a lot of nested tags, such as <font> and <b>. I'd like to get just the text content, without any HTML tags.

Example of link:

<A HREF="notizia.php?idn=1134" OnMouseOver="verde();" OnMouseOut="blu();"><FONT CLASS="v12"><B>03-11-2009:&nbsp;&nbsp;<font color=green>CCS Ingegneria Elettronica-Sportello studenti ed orientamento</B></FONT></A>

Of course it's ugly (and the markup is not always the same!) and I'd like to get:

03-11-2009:  CCS Ingegneria Elettronica-Sportello studenti ed orientamento

The documentation says to use text=True in the findAll method, but when I do that my regex is ignored. Why? How can I solve that?
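For reference, the BeautifulSoup 3 documentation notes that when text is given to findAll, the tag name and keyword arguments are ignored, which would explain the lost regex. A minimal sketch of one possible workaround follows. It assumes BeautifulSoup 4 (the bs4 package, with find_all and get_text) rather than the version 3 API used above, and the helper name link_texts is hypothetical. It keeps the href filter and flattens each matching tag afterwards:

```python
import re
from bs4 import BeautifulSoup  # assumption: BeautifulSoup 4 is installed

# The example link from the question, malformed markup included.
html = (
    '<A HREF="notizia.php?idn=1134" OnMouseOver="verde();" OnMouseOut="blu();">'
    '<FONT CLASS="v12"><B>03-11-2009:&nbsp;&nbsp;<font color=green>'
    'CCS Ingegneria Elettronica-Sportello studenti ed orientamento'
    '</B></FONT></A>'
)

soup = BeautifulSoup(html, "html.parser")
link_pattern = re.compile(r"^notizia\.php\?idn=\d+")


def link_texts(soup):
    """Return the tag-free text of every link whose href matches link_pattern."""
    texts = []
    for a in soup.find_all("a", href=link_pattern):
        # get_text() joins all nested strings, whatever tags wrap them.
        text = a.get_text()
        # &nbsp; is decoded to U+00A0; turn it into a plain space.
        texts.append(text.replace("\xa0", " ").strip())
    return texts


print(link_texts(soup))
```

Because the text is collected from whatever strings sit inside each matched tag, this should not depend on the exact nesting of font and b tags, which the question says varies from link to link.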
