37 Commits

Author SHA1 Message Date
Serhiy Storchaka
50f948edda bpo-30011: Fixed race condition in HTMLParser.unescape(). (#1140) 2017-04-15 18:35:46 +03:00
Ezio Melotti
5a88853bdc #20288: fix handling of invalid numeric charrefs in HTMLParser. 2014-02-01 21:20:22 +02:00
Ezio Melotti
b814745226 #19480: HTMLParser now accepts all valid start-tag names as defined by the HTML5 standard. 2013-11-07 18:31:36 +02:00
Ezio Melotti
c45868ec69 #14538: HTMLParser can now parse correctly start tags that contain a bare /. 2012-04-18 19:08:41 -06:00
Ezio Melotti
36b7361fe7 HTMLParser is now able to handle slashes in the start tag. 2012-02-21 09:22:16 +02:00
Ezio Melotti
65d36dab4d #13987: HTMLParser is now able to handle malformed start tags. 2012-02-15 13:19:10 +02:00
Ezio Melotti
d2307cb48a #13987: HTMLParser is now able to handle EOFs in the middle of a construct. 2012-02-15 12:44:23 +02:00
Ezio Melotti
369cbd744e Fix an index, add more tests, avoid raising errors for unknown declarations, and clean up comments. 2012-02-13 20:36:55 +02:00
Ezio Melotti
f117443cb8 #13993: HTMLParser is now able to handle broken end tags. 2012-02-13 16:28:54 +02:00
Ezio Melotti
4b92cc3f79 #13960: HTMLParser is now able to handle broken comments. 2012-02-13 16:10:44 +02:00
Ezio Melotti
00dc60beee #13358: HTMLParser now calls handle_data only once for each CDATA. 2011-11-18 18:00:40 +02:00
Ezio Melotti
0f1571ce7f #1745761, #755670, #13357, #12629, #1200313: improve attribute handling in HTMLParser. 2011-11-14 18:04:05 +02:00
Ezio Melotti
7e82b276dd #670664: Fix HTMLParser to correctly handle the content of `<script>...</script> and <style>...</style>`. 2011-11-01 14:09:56 +02:00
Éric Araujo
31890bc9ba Fix display of html.parser.HTMLParser.feed docstrin 2011-05-25 18:11:43 +02:00
Ezio Melotti
9f1ffb2ae9 #7311: fix HTMLParser to accept non-ASCII attribute values. 2011-04-05 20:40:52 +03:00
Senthil Kumaran
3f60f09eb2 Fix Issue10759 - HTMLParser.unescape() to handle malform charrefs. 2010-12-28 16:05:07 +00:00
Victor Stinner
554a3b82e4 Issue #6662: Fix parsing of malformatted charref (&#bad;) 2010-05-24 21:33:24 +00:00
Fred Drake
d995e1150c revert creation of the html.entities and html.parser modules
(http://bugs.python.org/issue2882)
2008-05-20 06:08:38 +00:00
Fred Drake
91ae250273 rename HTMLParser to html.parser, htmlentitydefs to html.entities
(http://bugs.python.org/issue2882)
2008-05-17 20:30:04 +00:00
Martin v. Löwis
ab8a6bba25 Patch #912410: Replace HTML entity references for attribute values
in HTMLParser.
2007-03-06 14:43:00 +00:00
Georg Brandl
cd3c26a717 Reverting previous checkin. This breaks too much of HTMLParser to be applied
without thought. Anyway, such malformed HTML is better handled by something
like BeautifulSoup.
2005-09-01 06:25:34 +00:00
Georg Brandl
7847405a76 bug [ 761452 ] HTMLParser chokes on my.yahoo.com output 2005-08-31 22:08:45 +00:00
Fred Drake
49b4d19172 remove unnecessary override of base class method 2004-09-08 22:58:36 +00:00
Andrew M. Kuchling
b7d8ce0275 [Bug #921657] Allow '@' in unquoted HTML attributes. Not strictly legal according to the HTML REC, but HTMLParser is already a pretty loose parser. Reported by Bernd Zimmermann. 2004-06-05 15:31:45 +00:00
Walter Dörwald
70a6b49821 Replace backticks with repr() or "%r"
From SF patch #852334.
2004-02-12 17:35:32 +00:00