37 Commits

Author SHA1 Message Date
Ezio Melotti
5a88853bdc #20288: fix handling of invalid numeric charrefs in HTMLParser. 2014-02-01 21:20:22 +02:00
Ezio Melotti
b814745226 #19480: HTMLParser now accepts all valid start-tag names as defined by the HTML5 standard. 2013-11-07 18:31:36 +02:00
Ezio Melotti
c45868ec69 #14538: HTMLParser can now parse correctly start tags that contain a bare /. 2012-04-18 19:08:41 -06:00
Ezio Melotti
36b7361fe7 HTMLParser is now able to handle slashes in the start tag. 2012-02-21 09:22:16 +02:00
Ezio Melotti
65d36dab4d #13987: HTMLParser is now able to handle malformed start tags. 2012-02-15 13:19:10 +02:00
Ezio Melotti
d2307cb48a #13987: HTMLParser is now able to handle EOFs in the middle of a construct. 2012-02-15 12:44:23 +02:00
Ezio Melotti
369cbd744e Fix an index, add more tests, avoid raising errors for unknown declarations, and clean up comments. 2012-02-13 20:36:55 +02:00
Ezio Melotti
f117443cb8 #13993: HTMLParser is now able to handle broken end tags. 2012-02-13 16:28:54 +02:00
Ezio Melotti
4b92cc3f79 #13960: HTMLParser is now able to handle broken comments. 2012-02-13 16:10:44 +02:00
Ezio Melotti
6b7003a18c #13576: add tests about the handling of (possibly broken) condcoms. 2011-12-19 07:28:08 +02:00
Ezio Melotti
00dc60beee #13358: HTMLParser now calls handle_data only once for each CDATA. 2011-11-18 18:00:40 +02:00
Ezio Melotti
0f1571ce7f #1745761, #755670, #13357, #12629, #1200313: improve attribute handling in HTMLParser. 2011-11-14 18:04:05 +02:00
Ezio Melotti
74592919d6 Group tests about attributes in a separate class. 2011-11-08 02:07:18 +02:00
Ezio Melotti
7e82b276dd #670664: Fix HTMLParser to correctly handle the content of `<script>...</script>` and `<style>...</style>`. 2011-11-01 14:09:56 +02:00
Ezio Melotti
9f1ffb2ae9 #7311: fix HTMLParser to accept non-ASCII attribute values. 2011-04-05 20:40:52 +03:00
Senthil Kumaran
3f60f09eb2 Fix Issue10759 - HTMLParser.unescape() to handle malform charrefs. 2010-12-28 16:05:07 +00:00
Victor Stinner
554a3b82e4 Issue #6662: Fix parsing of malformatted charref (&#bad;) 2010-05-24 21:33:24 +00:00
Benjamin Peterson
b3be23ad27 remove the svn:executable property from files that don't have shebang lines 2010-03-05 03:20:06 +00:00
Georg Brandl
bcdafa44f2 Remove html package and fix test_htmlparser. 2008-05-20 07:58:42 +00:00
Benjamin Peterson
d21dabc890 change some imports in tests so they will not be skipped in 3.0 2008-05-18 23:07:07 +00:00
Christian Heimes
c5f05e45cf Patch #2167 from calvin: Remove unused imports 2008-02-23 17:40:11 +00:00
Martin v. Löwis
ab8a6bba25 Patch #912410: Replace HTML entity references for attribute values in HTMLParser. 2007-03-06 14:43:00 +00:00
Georg Brandl
d09def36d5 Bug #1442874: handle "<!>", the empty SGML comment 2006-03-09 13:27:14 +00:00
Fred Drake
75d9a62fe0 add tests that make sure buffer boundaries are handled properly for SGML comments (see SF patch #901369) 2004-09-08 22:57:01 +00:00
Tim Peters
27f883687b Whitespace normalization. 2004-07-08 04:22:35 +00:00