Walter Dörwald
5d23f9a8a3
Don't add multiple empty lines at the end of the codec. With this a
regenerated codec should survive reindent.py unchanged.
2006-03-31 10:13:10 +00:00
Walter Dörwald
cff22083f1
Whitespace for generated code.
2006-03-27 15:11:56 +00:00
Hye-Shik Chang
e2ac4abd01
Patch #1443155: Add the incremental codecs support for CJK codecs.
(reviewed by Walter Dörwald)
2006-03-26 02:34:59 +00:00
Walter Dörwald
abb02e5994
Patch #1436130: codecs.lookup() now returns a CodecInfo object (a subclass
of tuple) that provides incremental decoders and encoders (a way to use
stateful codecs without the stream API). Functions
codecs.getincrementaldecoder() and codecs.getincrementalencoder() have
been added.
2006-03-15 11:35:15 +00:00
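The incremental interface described in the entry above can be illustrated with a short sketch. It assumes a current Python 3 interpreter and the built-in "utf-8" codec; the sample string and variable names are illustrative only and do not come from the patch.

```python
import codecs

# codecs.lookup() returns a CodecInfo object, which is still a tuple subclass.
info = codecs.lookup("utf-8")
assert isinstance(info, tuple)

# Decode a byte string one byte at a time. The decoder keeps the state of a
# partially received multi-byte character between calls, so no stream object
# is needed.
decoder = codecs.getincrementaldecoder("utf-8")()
data = "Dörwald".encode("utf-8")
pieces = [decoder.decode(data[i:i + 1]) for i in range(len(data))]
pieces.append(decoder.decode(b"", final=True))  # flush any pending state
text = "".join(pieces)

# Encode character by character with the matching incremental encoder.
encoder = codecs.getincrementalencoder("utf-8")()
encoded = b"".join(encoder.encode(ch) for ch in text)
encoded += encoder.encode("", final=True)
```

The same decoder and encoder classes are also reachable as info.incrementaldecoder and info.incrementalencoder on the CodecInfo object.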
Martin v. Löwis
43179c8e6f
Add changelog entry.
2006-03-11 12:43:44 +00:00
Tim Peters
88ca467ca4
Whitespace normalization.
2006-03-10 23:39:56 +00:00
Martin v. Löwis
480f1bb67b
Update Unicode database to Unicode 4.1.
2006-03-09 23:38:20 +00:00
Tim Peters
536cf99536
Whitespace normalization.
2005-12-25 23:18:31 +00:00
Marc-André Lemburg
68b49ef8a1
Add Makefile which allows easily rebuilding the charmap codecs.
2005-10-25 11:55:01 +00:00
Marc-André Lemburg
89bbfd4a36
Add custom mapping files used for generating some of the charmap
codecs.
2005-10-25 11:54:04 +00:00
Marc-André Lemburg
bd20ea55bc
Apply some cosmetic fixes to the output of the script.
Only include the decoding map if no table can be generated.
2005-10-25 11:53:33 +00:00
Marc-André Lemburg
92b201debc
Add two new tools to compare codecs and show differences and to
list all installed codecs.
2005-10-21 13:47:03 +00:00
Marc-André Lemburg
c5694c8bf4
Moved gencodec.py to the Tools/unicode/ directory.
Added new support for decoding tables.
Cleaned up the implementation a bit.
2005-10-21 13:45:17 +00:00
Hye-Shik Chang
e9ddfbb412
SF #989185: Drop unicode.iswide() and unicode.width() and add
unicodedata.east_asian_width(). You can still implement your own
simple width() function using it like this:
    def width(u):
        w = 0
        for c in unicodedata.normalize('NFC', u):
            cwidth = unicodedata.east_asian_width(c)
            if cwidth in ('W', 'F'): w += 2
            else: w += 1
        return w
2004-08-04 07:38:35 +00:00
Tim Peters
182b5aca27
Whitespace normalization, via reindent.py.
2004-07-18 06:16:08 +00:00
Hye-Shik Chang
974ed7cfa5
- SF #962502: Add two more methods for unicode type; width() and
iswide() for east asian width manipulation. (Inspired by David
Goodger, Reviewed by Martin v. Loewis)
- Move _PyUnicode_TypeRecord.flags to the end of the struct so that
no padding is added for UCS-4 builds. (Suggested by Martin v. Loewis)
2004-06-02 16:49:17 +00:00
Armin Rigo
ba91b9fdda
Applying SF patch #949329 on behalf of Raymond Hettinger.
2004-05-19 19:10:18 +00:00
Martin v. Löwis
2548c730c1
Implement IDNA (Internationalized Domain Names in Applications).
2003-04-18 10:39:54 +00:00
Martin v. Löwis
b5c980b802
Add unidata_version. Bump generator version number.
2002-11-25 09:13:37 +00:00
Martin v. Löwis
97225da29a
Sort names independent of the Python version. Fix hex constant warning.
Include all First/Last blocks.
2002-11-24 23:05:09 +00:00
Martin v. Löwis
677bde2dd1
Patch #626485: Support Unicode normalization.
2002-11-23 22:08:15 +00:00
Martin v. Löwis
99ac3283e7
Verify that lower-higher case delta are 16-bit.
2002-10-18 17:34:18 +00:00
Martin v. Löwis
9def6a3a77
Update to Unicode 3.2 database.
2002-10-18 16:11:54 +00:00
Walter Dörwald
aaab30e00c
Apply diff2.txt from SF patch http://www.python.org/sf/572113
(with one small bugfix in bgen/bgen/scantools.py)
This replaces string module functions with string methods
for the stuff in the Tools directory. Several uses of
string.letters etc. are still remaining.
2002-09-11 20:36:02 +00:00
Fredrik Lundh
b2dfd73bdc
Unicode nits: Don't include unicodedatabase.h no more. And make sure
to build *all* tables in makeunicodedata.py.
2001-01-21 23:31:52 +00:00