cpython

mirror of https://github.com/AdaCore/cpython.git synced 2026-02-12 12:57:15 -08:00

Author	SHA1	Message	Date
Jeremy Hylton	d37292bb8d	Remove unused variable	2001-05-08 04:00:45 +00:00
Tim Peters	2cfe368283	Make unicode.join() work nice with iterators. This also required a change to string.join(), so that when the latter figures out in midstream that it really needs unicode.join() instead, unicode.join() can actually get all the sequence elements (i.e., there's no guarantee that the sequence passed to string.join() can be iterated over again by unicode.join(), so string.join() must not pass on the original sequence object anymore).	2001-05-05 05:36:48 +00:00
Tim Peters	b3d8d1f76c	A different approach to the problem reported in Patch #419651: Metrowerks on Mac adds 0x itself C std says %#x and %#X conversion of 0 do not add the 0x/0X base marker. Metrowerks apparently does. Mark Favas reported the same bug under a Compaq compiler on Tru64 Unix, but no other libc broken in this respect is known (known to be OK under MSVC and gcc). So just try the damn thing at runtime and see what the platform does. Note that we've always had bugs here, but never knew it before because a relevant test case didn't exist before 2.1.	2001-04-28 05:38:26 +00:00
Marc-André Lemburg	8155e0e541	This patch originated from an idea by Martin v. Loewis who submitted a patch for sharing single character Unicode objects. Martin's patch had to be reworked in a number of ways to take Unicode resizing into consideration as well. Here's what the updated patch implements: * Single character Unicode strings in the Latin-1 range are shared (not only ASCII chars as in Martin's original patch). * The ASCII and Latin-1 codecs make use of this optimization, providing a noticable speedup for single character strings. Most Unicode methods can use the optimization as well (by virtue of using PyUnicode_FromUnicode()). * Some code cleanup was done (replacing memcpy with Py_UNICODE_COPY) * The PyUnicode_Resize() can now also handle the case of resizing unicode_empty which previously resulted in an error. * Modified the internal API _PyUnicode_Resize() and the public PyUnicode_Resize() API to handle references to shared objects correctly. The _PyUnicode_Resize() signature changed due to this. * Callers of PyUnicode_FromUnicode() may now only modify the Unicode object contents of the returned object in case they called the API with NULL as content template. Note that even though this patch passes the regression tests, there may still be subtle bugs in the sharing code.	2001-04-23 14:44:21 +00:00
Tim Peters	cf96de052f	SF but #417587 : compiler warnings compiling 2.1. Repaired some of the SGI compiler warnings Sjoerd Mullender reported.	2001-04-21 02:46:11 +00:00
Tim Peters	78fe5308b4	CVS patch 416248: 2.1c1 unicodeobject: unused vrbl cleanup, from Mark Favas.	2001-04-19 21:55:14 +00:00
Jeremy Hylton	b8a93215c2	Revert previous checkin, which caused test_unicodedata to fail.	2001-04-19 16:43:49 +00:00
Martin v. Löwis	da3dc5b892	Patch #416953 : Cache ASCII characters to speed up ASCII decoding.	2001-04-18 12:49:15 +00:00
Tim Peters	fff5325078	Bug 415514 reported that e.g. "%#x" % 0 blew up, at heart because C sprintf supplies a base marker if and only if the value is not 0. I then fixed that, by tolerating C's inconsistency when it does %#x, and taking away that Python produced 0x0 when formatting 0L (the "long" flavor of 0) under %#x itself. But after talking with Guido, we agreed it would be better to supply 0x for the short int case too, despite that it's inconsistent with C, because C is inconsistent with itself and with Python's hex(0) (plus, while "%#x" % 0 didn't work before, "%#x" % 0L did, and returned "0x0"). Similarly for %#X conversion.	2001-04-12 18:38:48 +00:00
Tim Peters	711088d9b8	Fix for SF bug #415514 : "%#x" % 0 caused assertion failure/abort. http://sourceforge.net/tracker/index.php?func=detail&aid=415514&group_id=5470&atid=105470 For short ints, Python defers to the platform C library to figure out what %#x should do. The code asserted that the platform C returned a string beginning with "0x". However, that's not true when-- and only when --the value being formatted is 0. Changed the code to live with C's inconsistency here. In the meantime, the problem does not arise if you format a long 0 (0L) instead. However, that's because the code we wrote to do %#x conversions on longs produces a leading "0x" regardless of value. That's probably wrong too: we should drop leading "0x", for consistency with C, when (& only when) formatting 0L. So I changed the long formatting code to do that too.	2001-04-12 00:35:51 +00:00
Fredrik Lundh	ccc7473fc8	reorganized PyUnicode_DecodeUnicodeEscape a bit (in order to make it less likely that bug #132817 ever appears again)	2001-02-18 22:13:49 +00:00
Marc-André Lemburg	fde66e1bcc	Fixed .capitalize() method of Unicode objects to work like the corresponding string method. Added tests for this too. Patch written by Marc-Andre Lemburg. Copyright assigned to Guido van Rossum.	2001-01-29 11:14:16 +00:00
Ka-Ping Yee	fa004ad36c	Show '\011', '\012', and '\015' as '\t', '\n', '\r' in strings. Switch from octal escapes to hex escapes for other nonprintable characters.	2001-01-24 17:19:08 +00:00
Fredrik Lundh	06d126803c	Move uchhash functionality into unicodedata (after the recent crop of changes, the files are small enough to do this). Also adds "name" and "lookup" functions to unicodedata.	2001-01-24 07:59:11 +00:00
Fredrik Lundh	f60560626c	Better error message if ucnhash cannot be found (obscure attribute errors aren't that helpful), or doesn't contain what's expected from it. Also tweaked the test script so it compiles even if ucnhash is missing.	2001-01-20 11:15:25 +00:00
Fredrik Lundh	0fdb90cafe	refactored the unicodeobject/ucnhash interface, to hide the implementation details inside the ucnhash module. also cleaned up the unicode copyright blurb a little; Secret Labs' internal revision history isn't that interesting...	2001-01-19 09:45:02 +00:00
Marc-André Lemburg	ad7c98e264	This patch adds a new builtin unistr() which behaves like str() except that it always returns Unicode objects. A new C API PyObject_Unicode() is also provided. This closes patch #101664. Written by Marc-Andre Lemburg. Copyright assigned to Guido van Rossum.	2001-01-17 17:09:53 +00:00
Marc-André Lemburg	3a645e4dd4	Added checks to prevent PyUnicode_Count() from dumping core in case the parameters are out of bounds and fixes error handling for .count(), .startswith() and .endswith() for the case of mixed string/Unicode objects. This patch adds Python style index semantics to PyUnicode_Count() indices (including the special handling of negative indices). The patch is an extended version of patch #103249 submitted by Michael Hudson (mwh) on SF. It also includes new test cases.	2001-01-16 11:54:12 +00:00
Marc-André Lemburg	ec233e5803	This patch adds a new feature to the builtin charmap codec: The mapping dictionaries can now contain 1-n mappings, meaning that character ordinals may be mapped to strings or Unicode object, e.g. 0x0078 ('x') -> u"abc", causing the ordinal to be replaced by the complete string or Unicode object instead of just one character. Another feature introduced by the patch is that of mapping oridnals to the emtpy string. This allows removing characters. The patch is different from patch #103100 in that it does not cause a performance hit for the normal use case of 1-1 mappings. Written by Marc-Andre Lemburg, copyright assigned to Guido van Rossum.	2001-01-06 14:59:58 +00:00
Marc-André Lemburg	a866df806d	This patch changes the default behaviour of the builtin charmap codec to not apply Latin-1 mappings for keys which are not found in the mapping dictionaries, but instead treat them as undefined mappings. The patch was originally written by Martin v. Loewis with some additional (cosmetic) changes and an updated test script by Marc-Andre Lemburg. The standard codecs were recreated from the most current files available at the Unicode.org site using the Tools/scripts/gencodec.py tool. This patch closes the bugs #116285 and #119960.	2001-01-03 21:29:14 +00:00
Andrew M. Kuchling	f947ffe951	Patch #102940 : use only printable Unicode chars in reporting incorrect % characters; characters outside the printable range are replaced with '?'	2000-12-19 22:49:06 +00:00
Guido van Rossum	cda4f9a8dc	Fix off-by-one error in split_substring(). Fixes SF bug #122162 .	2000-12-19 02:23:19 +00:00
Andrew M. Kuchling	6ca8917758	[ Patch #102852 ] Make % error a bit more informative by indicates the index at which an unknown %-escape was found	2000-12-15 13:07:46 +00:00
Tim Peters	a3a3a030af	Fox for SF bug #123859 : %[duxXo] long formats inconsistent.	2000-11-30 05:22:44 +00:00
Barry Warsaw	5b4c22806f	_PyUnicode_Fini(): Initialize the local freelist walking variable `u' after unicode_empty has been freed, otherwise it might not point to the real start of the unicode_freelist. Final closure for SF bug #110681, Jitterbug PR#398.	2000-10-03 20:45:26 +00:00

1 2 3 4

90 Commits