Modify ↓
Ticket #197 (new defect)
Non-BMP unicode characters are dumped using surrogate code units if python was not configured with UCS4
| Reported by: | travis.mcleskey@… | Owned by: | xi |
|---|---|---|---|
| Priority: | normal | Component: | pyyaml |
| Severity: | normal | Keywords: | |
| Cc: |
Description
If python is built with UCS2 (the default, for example, on OS X), you get:
yaml.dump(u'\U0001D10C')
"\uD834\uDD0C"
The output should instead be "\U0001D10C", since the surrogate code units are not valid unicode characters.
Attachments
Change History
Note: See
TracTickets for help on using
tickets.
