This is an archive of the discontinued LLVM Phabricator instance.

[flang] Always encode multi-byte output in UTF-8
ClosedPublic

Authored by klausler on Apr 13 2022, 12:35 PM.

Details

Summary

A recent change to implement UTF-8 encoding should have
made the encoding conditional only for CHARACTER(KIND=1)
to enable UTF-8 output vs. Latin-1 or whatever. UTF-8 output
of wider CHARACTER kinds should not be conditional (until we choose
to support UCS-16, maybe). So wider CHARACTER kinds are being
emitted with extra zero bytes; this patch fixes them.

Diff Detail

Event Timeline

klausler created this revision.Apr 13 2022, 12:35 PM
Herald added a project: Restricted Project. · View Herald TranscriptApr 13 2022, 12:35 PM
Herald added a subscriber: jdoerfert. · View Herald Transcript
klausler requested review of this revision.Apr 13 2022, 12:35 PM
jeanPerier accepted this revision.Apr 14 2022, 2:26 AM

Thanks

This revision is now accepted and ready to land.Apr 14 2022, 2:26 AM
This revision was automatically updated to reflect the committed changes.