
It’s not just broken emoji, it’s straight-up broken content: UTF-8 cannot represent unpaired surrogates.

WTF-8 is necessary for Rust’s compatibility with Windows filesystems (it underlies OsString on Windows), because e.g. file names there are sequences of UTF-16 code units and thus may contain unpaired surrogates.
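To make that concrete, here is a rough sketch using only the portable parts of the standard library. The helper names (`decode_strict`, `decode_lossy`, `encode_three_bytes`) are made up for illustration and are not how std implements OsString internally; `encode_three_bytes` just applies the usual three-byte UTF-8 bit layout without the surrogate check, which is roughly what WTF-8 allows for an unpaired surrogate.

```rust
/// Strict UTF-16 decoding: returns None if the input contains an
/// unpaired surrogate, because a Rust String must be valid UTF-8.
fn decode_strict(units: &[u16]) -> Option<String> {
    String::from_utf16(units).ok()
}

/// Lossy decoding: each unpaired surrogate becomes U+FFFD, so the
/// original code units can no longer be recovered.
fn decode_lossy(units: &[u16]) -> String {
    String::from_utf16_lossy(units)
}

/// Hypothetical helper: apply the three-byte UTF-8 bit layout to a
/// code point in 0x0800..=0xFFFF without rejecting surrogates.
/// Only meaningful for values in that range.
fn encode_three_bytes(cp: u16) -> [u8; 3] {
    [
        0xE0 | (cp >> 12) as u8,
        0x80 | ((cp >> 6) & 0x3F) as u8,
        0x80 | (cp & 0x3F) as u8,
    ]
}
```

With a "file name" like `[0x0061, 0xD800]` (the letter a followed by a lone high surrogate), `decode_strict` gives `None` and `decode_lossy` silently changes the content. `encode_three_bytes(0xD800)` yields the bytes `ED A0 80`, which `std::str::from_utf8` rejects as invalid UTF-8. On Windows, `std::os::windows::ffi::OsStringExt::from_wide` accepts such a sequence without loss, which is where the WTF-8 representation comes in.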


