Writing utf16 to file in binary mode(以二进制模式将 utf16 写入文件)
问题描述
我正在尝试以二进制模式使用 ofstream 将 wstring 写入文件,但我认为我做错了什么.这是我试过的:
I'm trying to write a wstring to file with ofstream in binary mode, but I think I'm doing something wrong. This is what I've tried:
ofstream outFile("test.txt", std::ios::out | std::ios::binary);
wstring hello = L"hello";
outFile.write((char *) hello.c_str(), hello.length() * sizeof(wchar_t));
outFile.close();
在例如 Firefox 中打开 test.txt,编码设置为 UTF16,它将显示为:
Opening test.txt in for example Firefox with encoding set to UTF16 it will show as:
嘿嘿嘿
谁能告诉我为什么会这样?
Could anyone tell me why this happens?
在十六进制编辑器中打开文件我得到:
Opening the file in a hex editor I get:
FF FE 68 00 00 00 65 00 00 00 6C 00 00 00 6C 00 00 00 6F 00 00 00
看起来由于某种原因,我在每个字符之间多出了两个字节?
Looks like I get two extra bytes in between every character for some reason?
推荐答案
我怀疑在您的环境中 sizeof(wchar_t) 是 4 - 即它写出的是 UTF-32/UCS-4 而不是 UTF-16.这当然是十六进制转储的样子.
I suspect that sizeof(wchar_t) is 4 in your environment - i.e. it's writing out UTF-32/UCS-4 instead of UTF-16. That's certainly what the hex dump looks like.
这很容易测试(只需打印 sizeof(wchar_t)),但我很确定这是怎么回事.
That's easy enough to test (just print out sizeof(wchar_t)) but I'm pretty sure it's what's going on.
要从 UTF-32 wstring 转换为 UTF-16,您需要应用正确的编码,因为代理对开始发挥作用.
To go from a UTF-32 wstring to UTF-16 you'll need to apply a proper encoding, as surrogate pairs come into play.
这篇关于以二进制模式将 utf16 写入文件的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持编程学习网!
本文标题为:以二进制模式将 utf16 写入文件


- C++ 协变模板 2021-01-01
- Stroustrup 的 Simple_window.h 2022-01-01
- 近似搜索的工作原理 2021-01-01
- 一起使用 MPI 和 OpenCV 时出现分段错误 2022-01-01
- 与 int by int 相比,为什么执行 float by float 矩阵乘法更快? 2021-01-01
- 从python回调到c++的选项 2022-11-16
- 静态初始化顺序失败 2022-01-01
- 使用/clr 时出现 LNK2022 错误 2022-01-01
- 如何对自定义类的向量使用std::find()? 2022-11-07
- STL 中有 dereference_iterator 吗? 2022-01-01