Ethereal-dev: Re: [Ethereal-dev] Unicode strings ...

Note: This archive is from the project's previous web site, ethereal.com. This list is no longer active.

From: Richard Sharpe <sharpe@xxxxxxxxxx>
Date: Sun, 12 Aug 2001 19:43:08 +0930
Tim Potter wrote:
Richard Sharpe writes:


Hmm... do you mean strings of wide characters or some other encoding (
utf-8?).

Good question. I was thinking specifically of what MS puts on the wire, which I think is wide characters, but we need to think this issue through. Internally, Samba will be using UTF-8, I think, (although, this is irrelevant) and there may be other protocols that use other forms of UNICODE.


NT actually uses UCS2-LE (i.e 2 byte characters, little endian)
on the wire for the rpc calls.  So it's a bit more complicated
than having ustring calls.  )-:

Hmmm, but don't they look like lots of UCS2-LE characters, followed by 0x000x00?

If so, I can handle these with a proto_tree_add_ustring_format or some such.

The question that remains is do we want to be able to handle different Unicode formats? That is, will we need to handle other protocols that may have different on-the-wire formats? If so, do we want to allow for them now?

Tim.

_______________________________________________
Ethereal-dev mailing list
Ethereal-dev@xxxxxxxxxxxx
http://www.ethereal.com/mailman/listinfo/ethereal-dev




--
Richard Sharpe, rsharpe@xxxxxxxxxx, LPIC-1
www.samba.org, www.ethereal.com, SAMS Teach Yourself Samba
in 24 Hours, Special Edition, Using Samba