Ethereal-dev: Re: [ethereal-dev] String types

Note: This archive is from the project's previous web site, ethereal.com. This list is no longer active.

Date: Tue, 2 May 2000 09:40:14 +0200
> Are your strings Unicode strings, or some other character set?  I'd be
> inclined to call them FT_UNICODE if they're just Unicode (and perhaps
> change FT_STRING to FT_ASCII, although, in practice, the 8-bit character
> strings might be in other 8-bit character sets, e.g. various other ISO
> 646 character sets, or ASCII supersets such as the ISO 8859/n character
> sets, or multi-byte character ASCII supersets such as the EUC character
> sets, or...).

This is indeed better. I think the most important ones are:
- FT_ASCII ( 7bit )
- FT_ISO_8859
- FT_UNICODE
- FT_UTF8

UTF8 is, I believe, used widely in Java, so I suppose that any Java 
initiated communication (RMI?) wil use it too. An it also opens a 
whole can of worms, because the characters are not fixed length. From
what I have read sofar it can be anything between 1 and six bytes :-(

>From a book about ASN.1 I also got several other ITU-T and ISO related character sets, but 
most of them are just limited ASCII or very obsolete.

-- 
Andreas Sikkema
andreas.sikkema@xxxxxxxxxxx
"Standing barefoot in a river of clues, most people would 
         not get their toes wet." - Brian Kantor in a.s.r.