Wireshark · Ethereal-dev: Re: [Ethereal-dev] Syntax for frame contains

Note: This archive is from the project's previous web site, ethereal.com. This list is no longer active.

From: Guy Harris <guy@xxxxxxxxxxxx>

Date: Wed, 27 Aug 2003 13:48:46 -0700


On Wednesday, August 27, 2003, at 11:55 AM, Gilbert Ramirez wrote:

I don't know Unicode very well, so I don't know all the different types
of Unicode encodings, so I won't even guess as to what the names for
those "functions" would be, but they would follow the above example.

(For now, we don't support non-ASCII characters very well in Ethereal, so I'll assume only ASCII in search strings for now.)


The encodings we'll probably have to deal with are:

1) little-endian UCS-2 - 2-byte characters, with the lower 8 bits first and the upper 8 bits after that (used in SMB and various DCE RPC protocols from Microsoft);

2) big-endian UCS-2 - (I don't know whether there are any protocols that do that - perhaps some DCE RPC-based protocols if the sender is big-endian);

3) UTF-8 - ASCII characters map to 1 byte containing the character, other characters map to multiple bytes (note that UTF-8 can encode 4-byte characters, so it gets ISO 10646 in its entirety, not just the Basic Multilingual Plane subset that's handled by UCS-2).
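
To make the three encodings concrete, here is a minimal sketch in Python (not Ethereal code) that turns a search string into the byte pattern each encoding would put on the wire. The function name encode_variants and the dictionary keys are made up for illustration. It assumes Python's "utf-16-le" and "utf-16-be" codecs can stand in for UCS-2, which holds only for characters inside the Basic Multilingual Plane, so anything outside it is rejected.

```python
def encode_variants(text):
    """Return the byte patterns for `text` in the three encodings above.

    Hypothetical helper for illustration only. UCS-2 is approximated with
    Python's UTF-16 codecs, which produce identical bytes for characters in
    the Basic Multilingual Plane (code points up to U+FFFF).
    """
    if any(ord(ch) > 0xFFFF for ch in text):
        # UCS-2 has no way to represent these; UTF-8 could, but we want
        # all three patterns to describe the same string.
        raise ValueError("character outside the BMP cannot be encoded as UCS-2")
    return {
        "ucs2-le": text.encode("utf-16-le"),  # low byte first
        "ucs2-be": text.encode("utf-16-be"),  # high byte first
        "utf-8": text.encode("utf-8"),        # ASCII stays 1 byte per character
    }


if __name__ == "__main__":
    for name, pattern in encode_variants("Hi").items():
        print(name, pattern.hex())
```

A search for "Hi" would then need to look for 48 00 69 00, 00 48 00 69, or 48 69, depending on which encoding the protocol uses.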

Unicode has a "byte order mark", which is a character that's a "zero width no-break space" (i.e., a space character that takes no space :-)) - the byte-swapped version of it is not a legal Unicode character (and never will be, as far as I know), so a Unicode string can start with a byte order mark, and something scanning it can infer the byte order from that byte order mark. Not all Unicode strings necessarily begin with a byte order mark, however; Microsoft don't use it in SMB or their RPCs, for example. (The byte order is implicitly little-endian for SMB; it's presumably the byte order from the DCE RPC header in the RPCs, although, in practice, little-endian might even be used on big-endian machines, at least for the Microsoft RPCs.)
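
The byte order mark check described above can be sketched roughly as follows, again in Python rather than Ethereal's C. The names guess_byte_order and decode_ucs2 are hypothetical, and the "default" parameter is an assumption standing in for whatever the protocol implies when no mark is present (little-endian for SMB, say). The mark is U+FEFF, so it appears as FE FF in big-endian order and FF FE in little-endian order.

```python
BOM_BE = b"\xfe\xff"  # U+FEFF stored high byte first
BOM_LE = b"\xff\xfe"  # U+FEFF stored low byte first


def guess_byte_order(data, default="little"):
    """Return (byte_order, payload) for a 2-byte-per-character string.

    If the data starts with a byte order mark, use it and strip it;
    otherwise fall back to `default`, which the caller picks from the
    protocol (an assumption of this sketch).
    """
    if data.startswith(BOM_BE):
        return "big", data[2:]
    if data.startswith(BOM_LE):
        return "little", data[2:]
    return default, data


def decode_ucs2(data, default="little"):
    """Decode UCS-2 bytes, honouring a leading byte order mark if present."""
    order, payload = guess_byte_order(data, default)
    # UTF-16 codecs are used as a stand-in for UCS-2 (same bytes within the BMP).
    codec = "utf-16-be" if order == "big" else "utf-16-le"
    return payload.decode(codec)


if __name__ == "__main__":
    print(decode_ucs2(b"\xff\xfeH\x00i\x00"))  # mark says little-endian
    print(decode_ucs2(b"\x00H\x00i", default="big"))  # no mark, caller decides
```

Passing the default explicitly keeps the protocol-specific knowledge (SMB is always little-endian, DCE RPC carries its own byte order) outside the string-scanning code.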
