C# Class UniHax.Fuzzer

The Fuzzer has cases for some of the oddball manifestations of Unicode that can trip up software including: - non-character, reserved, and private use area code points - special meaning characters such as the BOM and RLO - ill-formed byte sequences - a half-surrogate code point

Afficher le fichier Open project: cweb/unicode-hax

Méthodes publiques

Свойство	Type	Description
u0390	string
u1D160	string
u1F82	string
uBOM	string
uBoldEight	string
uDAAD	string
uDEAD	string
uFB2C	string
uFDFA	string
uFullwidthSolidus	string
uIdnaSs	string
uMVS	string
uNotACharacter	string
uPrivate	string
uRLO	string
uReservedCodePoint	string
uUnassigned	string
uWordJoiner	string

Méthodes publiques

Méthode	Description
GetBom ( ) : string
GetCharacterBytes ( string encoding, string character ) : byte[]	Gets the requested byte representation of the current Unicode character codepoint
GetCharacterBytesMalformed ( string encoding, string character ) : byte[]	Malforms the bytes by removing the last byte from whichever encoding you specify.
OutOfRangeCodePointAsUtf32BE ( ) : byte[]	Return a UTF32 byte encoding for an illegal code point value U+1FFFFF. Note that Unicode 6.0 supports only up to U+10FFFF. UTF-8 percent encoding for something out of range is %F4%8F%BF%BE

Method Details

GetBom() public méthode

public GetBom ( ) : string
Résultat	string

GetCharacterBytes() public méthode

Gets the requested byte representation of the current Unicode character codepoint

public GetCharacterBytes ( string encoding, string character ) : byte[]
encoding	string	The encoding you want a byte representation in. Specify utf-8, utf-16le, or utf16-be
character	string	A single character sent as a string.
Résultat	byte[]

GetCharacterBytesMalformed() public méthode

Malforms the bytes by removing the last byte from whichever encoding you specify.

public GetCharacterBytesMalformed ( string encoding, string character ) : byte[]
encoding	string	The encoding you want a byte representation in. Specify utf-8, utf-16le, or utf16-be
character	string	A single character sent as a string.
Résultat	byte[]

OutOfRangeCodePointAsUtf32BE() public méthode

Return a UTF32 byte encoding for an illegal code point value U+1FFFFF. Note that Unicode 6.0 supports only up to U+10FFFF. UTF-8 percent encoding for something out of range is %F4%8F%BF%BE

public OutOfRangeCodePointAsUtf32BE ( ) : byte[]
Résultat	byte[]

Property Details

u0390 public_oe static_oe property

U+0390 expands by 3x (UTF-8) under NFD UTF-8 percent encoding is %CE%90

public static string u0390
Résultat	string

u1D160 public_oe static_oe property

U+1D160 expands by 3x (UTF-8) under NFC UTF-8 percent encoding is %F0%9D%85%A0

public static string u1D160
Résultat	string

u1F82 public_oe static_oe property

U+1F82 expands by 4x (UTF-16) under NFD UTF-8 percent encoding is %E1%BE%82

public static string u1F82
Résultat	string

uBOM public_oe static_oe property

The Byte Order Mark U+FEFF is a special character defining the byte order and endianess of text data. UTF-8 percent encoding is %EF%BB%BF

public static string uBOM
Résultat	string

uBoldEight public_oe static_oe property

Code point with a numerical mapping and value U+1D7D6 MATHEMATICAL BOLD DIGIT EIGHT UTF-8 percent encoding is %F0%9D%9F%96

public static string uBoldEight
Résultat	string

uDAAD public_oe static_oe property

An illegal high half-surrogate U+DAAD UTF-8 percent encoding is %ed%aa%ad

public static string uDAAD
Résultat	string

uDEAD public_oe static_oe property

An illegal low half-surrogate U+DEAD UTF-8 percent encoding is %ed%ba%ad

public static string uDEAD
Résultat	string

uFB2C public_oe static_oe property

U+FB2C expands by 3x (UTF-16) under NFC UTF-8 percent encoding is %EF%AC%AC

public static string uFB2C
Résultat	string

uFDFA public_oe static_oe property

U+FDFD expands by 11x (UTF-8) and 18x (UTF-16) under NFKC/NFKC UTF-8 percent encoding is %EF%B7%BA

public static string uFDFA
Résultat	string

uFullwidthSolidus public_oe static_oe property

U+FF0F FULLWIDTH SOLIDUS should normalize to / in a hostname UTF-8 percent encoding is %EF%BC%8F

public static string uFullwidthSolidus
Résultat	string

uIdnaSs public_oe static_oe property

IDNA2003/2008 Deviant - U+00DF normalizes to "ss" during IDNA2003's mapping phase, different from its IDNA2008 mapping. See http://www.unicode.org/reports/tr46/ UTF-8 percent encoding is %C3%9F

public static string uIdnaSs
Résultat	string

uMVS public_oe static_oe property

Mongolian Vowel Separator U+180E is invisible and has the whitespace property. UTF-8 percent encoding is %E1%A0%8E

public static string uMVS
Résultat	string

uNotACharacter public_oe static_oe property

The code point U+FFFF is guaranteed to not be a Unicode character at all UTF-8 percent encoding is %ef%bf%bf

public static string uNotACharacter
Résultat	string

uPrivate public_oe static_oe property

A Private Use Area code point U+F8FF which Apple happens to use for its logo. UTF-8 percent encoding is %EF%A3%BF

public static string uPrivate
Résultat	string

uRLO public_oe static_oe property

The Right to Left Override U+202E defines special meaning to re-order the display of text for right-to-left reading. UTF-8 percent encoding is %E2%80%AE

public static string uRLO
Résultat	string

uReservedCodePoint public_oe static_oe property

A reserved code point U+FEFE UTF-8 percent encoding is %ef%bb%be

public static string uReservedCodePoint
Résultat	string

uUnassigned public_oe static_oe property

An unassigned code point U+0FED UTF-8 percent encoding is %e0%bf%ad

public static string uUnassigned
Résultat	string

uWordJoiner public_oe static_oe property

Word Joiner U+2060 is an invisible zero-width character. UTF-8 percent encoding is %E2%81%A0

public static string uWordJoiner
Résultat	string