C# Класс UniHax.Fuzzer

The Fuzzer has cases for some of the oddball manifestations of Unicode that can trip up software including: - non-character, reserved, and private use area code points - special meaning characters such as the BOM and RLO - ill-formed byte sequences - a half-surrogate code point

Показать файл Открыть проект

Открытые свойства

Свойство	Тип	Описание
u0390	string
u1D160	string
u1F82	string
uBOM	string
uBoldEight	string
uDAAD	string
uDEAD	string
uFB2C	string
uFDFA	string
uFullwidthSolidus	string
uIdnaSs	string
uMVS	string
uNotACharacter	string
uPrivate	string
uRLO	string
uReservedCodePoint	string
uUnassigned	string
uWordJoiner	string

Открытые методы

Метод	Описание
GetBom ( ) : string
GetCharacterBytes ( string encoding, string character ) : byte[]	Gets the requested byte representation of the current Unicode character codepoint
GetCharacterBytesMalformed ( string encoding, string character ) : byte[]	Malforms the bytes by removing the last byte from whichever encoding you specify.
OutOfRangeCodePointAsUtf32BE ( ) : byte[]	Return a UTF32 byte encoding for an illegal code point value U+1FFFFF. Note that Unicode 6.0 supports only up to U+10FFFF. UTF-8 percent encoding for something out of range is %F4%8F%BF%BE

Описание методов

GetBom() публичный Метод

public GetBom ( ) : string
Результат	string

GetCharacterBytes() публичный Метод

Gets the requested byte representation of the current Unicode character codepoint

public GetCharacterBytes ( string encoding, string character ) : byte[]
encoding	string	The encoding you want a byte representation in. Specify utf-8, utf-16le, or utf16-be
character	string	A single character sent as a string.
Результат	byte[]

GetCharacterBytesMalformed() публичный Метод

Malforms the bytes by removing the last byte from whichever encoding you specify.

public GetCharacterBytesMalformed ( string encoding, string character ) : byte[]
encoding	string	The encoding you want a byte representation in. Specify utf-8, utf-16le, or utf16-be
character	string	A single character sent as a string.
Результат	byte[]

OutOfRangeCodePointAsUtf32BE() публичный Метод

Return a UTF32 byte encoding for an illegal code point value U+1FFFFF. Note that Unicode 6.0 supports only up to U+10FFFF. UTF-8 percent encoding for something out of range is %F4%8F%BF%BE

public OutOfRangeCodePointAsUtf32BE ( ) : byte[]
Результат	byte[]

Описание свойств

u0390 публичное статическое свойство

U+0390 expands by 3x (UTF-8) under NFD UTF-8 percent encoding is %CE%90

public static string u0390
Результат	string

u1D160 публичное статическое свойство

U+1D160 expands by 3x (UTF-8) under NFC UTF-8 percent encoding is %F0%9D%85%A0

public static string u1D160
Результат	string

u1F82 публичное статическое свойство

U+1F82 expands by 4x (UTF-16) under NFD UTF-8 percent encoding is %E1%BE%82

public static string u1F82
Результат	string

uBOM публичное статическое свойство

The Byte Order Mark U+FEFF is a special character defining the byte order and endianess of text data. UTF-8 percent encoding is %EF%BB%BF

public static string uBOM
Результат	string

uBoldEight публичное статическое свойство

Code point with a numerical mapping and value U+1D7D6 MATHEMATICAL BOLD DIGIT EIGHT UTF-8 percent encoding is %F0%9D%9F%96

public static string uBoldEight
Результат	string

uDAAD публичное статическое свойство

An illegal high half-surrogate U+DAAD UTF-8 percent encoding is %ed%aa%ad

public static string uDAAD
Результат	string

uDEAD публичное статическое свойство

An illegal low half-surrogate U+DEAD UTF-8 percent encoding is %ed%ba%ad

public static string uDEAD
Результат	string

uFB2C публичное статическое свойство

U+FB2C expands by 3x (UTF-16) under NFC UTF-8 percent encoding is %EF%AC%AC

public static string uFB2C
Результат	string

uFDFA публичное статическое свойство

U+FDFD expands by 11x (UTF-8) and 18x (UTF-16) under NFKC/NFKC UTF-8 percent encoding is %EF%B7%BA

public static string uFDFA
Результат	string

uFullwidthSolidus публичное статическое свойство

U+FF0F FULLWIDTH SOLIDUS should normalize to / in a hostname UTF-8 percent encoding is %EF%BC%8F

public static string uFullwidthSolidus
Результат	string

uIdnaSs публичное статическое свойство

IDNA2003/2008 Deviant - U+00DF normalizes to "ss" during IDNA2003's mapping phase, different from its IDNA2008 mapping. See http://www.unicode.org/reports/tr46/ UTF-8 percent encoding is %C3%9F

public static string uIdnaSs
Результат	string

uMVS публичное статическое свойство

Mongolian Vowel Separator U+180E is invisible and has the whitespace property. UTF-8 percent encoding is %E1%A0%8E

public static string uMVS
Результат	string

uNotACharacter публичное статическое свойство

The code point U+FFFF is guaranteed to not be a Unicode character at all UTF-8 percent encoding is %ef%bf%bf

public static string uNotACharacter
Результат	string

uPrivate публичное статическое свойство

A Private Use Area code point U+F8FF which Apple happens to use for its logo. UTF-8 percent encoding is %EF%A3%BF

public static string uPrivate
Результат	string

uRLO публичное статическое свойство

The Right to Left Override U+202E defines special meaning to re-order the display of text for right-to-left reading. UTF-8 percent encoding is %E2%80%AE

public static string uRLO
Результат	string

uReservedCodePoint публичное статическое свойство

A reserved code point U+FEFE UTF-8 percent encoding is %ef%bb%be

public static string uReservedCodePoint
Результат	string

uUnassigned публичное статическое свойство

An unassigned code point U+0FED UTF-8 percent encoding is %e0%bf%ad

public static string uUnassigned
Результат	string

uWordJoiner публичное статическое свойство

Word Joiner U+2060 is an invisible zero-width character. UTF-8 percent encoding is %E2%81%A0

public static string uWordJoiner
Результат	string