C# Class UniHax.Fuzzer

The Fuzzer has cases for some of the oddball manifestations of Unicode that can trip up software, including:
- non-character, reserved, and private use area code points
- special-meaning characters such as the BOM and RLO
- ill-formed byte sequences
- a half-surrogate code point

Public Properties

Property Type Description
u0390 string
u1D160 string
u1F82 string
uBOM string
uBoldEight string
uDAAD string
uDEAD string
uFB2C string
uFDFA string
uFullwidthSolidus string
uIdnaSs string
uMVS string
uNotACharacter string
uPrivate string
uRLO string
uReservedCodePoint string
uUnassigned string
uWordJoiner string

Public Methods

Method Description
GetBom ( ) : string
GetCharacterBytes ( string encoding, string character ) : byte[]

Gets the requested byte representation of the supplied Unicode character.

GetCharacterBytesMalformed ( string encoding, string character ) : byte[]

Produces an ill-formed sequence by removing the last byte of the character's representation in the encoding you specify.

OutOfRangeCodePointAsUtf32BE ( ) : byte[]

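Returns a UTF-32BE byte encoding of the illegal code point value U+1FFFFF. Note that Unicode 6.0 defines code points only up to U+10FFFF. The UTF-8 percent encoding quoted for something out of range is %F4%8F%BF%BE; that sequence actually decodes to U+10FFFE, a noncharacter that is still in range, whereas the first value past the limit, U+110000, would be %F4%90%80%80.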

Method Details

GetBom() public method

public GetBom ( ) : string
Returns string

GetCharacterBytes() public method

Gets the requested byte representation of the supplied Unicode character.
public GetCharacterBytes ( string encoding, string character ) : byte[]
encoding string The encoding in which you want the byte representation. Specify utf-8, utf-16le, or utf16-be.
character string A single character, passed as a string.
Returns byte[]
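
As a rough illustration of what "byte representation per encoding" means, the sketch below uses the standard System.Text encoders. EncodingSketch and ToBytes are hypothetical names and not part of UniHax; the set of labels the real method accepts is an assumption, so both spellings of the big-endian label are handled here.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration only; not the UniHax implementation.
public static class EncodingSketch
{
    // Maps an encoding label to a .NET encoder and encodes the input string.
    // GetBytes never prepends a byte order mark, so only the character's bytes are returned.
    public static byte[] ToBytes(string encoding, string character)
    {
        switch (encoding.ToLowerInvariant())
        {
            case "utf-8":
                return new UTF8Encoding(false).GetBytes(character);
            case "utf-16le":
                return new UnicodeEncoding(false, false).GetBytes(character);
            case "utf-16be":
            case "utf16-be": // spelling used in the parameter description (assumed alias)
                return new UnicodeEncoding(true, false).GetBytes(character);
            default:
                throw new ArgumentException("Unsupported encoding: " + encoding);
        }
    }
}

public static class Demo
{
    public static void Main()
    {
        // U+FEFF in each of the three encodings.
        Console.WriteLine(BitConverter.ToString(EncodingSketch.ToBytes("utf-8", "\uFEFF")));    // EF-BB-BF
        Console.WriteLine(BitConverter.ToString(EncodingSketch.ToBytes("utf-16le", "\uFEFF"))); // FF-FE
        Console.WriteLine(BitConverter.ToString(EncodingSketch.ToBytes("utf-16be", "\uFEFF"))); // FE-FF
    }
}
```

The same code point yields different byte sequences in each encoding, which is why a fuzzer needs to know the target encoding before it can emit a test case.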

GetCharacterBytesMalformed() public method

Produces an ill-formed sequence by removing the last byte of the character's representation in the encoding you specify.
public GetCharacterBytesMalformed ( string encoding, string character ) : byte[]
encoding string The encoding in which you want the byte representation. Specify utf-8, utf-16le, or utf16-be.
character string A single character, passed as a string.
Returns byte[]
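
To make the effect of dropping the last byte concrete, here is a minimal sketch for the UTF-8 case. MalformSketch and TruncatedUtf8 are hypothetical names, and the truncation logic is an assumption based on the description above, not the UniHax source.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration only; not the UniHax implementation.
public static class MalformSketch
{
    // Encodes the input as UTF-8 and drops the final byte,
    // leaving a truncated (ill-formed) multi-byte sequence.
    public static byte[] TruncatedUtf8(string character)
    {
        byte[] full = new UTF8Encoding(false).GetBytes(character);
        if (full.Length == 0) return full;
        byte[] cut = new byte[full.Length - 1];
        Array.Copy(full, cut, cut.Length);
        return cut;
    }
}

public static class Demo
{
    public static void Main()
    {
        // U+202E is E2-80-AE in UTF-8; the last byte is removed.
        byte[] bad = MalformSketch.TruncatedUtf8("\u202E");
        Console.WriteLine(BitConverter.ToString(bad)); // E2-80

        // A lenient decoder substitutes U+FFFD for the incomplete sequence.
        string decoded = Encoding.UTF8.GetString(bad);
        Console.WriteLine(decoded.Contains("\uFFFD")); // True

        // A strict decoder (throwOnInvalidBytes: true) rejects it instead.
        try
        {
            new UTF8Encoding(false, true).GetString(bad);
        }
        catch (DecoderFallbackException)
        {
            Console.WriteLine("rejected");
        }
    }
}
```

Whether the software under test substitutes, drops, or rejects the truncated sequence is exactly the behavior such a test case is meant to expose.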

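OutOfRangeCodePointAsUtf32BE() public method

Returns a UTF-32BE byte encoding of the illegal code point value U+1FFFFF. Note that Unicode 6.0 defines code points only up to U+10FFFF. The UTF-8 percent encoding quoted for something out of range is %F4%8F%BF%BE; that sequence actually decodes to U+10FFFE, a noncharacter that is still in range, whereas the first value past the limit, U+110000, would be %F4%90%80%80.
public OutOfRangeCodePointAsUtf32BE ( ) : byte[]
Returns byte[]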
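
The following sketch shows one way such bytes could be produced: the value is written as four big-endian bytes with no range check. OutOfRangeSketch and ToUtf32BE are hypothetical names, and this is an assumption about the approach rather than the UniHax code.

```csharp
using System;

// Hypothetical helper for illustration only; not the UniHax implementation.
public static class OutOfRangeSketch
{
    // Writes a 32-bit value as four big-endian bytes without validating it,
    // so values above U+10FFFF pass through unchanged.
    public static byte[] ToUtf32BE(uint value)
    {
        return new byte[]
        {
            (byte)((value >> 24) & 0xFF),
            (byte)((value >> 16) & 0xFF),
            (byte)((value >> 8) & 0xFF),
            (byte)(value & 0xFF)
        };
    }
}

public static class Demo
{
    public static void Main()
    {
        Console.WriteLine(BitConverter.ToString(OutOfRangeSketch.ToUtf32BE(0x1FFFFF))); // 00-1F-FF-FF

        // The framework's own conversion refuses values above U+10FFFF.
        try
        {
            char.ConvertFromUtf32(0x1FFFFF);
        }
        catch (ArgumentOutOfRangeException)
        {
            Console.WriteLine("out of range");
        }
    }
}
```

Because a .NET string cannot hold such a value, a byte array is the natural return type for this case.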

Property Details

u0390 public static property

U+0390 expands by 3x (UTF-8) under NFD. Its UTF-8 percent encoding is %CE%90.
public static string u0390
Returns string

u1D160 public static property

U+1D160 expands by 3x (UTF-8) under NFC. Its UTF-8 percent encoding is %F0%9D%85%A0.
public static string u1D160
Returns string

u1F82 public static property

U+1F82 expands by 4x (UTF-16) under NFD. Its UTF-8 percent encoding is %E1%BE%82.
public static string u1F82
Returns string

uBOM public static property

The Byte Order Mark U+FEFF is a special character that signals the byte order (endianness) of text data. Its UTF-8 percent encoding is %EF%BB%BF.
public static string uBOM
Returns string
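
The percent encodings quoted throughout this page can be reproduced for any well-formed character with a few lines of code. The sketch below is illustrative; PercentSketch and PercentEncodeUtf8 are hypothetical names, not UniHax members.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration only; not part of UniHax.
public static class PercentSketch
{
    // Encodes the string as UTF-8 and writes every byte as %XX (uppercase hex).
    public static string PercentEncodeUtf8(string s)
    {
        var sb = new StringBuilder();
        foreach (byte b in new UTF8Encoding(false).GetBytes(s))
        {
            sb.Append('%').Append(b.ToString("X2"));
        }
        return sb.ToString();
    }
}

public static class Demo
{
    public static void Main()
    {
        Console.WriteLine(PercentSketch.PercentEncodeUtf8("\uFEFF")); // %EF%BB%BF
        // Supplementary code points are built with ConvertFromUtf32.
        Console.WriteLine(PercentSketch.PercentEncodeUtf8(char.ConvertFromUtf32(0x1D7D6))); // %F0%9D%9F%96
    }
}
```

This only works for well-formed input: the standard encoder replaces lone surrogates with U+FFFD, so the half-surrogate cases need their bytes constructed by hand.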

uBoldEight public static property

A code point with a numeric mapping and value: U+1D7D6 MATHEMATICAL BOLD DIGIT EIGHT. Its UTF-8 percent encoding is %F0%9D%9F%96.
public static string uBoldEight
Returns string

uDAAD public static property

An illegal high half-surrogate, U+DAAD. Its UTF-8 percent encoding is %ed%aa%ad.
public static string uDAAD
Returns string

uDEAD public static property

An illegal low half-surrogate, U+DEAD. Its UTF-8 percent encoding is %ed%ba%ad.
public static string uDEAD
Returns string
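
A conforming UTF-8 encoder will not emit the byte sequences quoted for the two half-surrogates, so they have to be built by applying the three-byte bit pattern directly. The sketch below shows the arithmetic; SurrogateSketch and RawThreeByteUtf8 are hypothetical names and not UniHax members.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration only; not part of UniHax.
public static class SurrogateSketch
{
    // Applies the three-byte UTF-8 bit pattern (1110xxxx 10xxxxxx 10xxxxxx)
    // to a single UTF-16 code unit without rejecting surrogates.
    // Only meaningful for values in the range U+0800..U+FFFF.
    public static byte[] RawThreeByteUtf8(char unit)
    {
        int v = unit;
        return new byte[]
        {
            (byte)(0xE0 | (v >> 12)),
            (byte)(0x80 | ((v >> 6) & 0x3F)),
            (byte)(0x80 | (v & 0x3F))
        };
    }
}

public static class Demo
{
    public static void Main()
    {
        Console.WriteLine(BitConverter.ToString(SurrogateSketch.RawThreeByteUtf8('\uDAAD'))); // ED-AA-AD
        Console.WriteLine(BitConverter.ToString(SurrogateSketch.RawThreeByteUtf8('\uDEAD'))); // ED-BA-AD

        // The standard encoder substitutes U+FFFD for a lone surrogate.
        Console.WriteLine(BitConverter.ToString(Encoding.UTF8.GetBytes("\uDAAD"))); // EF-BF-BD
    }
}
```

The contrast between the hand-built bytes and the encoder's U+FFFD substitution is the point of the test case: a decoder that accepts ED-AA-AD is more permissive than the standard allows.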

uFB2C public static property

U+FB2C expands by 3x (UTF-16) under NFC. Its UTF-8 percent encoding is %EF%AC%AC.
public static string uFB2C
Returns string

uFDFA public static property

U+FDFA expands by 11x (UTF-8) and 18x (UTF-16) under NFKC/NFKD. Its UTF-8 percent encoding is %EF%B7%BA.
public static string uFDFA
Returns string
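
The expansion factors quoted for the normalization cases can be checked with string.Normalize. This is a small sketch under the assumption that the runtime has full Unicode normalization data available (results may differ when globalization support is restricted); ExpansionSketch and Utf8Growth are hypothetical names, not UniHax members.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration only; not part of UniHax.
public static class ExpansionSketch
{
    // Ratio of the UTF-8 byte count after normalization to the byte count before.
    public static double Utf8Growth(string s, NormalizationForm form)
    {
        var utf8 = new UTF8Encoding(false);
        return (double)utf8.GetByteCount(s.Normalize(form)) / utf8.GetByteCount(s);
    }
}

public static class Demo
{
    public static void Main()
    {
        // U+0390 under NFD: 2 bytes become 6.
        Console.WriteLine(ExpansionSketch.Utf8Growth("\u0390", NormalizationForm.FormD));
        // U+FDFA under NFKC: 3 bytes become 33.
        Console.WriteLine(ExpansionSketch.Utf8Growth("\uFDFA", NormalizationForm.FormKC));
        // U+1F82 under NFD: 1 UTF-16 code unit becomes 4.
        Console.WriteLine("\u1F82".Normalize(NormalizationForm.FormD).Length);
    }
}
```

Expansion like this matters wherever a buffer is sized from the length of the input before normalization.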

uFullwidthSolidus public static property

U+FF0F FULLWIDTH SOLIDUS, which should normalize to "/" (U+002F) in a hostname. Its UTF-8 percent encoding is %EF%BC%8F.
public static string uFullwidthSolidus
Returns string

uIdnaSs public static property

An IDNA2003/IDNA2008 deviation character: U+00DF is mapped to "ss" during IDNA2003's mapping phase, whereas IDNA2008 treats it as a valid character and leaves it unchanged. See http://www.unicode.org/reports/tr46/ for details. Its UTF-8 percent encoding is %C3%9F.
public static string uIdnaSs
Returns string

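uMVS public static property

Mongolian Vowel Separator U+180E is invisible. It had the White_Space property through Unicode 6.2; since Unicode 6.3 it is classified as a format character (Cf), so whitespace handling of it varies between implementations. Its UTF-8 percent encoding is %E1%A0%8E.
public static string uMVS
Returns string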

uNotACharacter public static property

The code point U+FFFF is a noncharacter: it is guaranteed never to be a Unicode character. Its UTF-8 percent encoding is %ef%bf%bf.
public static string uNotACharacter
Returns string

uPrivate public static property

A Private Use Area code point, U+F8FF, which Apple happens to use for its logo. Its UTF-8 percent encoding is %EF%A3%BF.
public static string uPrivate
Returns string

uRLO public static property

The Right-to-Left Override U+202E is a bidirectional control character that forces the text following it to be displayed right-to-left. Its UTF-8 percent encoding is %E2%80%AE.
public static string uRLO
Returns string

uReservedCodePoint public static property

A reserved code point, U+FEFE. Its UTF-8 percent encoding is %ef%bb%be.
public static string uReservedCodePoint
Returns string

uUnassigned public static property

An unassigned code point, U+0FED. Its UTF-8 percent encoding is %e0%bf%ad.
public static string uUnassigned
Returns string

uWordJoiner public static property

Word Joiner U+2060 is an invisible zero-width character. Its UTF-8 percent encoding is %E2%81%A0.
public static string uWordJoiner
Returns string