Classes | |
class | mapping |
Class the holds boundary mapping of the text that can be used with iterators. More... | |
class | token_iterator |
token_iterator is an iterator that returns text chunks between boundary positions More... | |
class | break_iterator |
break_iterator is bidirectional iterator that returns text boundary positions More... | |
Enumerations | |
enum | boundary_type { character, word, sentence, line } |
enum | word_type { word_none = 0x0000F, word_number = 0x000F0, word_letter = 0x00F00, word_kana = 0x0F000, word_ideo = 0xF0000, word_any = 0xFFFF0, word_letters = 0xFFF00, word_kana_ideo = 0xFF000, word_mask = 0xFFFFF } |
enum | line_break_type { line_soft = 0x0F, line_hard = 0xF0, line_any = 0xFF, line_mask = 0xFF } |
enum | sentence_break_type { sentence_term = 0x0F, sentence_sep = 0xF0, sentence_any = 0xFF, sentence_mask = 0xFF } |
enum | character_break_type { character_any = 0xF, character_mask = 0xF } |
Functions | |
unsigned | boundary_mask (boundary_type t) |