Parse tokenized input text

form(x, mode, type, pos = TRUE, ...)

Arguments

x

Input text vectors

mode

Select split mode (A, B, C)

type

return form. One of the following "surface", "dictionary", "normalized", "reading" or "part_of_speech".

pos

Include part of speech information with object name.

...

path to tokenizer argument.

Examples

if (FALSE) { form("Tokyo", mode = "B", type = "normalized") form("Osaka", mode = "B", type = "surface") form("Hokkaido", mode = "C", type = "part_of_speech") }