R/tokenizer.R
tokenize_to_df.Rd
Tokenize text into a data.frame using Sudachi
tokenize_to_df(x, mode, instance = NULL)
x: Input text vector.
mode: Split mode; one of "A", "B", or "C".
instance: Optional. An existing <sudachipy.tokenizer.Tokenizer> instance. Supplying a predefined instance speeds up execution.
<sudachipy.tokenizer.Tokenizer>
if (FALSE) {
  tokenize_to_df("Tokyo, Japan", mode = "A")
}
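The instance argument is meant for reusing one tokenizer across calls. The following is a minimal sketch of that pattern, not the package's documented workflow. It assumes the sudachir package is installed, and that the Python module sudachipy (with a dictionary) is reachable through reticulate. Creating the instance with sudachipy's Dictionary()$create() is an assumption; sudachir may provide its own helper for this. How mode interacts with a supplied instance is not stated in this page.

```r
# Sketch only: requires the sudachir R package plus the Python module
# sudachipy (with a dictionary) reachable through reticulate.
library(sudachir)

# Assumption: the instance is built directly with sudachipy's
# Dictionary().create(); sudachir may offer its own helper instead.
sudachipy <- reticulate::import("sudachipy")
tok <- sudachipy$Dictionary()$create()

texts <- c("Tokyo, Japan", "Osaka, Japan")

# Pass the same instance on every call so it is created only once.
results <- lapply(texts, function(txt) {
  tokenize_to_df(txt, mode = "A", instance = tok)
})
```

Building the tokenizer once and passing it in avoids repeating the dictionary setup for every input, which is the speed-up the instance argument describes.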