tokenizers
Software:28295
swMATH: 16425, CRAN: tokenizers, MaRDI QID: Q28295
Fast, Consistent Tokenization of Natural Language Text
Last update: 22 December 2022
Copyright license: MIT license, File License
Software version identifier: 0.3.0
Source code repository: https://github.com/cran/tokenizers
Related Items (13)
DOLDA: a regularized supervised topic model for high-dimensional multi-class regression ⋮ textrecipes ⋮ deeplr ⋮ DramaAnalysis ⋮ rslp ⋮ wactor ⋮ textfeatures ⋮ tidypmc ⋮ proustr ⋮ pdfsearch ⋮ covfefe ⋮ WhatsR ⋮ tidytext