tokenizers.bpe
Byte Pair Encoding Text Tokenization
Unsupervised text tokenizer focused on computational efficiency. Wraps the 'YouTokenToMe' library https://github.com/VKCOM/YouTokenToMe which is an implementation of fast Byte Pair Encoding https://aclanthology.org/P16-1162/.
- Version0.1.3
- R versionunknown
- LicenseMPL-2.0
- Needs compilation?Yes
- Last release09/15/2023
Documentation
Team
Jan Wijffels
BNOSAC
Show author detailsRolesCopyright holderThe Abseil Authors
Show author detailsRolesContributor, Copyright holderGregory Popovitch
Show author detailsRolesContributor, Copyright holderVK.com
Show author detailsRolesCopyright holderIvan Belonogov
Show author detailsRolesContributor, Copyright holder
Insights
Last 30 days
The following line graph shows the downloads per day. You can hover over the graph to see the exact number of downloads per day.
Last 365 days
The following line graph shows the downloads per day. You can hover over the graph to see the exact number of downloads per day.
Data provided by CRAN
Binaries
Dependencies
- Imports1 package
- Linking To1 package
- Reverse Suggests3 packages