It is designed to work with text written in Bahasa Malaysia. The package provides functions and datasets that make working with Bahasa Malaysia text easier. For word stemming in particular, it looks up Malay words in a dictionary and then removes the "extra suffix", as explained in Khan, Rehman Ullah, Fitri Suraya Mohamad, Muh Inam UlHaq, Shahren Ahmad Zadi Adruce, Philip Nuli Anding, Sajjad Nawaz Khan, and Abdulrazak Yahya Saleh Al-Hababi (2017) <https://ijrest.net/vol-4-issue-12.html>. The package includes a dictionary of Malay words that may be used for word stemming, a dataset of Malay stop words, a dataset of sentiment words, and a dataset of normalized words.
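The dictionary-then-suffix idea described above can be illustrated with a minimal base R sketch. It is not the package's implementation: `toy_dictionary`, `toy_suffixes`, and `stem_sketch()` are hypothetical names invented for this example, the word list is a tiny stand-in for the bundled dictionary, and the rule of accepting a stripped form only when it appears in the dictionary is an assumption made here.

```r
# Hypothetical toy data, for illustration only; not the package's dictionary.
toy_dictionary <- c("banyak", "makan", "baik")
toy_suffixes <- c("nya", "kan", "an", "lah")

# Hypothetical helper: dictionary lookup first, suffix removal second.
stem_sketch <- function(word,
                        dictionary = toy_dictionary,
                        suffixes = toy_suffixes) {
  word <- tolower(word)

  # Step 1: a word already in the dictionary is returned unchanged.
  if (word %in% dictionary) {
    return(word)
  }

  # Step 2: try each suffix, longest first, and accept the stripped
  # form only if it is a dictionary word (an assumption of this sketch).
  for (s in suffixes[order(-nchar(suffixes))]) {
    if (endsWith(word, s) && nchar(word) > nchar(s)) {
      candidate <- substr(word, 1, nchar(word) - nchar(s))
      if (candidate %in% dictionary) {
        return(candidate)
      }
    }
  }

  # Fallback: no rule applied, so return the word as given.
  word
}

words <- c("banyaknya", "makan", "baiklah", "sekolah")
stems <- vapply(words, stem_sketch, character(1), USE.NAMES = FALSE)
print(stems)
```

Checking the dictionary before stripping keeps words such as "makan" intact even though they end in a suffix-like string, and words that match no rule, such as "sekolah" here, pass through unchanged.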
Version: | 0.1.3 |
Depends: | R (≥ 2.10) |
Imports: | dplyr, magrittr, rlang, stringr |
Suggests: | rmarkdown, knitr, testthat (≥ 3.0.0) |
Published: | 2023-01-17 |
DOI: | 10.32614/CRAN.package.malaytextr |
Author: | Zahier Nasrudin [aut, cre] |
Maintainer: | Zahier Nasrudin <zahiernasrudin at gmail.com> |
BugReports: | https://github.com/zahiernasrudin/malaytextr/issues |
License: | MIT + file LICENSE |
URL: | https://github.com/zahiernasrudin/malaytextr |
NeedsCompilation: | no |
Materials: | README NEWS |
CRAN checks: | malaytextr results |
Reference manual: | malaytextr.pdf |
Vignettes: | malaytextr |
Package source: | malaytextr_0.1.3.tar.gz |
Windows binaries: | r-devel: malaytextr_0.1.3.zip, r-release: malaytextr_0.1.3.zip, r-oldrel: malaytextr_0.1.3.zip |
macOS binaries: | r-release (arm64): malaytextr_0.1.3.tgz, r-oldrel (arm64): malaytextr_0.1.3.tgz, r-release (x86_64): malaytextr_0.1.3.tgz, r-oldrel (x86_64): malaytextr_0.1.3.tgz |
Old sources: | malaytextr archive |
Please use the canonical form https://CRAN.R-project.org/package=malaytextr to link to this page.