malaytextr: Text Mining for Bahasa Malaysia

It is designed to work with text written in Bahasa Malaysia. We provide functions and data sets that will make working with Bahasa Malaysia text much easier. For word stemming in particular, we will look up the Malay words in a dictionary and then proceed to remove "extra suffix" as explained in Khan, Rehman Ullah, Fitri Suraya Mohamad, Muh Inam UlHaq, Shahren Ahmad Zadi Adruce, Philip Nuli Anding, Sajjad Nawaz Khan, and Abdulrazak Yahya Saleh Al-Hababi (2017) <https://ijrest.net/vol-4-issue-12.html> . This package includes a dictionary of Malay words that may be used to perform word stemming, a dataset of Malay stop words, a dataset of sentiment words and a dataset of normalized words.

Version: 0.1.3
Depends: R (≥ 2.10)
Imports: dplyr, magrittr, rlang, stringr
Suggests: rmarkdown, knitr, testthat (≥ 3.0.0)
Published: 2023-01-17
DOI: 10.32614/CRAN.package.malaytextr
Author: Zahier Nasrudin ORCID iD [aut, cre]
Maintainer: Zahier Nasrudin <zahiernasrudin at gmail.com>
BugReports: https://github.com/zahiernasrudin/malaytextr/issues
License: MIT + file LICENSE
URL: https://github.com/zahiernasrudin/malaytextr
NeedsCompilation: no
Materials: README NEWS
CRAN checks: malaytextr results

Documentation:

Reference manual: malaytextr.pdf
Vignettes: malaytextr

Downloads:

Package source: malaytextr_0.1.3.tar.gz
Windows binaries: r-devel: malaytextr_0.1.3.zip, r-release: malaytextr_0.1.3.zip, r-oldrel: malaytextr_0.1.3.zip
macOS binaries: r-release (arm64): malaytextr_0.1.3.tgz, r-oldrel (arm64): malaytextr_0.1.3.tgz, r-release (x86_64): malaytextr_0.1.3.tgz, r-oldrel (x86_64): malaytextr_0.1.3.tgz
Old sources: malaytextr archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=malaytextr to link to this page.