RTextTools: A Supervised Learning Package for Text Classification - Sciences Po Accéder directement au contenu
Article Dans Une Revue The R Journal Année : 2013

RTextTools: A Supervised Learning Package for Text Classification

Timothy P. Jurka
  • Fonction : Auteur
Loren Collingwood
  • Fonction : Auteur
Amber E. Boydstun
  • Fonction : Auteur
van Atteveldt Wouter
  • Fonction : Auteur

Résumé

Social scientists have long hand-labeled texts to create datasets useful for studying topics from congressional policymaking to media reporting. Many social scientists have begun to incorporate machine learning into their toolkits. RTextTools was designed to make machine learning accessible by providing a start-to-finish product in less than 10 steps. After installing RTextTools, the initial step is to generate a document term matrix. Second, a container object is created, which holds all the objects needed for further analysis. Third, users can use up to nine algorithms to train their data. Fourth, the data are classified. Fifth, the classification is summarized. Sixth, functions are available for performance evaluation. Seventh, ensemble agreement is conducted. Eighth, users can cross-validate their data. Finally, users write their data to a spreadsheet, allowing for further manual coding if required.
Fichier principal
Vignette du fichier
collingwood-jurka-boydstun-etal.pdf (135.87 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-02186524 , version 1 (17-07-2019)

Licence

Paternité

Identifiants

Citer

Timothy P. Jurka, Loren Collingwood, Amber E. Boydstun, Emiliano Grossman, van Atteveldt Wouter. RTextTools: A Supervised Learning Package for Text Classification. The R Journal, 2013, 5 (1), pp.6-12. ⟨10.32614/rj-2013-001⟩. ⟨hal-02186524⟩
215 Consultations
278 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More