Using tidy data principles can make many text mining tasks easier, more effective, and consistent with tools already in wide use. Much of the infrastructure needed for text mining with tidy data frames already exists in packages like 'dplyr', 'broom', 'tidyr', and 'ggplot2'. In this package, we provide functions and supporting data sets to allow conversion of text to and from tidy formats, and to switch seamlessly between tidy tools and existing text mining packages.
Version: | 0.4.2 |
Depends: | R (≥ 2.10) |
Imports: | cli, dplyr (≥ 1.1.1), generics, janeaustenr, lifecycle, Matrix, methods, purrr (≥ 0.1.1), rlang (≥ 0.4.10), stringr, tibble, tokenizers, vctrs |
Suggests: | broom, covr, data.table, ggplot2, hunspell, knitr, mallet, NLP, quanteda, readr, reshape2, rmarkdown, scales, stm, stopwords, testthat (≥ 2.1.0), textdata, tidyr, tm, topicmodels, vdiffr, wordcloud |
Published: | 2024-04-10 |
DOI: | 10.32614/CRAN.package.tidytext |
Author: | Gabriela De Queiroz [ctb], Colin Fay [ctb], Emil Hvitfeldt [ctb], Os Keyes [ctb], Kanishka Misra [ctb], Tim Mastny [ctb], Jeff Erickson [ctb], David Robinson [aut], Julia Silge [aut, cre] |
Maintainer: | Julia Silge <julia.silge at gmail.com> |
BugReports: | https://github.com/juliasilge/tidytext/issues |
License: | MIT + file LICENSE |
URL: | https://juliasilge.github.io/tidytext/, https://github.com/juliasilge/tidytext |
NeedsCompilation: | no |
Citation: | tidytext citation info |
Materials: | README NEWS |
In views: | NaturalLanguageProcessing |
CRAN checks: | tidytext results |
Reference manual: | tidytext.pdf |
Vignettes: |
Tidy Term Frequency and Inverse Document Frequency (tf-idf) Converting to and from Document-Term Matrix and Corpus objects Introduction to tidytext |
Package source: | tidytext_0.4.2.tar.gz |
Windows binaries: | r-devel: tidytext_0.4.2.zip, r-release: tidytext_0.4.2.zip, r-oldrel: tidytext_0.4.2.zip |
macOS binaries: | r-release (arm64): tidytext_0.4.2.tgz, r-oldrel (arm64): tidytext_0.4.2.tgz, r-release (x86_64): tidytext_0.4.2.tgz, r-oldrel (x86_64): tidytext_0.4.2.tgz |
Old sources: | tidytext archive |
Reverse imports: | akc, AnimalSequences, available, bibliometrix, CINE, crsra, dail, DistatisR, DOPE, epitweetr, ggpage, Goodreader, GSEAmining, iheiddown, LDAShiny, MadanText, MadanTextNetwork, madshapR, margaret, miaViz, miRetrieve, moodleR, naturaList, NIMAA, opitools, oRus, saotd, scicomptools, sherlock, statquotes, texter, TextForecast, TextMiningGUI, tidylda, tsentiment, TwitterAutomatedTrading, ulex, upstartr, vivainsights, WeatherSentiment, weed, widyr, wpa |
Reverse suggests: | bugphyzz, eurlex, funrar, gutenbergr, ibawds, LexisNexisTools, MetMashR, mvrsquared, newsanchor, openintro, polmineR, rfars, schrute, smartid, spacyr, spRingsteen, textmineR, tidylo, tidypmc, Twitmo, vosonSML |
Reverse enhances: | quanteda |
Please use the canonical form https://CRAN.R-project.org/package=tidytext to link to this page.