Skip to content

Conversation

@alexandre-jalgos
Copy link

  • Keeping term frequency data.table for all documents
  • Document term matrix computation from aforementioned data.table
  • weighting of term frequency based on previously learned weighting parameters docfreq and ndoc

Alexandre Tadros added 8 commits June 12, 2019 17:43
…x learned parameters, namely the document frequency and the dictionary. But it will be more flexible to implement this logic in the `TermDocumentMatrix.VCorpus` function to keep track of every term document frequency, and apply all filters and normalizations in prediction
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant