ml-generator
Creates a data source for Amazon Machine Learning from Elasticsearch.
KunihikoKido 311 1 0.2.1 MIT GitHub

ml-generator package

You can use it to create a multiclass classification data source for context-based categorization.

Commands

  • Ml Generator: Update Statistics Target Terms
  • Ml Generator: Create

Tips

How to randomize the contents of the file

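By default, Amazon Machine Learning uses 70% of the data for training and the remaining 30% to evaluate the model. Shuffling the rows first, while keeping the header line in place, helps keep both parts representative.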

  $ (head -n +1 sample.csv && tail -n +2 sample.csv | sort -R) > sorted_sample.csv

Equivalent of GNU sort -R on OS X

See http://superuser.com/questions/334450/equivalent-of-gnu-sort-r-on-osx
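
Where sort -R is not available, one portable alternative is to prefix each row with a random key, sort on that key, and strip it again. The sketch below is only an illustration and is not part of ml-generator: the function name shuffle_csv is hypothetical, and it assumes that no CSV field contains an embedded newline.

```shell
# Hypothetical helper (not part of ml-generator): shuffle the data rows
# of a CSV file while keeping the header line first.
# Assumes no field contains an embedded newline.
shuffle_csv() {
  input=$1
  output=$2
  {
    # Copy the header line unchanged.
    head -n 1 "$input"
    # Prefix each data row with a random key, sort by it, then drop the key.
    tail -n +2 "$input" |
      awk 'BEGIN { srand() } { printf "%.10f\t%s\n", rand(), $0 }' |
      LC_ALL=C sort -k1,1n |
      cut -f2-
  } > "$output"
}
```

For example, `shuffle_csv sample.csv sorted_sample.csv` writes the same file as the command above. Because awk seeds srand() from the current time, two runs within the same second may produce the same order.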