mlpack blog
Dataset and Experimentation Tools - Week 7
This week, I:
DatasetMapper & Imputer
1) Applied the suggested changes, added more comments, and debugged the DatasetMapper & Imputer pull request.
2) Added an overload to every imputation method that takes a single input matrix as its parameter. The result overwrites the input matrix, which should be faster because no separate output matrix is needed.
3) MedianImputation now excludes user-defined missing values and NaNs when it calculates the median.
4) Switched to a new approach for implementing ListwiseDeletion, suggested by rcurtin.
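To illustrate items 2 and 3, here is a minimal sketch of an in-place median imputation that skips NaNs and a user-defined missing value. It is not the code from the pull request: the function name `MedianImputeInPlace` is hypothetical, and it works on a single `std::vector<double>` column instead of an Armadillo matrix so that it stays self-contained.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper, not mlpack's actual API: replaces every missing entry
// of `column` with the median of the remaining entries, writing the result
// back into `column` itself (no separate output container).
// An entry counts as missing if it is NaN or equals `missingValue`.
void MedianImputeInPlace(std::vector<double>& column, const double missingValue)
{
  // Treat both NaN and the user-defined marker as missing.
  const auto isMissing = [missingValue](const double v)
  {
    return std::isnan(v) || v == missingValue;
  };

  // Collect only the observed values; missing ones must not affect the median.
  std::vector<double> observed;
  for (const double v : column)
  {
    if (!isMissing(v))
      observed.push_back(v);
  }

  // If every entry is missing there is nothing to impute from.
  if (observed.empty())
    return;

  std::sort(observed.begin(), observed.end());
  const std::size_t n = observed.size();
  const double median = (n % 2 == 1)
      ? observed[n / 2]
      : (observed[n / 2 - 1] + observed[n / 2]) / 2.0;

  // Overwrite the missing entries directly in the input.
  for (double& v : column)
  {
    if (isMissing(v))
      v = median;
  }
}
```

Calling it on a column such as `{1.0, NaN, 3.0, -1.0, 10.0}` with `-1.0` as the missing value computes the median from `1.0`, `3.0`, and `10.0` only, then fills both missing slots with that value.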
Descriptive Statistics
Last week, I said I would work on the statistics module. As a result, I made a proof of concept in this commit.
I made a class called Statistics and put all the functions inside it. I think the Statistics class may be useful for other things too, so I am considering separating it from the executable and putting it somewhere else as an independent component.
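As a rough idea of what such a standalone class could look like, here is a small sketch. The member functions (`Mean`, `Variance`, `Min`, `Max`) and the use of `std::vector<double>` are assumptions made for illustration; the class in the commit may expose different functions and operate on Armadillo types.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical sketch of a reusable descriptive statistics class.
// Every function is static, so the class can be used from an executable
// or from other code without holding any state.
class Statistics
{
 public:
  // Arithmetic mean of a non-empty sample.
  static double Mean(const std::vector<double>& x)
  {
    assert(!x.empty());
    double sum = 0.0;
    for (const double v : x)
      sum += v;
    return sum / static_cast<double>(x.size());
  }

  // Sample variance (divides by n - 1); needs at least two values.
  static double Variance(const std::vector<double>& x)
  {
    assert(x.size() > 1);
    const double mean = Mean(x);
    double sumSquares = 0.0;
    for (const double v : x)
      sumSquares += (v - mean) * (v - mean);
    return sumSquares / static_cast<double>(x.size() - 1);
  }

  // Smallest value of a non-empty sample.
  static double Min(const std::vector<double>& x)
  {
    assert(!x.empty());
    return *std::min_element(x.begin(), x.end());
  }

  // Largest value of a non-empty sample.
  static double Max(const std::vector<double>& x)
  {
    assert(!x.empty());
    return *std::max_element(x.begin(), x.end());
  }
};
```

Because nothing here depends on the executable, a class shaped like this could be moved to a shared location and called per dimension of a dataset, for example `Statistics::Mean(column)`.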
A sample run on iris.csv produces the results shown below.
The output of this executable is similar to that of this application.