[mlpack] google summer of code [GSoC]

Daniel Bell idaniel at me.com
Tue Apr 9 23:04:59 EDT 2013


oh ok thank you, I guess at my current level it is just outside my reach.  I am in my first algorithm course.  could you recommend something similar that might be more in my current skill set?

Thank you,
Daniel


On 2013-04-09, at 8:59 PM, Ryan Curtin <gth671b at mail.gatech.edu> wrote:

> On Tue, Apr 09, 2013 at 08:16:43PM -0600, Daniel Bell wrote:
>> I was intrigued by your improvement of tree traversers suggestion, I
>> am currently in a course that has a component on trees and so did some
>> work with them.  I am interested in learning about and learning
>> machine learning however I have never programmed in C++.  Do you think
>> this is within my skill set that I have laid out?
> 
> The tree traversal project is a particularly difficult one which expects
> rigorous C++ knowledge.  This does not mean that you could not do it,
> but in slightly more detail:
> 
> Many of the algorithms in mlpack are fast because they are tree-based
> branch-and-bound algorithms.  If you don't know about those you can read
> them about them online; search for something like 'nearest neighbor' and
> 'kd-trees'.
> 
> In addition, there are many different possible types of trees and many
> different machine learning algorithms.  mlpack uses C++ templates to
> provide abstract traversals and algorithms which can work with any type
> of tree (assuming the user implements the tree with the proper methods).
> This is actually very neat and to my knowledge there is no other library
> which implements anything even remotely similar.
> 
> However, the functionality is still somewhat experimental -- which is
> why the tree abstractions haven't been advertised as a main selling
> point.  The branch and bound type algorithms to which I was referring to
> earlier depend heavily on tight bounds.  So improving the effectiveness
> of these tree traversals will depend on the implementation of the bounds
> in the algorithms and the efficiency of those bound calculations (as
> well as other calculations).
> 
> So the project would involve both analyzing the existing traversals for
> either algorithmic or implementational improvements as well as examining
> the existing algorithms.
> 
> I could talk about this all day, given that it is my primary research
> interest, but I'll spare the gory details for now.
> 
> tl;dr: if you're a second-year undergraduate without heavy algorithmic
> knowledge and you're interested in this project, you probably are best
> served by putting in 30 hours a week reading papers both on dual-tree
> algorithms and learning C++ inside-out (metaprogramming does happen in
> the mlpack world of trees).  So, I'm not going to say it's impossible,
> but there's a reason its difficulty is rated a 9/10...
> 
> -- 
> Ryan Curtin       | "...I still don't know what it means."
> ryan at igglybob.com |   - Rigby Reardon




More information about the mlpack mailing list