06-05-2014, 08:00 PM
Okay I'll write a section on that. Also I've started prototyping the pure C++ GPU rewriting system and it has some major disadvantages. You can't chain computations across functions without going back to main memory (expensive) or using c++14 features. I'm going to write a post on the pros and cons of the various approaches I've come up with so far.