Blog Archives

OpenCL class & memory

Since I’ve been hitting the speed bottleneck for Radix a few days now, I don’t expect radix to improve. Organized openCL scrap into a simple class that includes rcl.init(…) & rcl.run(). Setups all the CL details and runs an included kernel. Next time I could allow parameters and code as generic inputs.

I’m reluctant rewriting the entire radix in C code, since the massive reliance on dynamic arrays is painful for C. If my school does openCL in C platform instead I’ll get to use C++ wrapper and dynamic data from STLs (hopefully works with CL?). The most gains now would be transform arithmetic-heavy, memory-lite lines into kernels.

float2int_2D(pts) range transform for float to int conversion:

pts 100k, pt_q 1000

t f2i pts_int_2D 17.2699999809 # Wow 1 looped line takes 30% of entire Radix!!
Read the rest of this entry