Currently the im2col operator only has a C implementation that uses the VpuSim API. We'd like an ASM implementation.