When repeating call LUonDevice() on large matrix such as ASIC680.mtx for benchmaring purpose, would causes illegal memory access.
Examing the code in numeric.cu found the variable tmpMem has not free at the end of file.
Soultion:
add cudaFree(tmpMem); at the end of numeric.cu.