VCSparse: Generate fused and vectorizable kernels for sparse or dense tensor programs in Python on CPUs (via Numba) and GPUs (via APPy).