Enabling Efficient Hybrid Systolic Computation in Shared L1-Memory Manycore Clusters (Full Report)