Previous: DTATrans: Leveraging Dynamic Token-based Quantization with Accuracy Compensation Mechanism for Efficient Transformer Architecture
Next: SME: ReRAM-based sparse-multiplication-engine to squeeze-out bit sparsity of neural network